In this video from the OpenFabrics International Developer Workshop 2014, Tashneem Maistry from Pivotal presents: The Future of Unstructured Data Workloads.
As the latest installment of the Big Data Use Case series here on insideBIGDATA, we offer a compelling presentation by our friend Jeremy Carroll, Operations Engineer at Pinterest. Jeremy talks about how Pinterest uses HBase at massive scale.
“A common thread for many of this year’s new IBM Fellows is their commitment to developing solutions and practical applications in the field of Big Data and Analytics. IBM is a leader in the space – with 1500 Big Data and Analytics-related patents in 2013 alone, and $24 billion in investments since 2005 through both acquisitions and R&D – and these fellows maintain the drumbeat of momentum that has made IBM number one in Big Data market share for the second year running.”
If you’ve ever spent valuable billable hours thinking about an algorithm to seek out the optimal cheeseburger and calculate metrics like the maximal meat-to-bun ratio, then this presentation by noted data scientist Hilary Mason at the Ignite NYC event last year is for you. Hilary, a self-admitted cheeseburger lover, found some data sets in […]
This instructional video explores how to use Hadoop and the Hortonworks Data Platform to analyze sentiment data to understand how the public feels about a product launch – highlighted is the release of the film “Iron Man 3.”
Bits are bits. Whether you are searching for whales in audio clips or trying to predict hospitalization rates based on insurance claims, the process is the same: clean the data, generate features, build a model, and iterate.
In this edition of insideBIGDATA’s Data Science 101 series, I’m going to offer up a short instructional video describing the use of the popular unsupervised learning algorithm, k-means clustering.
“As InfiniBand is getting used in scientific computing environments, there is a big demand to harness its benefits for enterprise environments for handling big data and analytics. This talk will focus on high-performance and scalable designs of Hadoop using native RDMA support of InfiniBand and RoCE. Designs for various components in Hadoop (such as HDFS, MapReduce, RPC, and HBase) and their benefits based on the RDMA package for Apache Hadoop will be presented. RDMA-based design for scalable Memcached (used in Web 2.0) and the associated benefits will be presented.”