Adobe, Baidu, Netflix, Yandex. Some of the biggest names in social media and cloud computing use NVIDIA CUDA-based GPU accelerators to provide seemingly magical search, intelligent image analysis and personalized movie recommendations, based on a technology called advanced machine learning.
Guavus uses live analytics with responsive queries to garner insightful business metrics to serve up competitive advantage. “Guavas is unique in its ability to provide an end-to-end view across your business and operations in real time. Our operational intelligence platform processes over 2.5 petabytes of data per day, which equals to 250 billion records per day and 2.5 million transactions per second.”
Bits are bits. Whether you are searching for whales in audio clips or trying to predict hospitalization rates based on insurance claims, the process is the same: clean the data, generate features, build a model, and iterate.
In this edition of insideBIGDATA’s Data Science 101 series, I’m going to offer up a short instructional video describing the use of the popular unsupervised learning algorithm, k-means clustering.
Experfy, based at the Harvard Innovation Lab, announced that it has launched a paradigm-changing, online marketplace that will allow industry leaders to solve their Big Data talent needs. Enterprises now have a central platform for on-demand hiring of vetted experts with algorithmic skills and domain knowledge, primarily for short-term projects related to data, analytics and business intelligence.
An integral tool found in data science is Time Series Forecasting. Here is a useful instructional video on the subject from one of the authors of a free eBook available on OTexts – “Forecasting: Principles and Practice.” The presentation “Forecasting Time Series Using R” is made by Professor of Statistics Rob J Hyndman.
Last night I attended the Los Angeles Hadoop users Group (LA-HUG) meeting hosted by Shopzilla. The topic for the evening was “An Overview of Hulu’s Data Platform” presented by Prasan Samtani and Tristan Reid of Hulu. From all indications, Hulu is a significant player in the Hadoop user community and this talk documented the team’s command of big data technology.