Sign up for our newsletter and get the latest big data news and analysis.

Data Science 101: Data Agnosticism

Bits are bits. Whether you are searching for whales in audio clips or trying to predict hospitalization rates based on insurance claims, the process is the same: clean the data, generate features, build a model, and iterate.

MongoDB to Unlock Insights from Real-time Smart Grid Data

Silver Spring Networks, Inc. (NYSE: SSNI), a leading networking and solutions provider for smart energy networks, is building on the MongoDB database to seamlessly capture and store high volumes of rapidly changing, complex machine-to-machine (M2M) data for its new SilverLink™ Sensor Network

Interview: LC Technology International Stores Data in Cross-Platform Environments

When disaster strikes, lost data could cost you your business. LC Technology International is continually improving their data recovery products to meet these needs.

Big Data Humor: Apollo Blues

Humor_walk_world

A statistical tribute to our waning group of brave Apollo astronauts!     Sign up for the free insideBIGDATA newsletter.  

New Market Dynamics Report: HPC Life Sciences

HPC Life Sciences

Scientific research in the life sciences is often akin to searching for needles in haystacks. Finding the one protein, chemical, or genome that behaves or responds in the way the scientist is looking for is the key to the discovery process. For decades, high performance computing (HPC) systems have accelerated this process, often by helping to identify and eliminate in feasible targets sooner.

Data Science 101: k-means Clustering

In this edition of insideBIGDATA’s Data Science 101 series, I’m going to offer up a short instructional video describing the use of the popular unsupervised learning algorithm, k-means clustering.

Visualization of the Week: Coachella by the Numbers

Coachella

One of the annual extravaganzas in Southern California is the Coachella Valley Music and Arts Festival which starts this weekend and continues onto next weekend.

Rubicon.IO Uses Riak to Provide Real-Time Threat Analysis

An example of the Rubicon User Interface

Rubicon.IO is a start-up in the threat intelligence space that real-time analytic capabilities by scouring metadata from various sources: threat feeds, social media, SIEM data, and PCAPs. It uses an HPC engine that aggregates and humanizes geospatial, TECHINT, HUMINT, and OSINT data sources.

Data Science 101: The Data Analytics Handbook

“Data Analytics Handbook” is a new resource meant to inform young professionals about the field of data science. Written by a group of students at UC Berkeley: Brian Liou, Tristan Tao, and Elizabeth Lin. Edition One of the book includes in-depth interviews with Data Scientists & Data Analysts.

Netflix Reveals All (well, at least a lot)

netflixlogo

Last night I had the distinct pleasure of attending a Data Science Track event sponsored by the LA Machine Learning meetup group: Data Science @ Netflix.