Scientific research in the life sciences is often akin to searching for needles in haystacks. Finding the one protein, chemical, or genome that behaves or responds in the way the scientist is looking for is the key to the discovery process. For decades, high performance computing (HPC) systems have accelerated this process, often by helping to identify and eliminate in feasible targets sooner.
In this edition of insideBIGDATA’s Data Science 101 series, I’m going to offer up a short instructional video describing the use of the popular unsupervised learning algorithm, k-means clustering.
“Data Analytics Handbook” is a new resource meant to inform young professionals about the field of data science. Written by a group of students at UC Berkeley: Brian Liou, Tristan Tao, and Elizabeth Lin. Edition One of the book includes in-depth interviews with Data Scientists & Data Analysts.
MapR Technologies, Inc., provider of a leading distribution for Apache Hadoop, today announced a strategic partnership with Databricks and the addition of the complete Apache Spark technology stack to the MapR Distribution.
Cloudera and Revolution Analytics allow you to derive new business insights from Big Data by providing a joint solution to store, process, and analyze all your data at scale.
Sqrrl, the company that develops secure NoSQL database software for Big Data applications, has collaborated with Macmillan Education Australia, a leading educational publisher, to help them power a next generation education portal. Sqrrl’s NoSQL database, Sqrrl Enterprise, enables Macmillan to securely store massive amounts of student and teacher data and ensures data is only accessed in authorized ways.
MongoDB today announced the general availability of MongoDB 2.6, the newest release of the popular database. The release builds on five years of innovation and hundreds of thousands of deployments to simplify provisioning and operating MongoDB deployments.