Ask a Data Scientist: Handling Missing Data

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” This week’s question is from a reader who seeks a discussion of missing data handling methods such as imputation.

Skytree and IBM Partner to Elevate Big Data Analytics with Advanced Machine Learning

Skytree®, The Machine Learning Company®, today announced a partnership with IBM®, a global leader in cutting-edge technology. The technical integration allows Skytree’s scalable, enterprise-grade machine learning platform, Skytree Infinity™, to run on top of IBM PureData™ System for Analytics, powered by IBM Netezza® technology.

Data Science 101: Data Agnosticism – Feature Engineering Without Domain Expertise

From the SciPy2013 conference, here is a compelling talk “Data Agnosticism: Feature Engineering Without Domain Expertise” by Nicholas Kridler of Accretive Health in Chicago.

Ask a Data Scientist: The Bias vs. Variance Tradeoff

Monica Martinez-Canales, PhD. Intel

This week’s Ask a Data Scientist question is from a reader who wants an explanation of the “bias vs. variance tradeoff in statistical learning.” Intel’s Dr. Monica Martinez-Canales is this week’s guest Data Scientist

Revolution Analytics Introduces Revolution R Open and Revolution R Plus

Revolution

Revolution Analytics, the only commercial provider of open source R software, today announced two new offerings that support the open source R community and elevate R’s capabilities to enterprise-level performance. Revolution R Open is a free, open source R distribution that enhances R performance, makes it easier to share R scripts and improves collaboration on R-based advanced analytics applications.

Ask a Data Scientist: Curse of Dimensionality

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” Once a week you’ll see reader submitted questions of varying levels of technical detail answered by a practicing data scientist – sometimes by me and other times by an Intel data scientist. This week’s question is from a reader who wants to know more about the “curse of dimensionality.”

RapidMiner Moves Predictive Analytics, Data Mining and Machine Learning into the Cloud

Pioneering predictive analytics leader RapidMiner has announced the general availability of RapidMiner Cloud.

Predictive Modeling and Production Deployment

insideBIGDATA_Guide_PA

Using predictive analytics involves understanding and preparing the data, defining the predictive model, and following the predictive process. Predictive models can assume many shapes and sizes, depending on their complexity and the application for which they are designed. The first step is to understand what questions you are trying to answer for your organization.

Cornerstone OnDemand to Acquire Evolv for $42.5 Million

evolv-logo

Cornerstone OnDemand (NASDAQ:CSOD), a global leader in cloud-based talent management solutions, announced that the company has entered into a definitive agreement to acquire privately-held Evolv Inc. With its leading machine learning and data science platform, the acquisition of Evolv will allow Cornerstone clients to leverage the power of big data analytics to make better workforce decisions.

Prelert Introduces Real-Time Analysis of Complex Anomalies in Big Data Sets

prelert_logo

Prelert, the anomaly detection company, has announced a new feature of its Anomaly Detective machine learning engine that enables multidimensional analysis to be conducted on large volumes of data at speeds never before possible. This new feature, Stats Reduce, dramatically shrinks data transfer sizes, making it possible to perform the complex behavioral analysis of terabytes of data per hour.