Data Science 101: Data Agnosticism – Feature Engineering Without Domain Expertise

From the SciPy2013 conference, here is a compelling talk “Data Agnosticism: Feature Engineering Without Domain Expertise” by Nicholas Kridler of Accretive Health in Chicago.

Ask a Data Scientist: The Bias vs. Variance Tradeoff

Monica Martinez-Canales, PhD. Intel

This week’s Ask a Data Scientist question is from a reader who wants an explanation of the “bias vs. variance tradeoff in statistical learning.” Intel’s Dr. Monica Martinez-Canales is this week’s guest Data Scientist

Revolution Analytics Introduces Revolution R Open and Revolution R Plus

Revolution

Revolution Analytics, the only commercial provider of open source R software, today announced two new offerings that support the open source R community and elevate R’s capabilities to enterprise-level performance. Revolution R Open is a free, open source R distribution that enhances R performance, makes it easier to share R scripts and improves collaboration on R-based advanced analytics applications.

Ask a Data Scientist: Curse of Dimensionality

datascientist2_featured

Welcome back to our series of articles sponsored by Intel – “Ask a Data Scientist.” Once a week you’ll see reader submitted questions of varying levels of technical detail answered by a practicing data scientist – sometimes by me and other times by an Intel data scientist. This week’s question is from a reader who wants to know more about the “curse of dimensionality.”

RapidMiner Moves Predictive Analytics, Data Mining and Machine Learning into the Cloud

Pioneering predictive analytics leader RapidMiner has announced the general availability of RapidMiner Cloud.

Predictive Modeling and Production Deployment

insideBIGDATA_Guide_PA

Using predictive analytics involves understanding and preparing the data, defining the predictive model, and following the predictive process. Predictive models can assume many shapes and sizes, depending on their complexity and the application for which they are designed. The first step is to understand what questions you are trying to answer for your organization.

Cornerstone OnDemand to Acquire Evolv for $42.5 Million

evolv-logo

Cornerstone OnDemand (NASDAQ:CSOD), a global leader in cloud-based talent management solutions, announced that the company has entered into a definitive agreement to acquire privately-held Evolv Inc. With its leading machine learning and data science platform, the acquisition of Evolv will allow Cornerstone clients to leverage the power of big data analytics to make better workforce decisions.

Prelert Introduces Real-Time Analysis of Complex Anomalies in Big Data Sets

prelert_logo

Prelert, the anomaly detection company, has announced a new feature of its Anomaly Detective machine learning engine that enables multidimensional analysis to be conducted on large volumes of data at speeds never before possible. This new feature, Stats Reduce, dramatically shrinks data transfer sizes, making it possible to perform the complex behavioral analysis of terabytes of data per hour.

Fortscale Introduces User Behavior Analytics Solution for User-Related Threat Mitigation

Fortscale-logo

Fortscale is officially introducing its innovative flagship product that helps enterprise security analysts identify user-related threats, malicious insiders, compromised accounts, suspicious behavior and risky access to data by extracting Big Data repositories with user behavior analytics.

Data Access and Exploratory Data Analysis

insideBIGDATA_Guide_PA

Enterprise data assets are what feed the predictive analytic process, and any tool must facilitate easy integration with all the different types data sources required to answer critical business questions. Robust predictive analytics needs to access analytical and relational databases, OLAP cubes, flat files, and enterprise applications.