I ran across a Tweet recently that pointed me to a discussion over on Professor Andrew Gelman’s blog, “Statistics is the least important part of data science.” Dr. Gelman is a Professor of Statistics and Political Science at Columbia University and prior Ph.D. adviser of Rachel Schutt, author of Doing Data Science which I reviewed earlier this month.
A very timely article recently appeared in Forbes that focused on several issues that may hold back big data from contributing to the bottom line in the near term, How Soon Will Big Data Yield Big Profits? This topic is on the mind of many companies in the process or contemplating the move into big data technology solutions.
MLDemos is a dandy open-source visualization tool for machine learning algorithms created to help studying and understanding how various algorithms function and how their parameters affect and modify the results in problems of classification, regression, clustering, dimensionality reduction, dynamical systems and reward maximization. MLDemos is open-source and free for personal and academic use. Much insight […]
A very important technique in unsupervised machine learning as well as dimensionality reduction is Principal Component Analysis (PCA). But PCA is difficult to understand without the fundamental mathematical underpinnings. The two instructional videos below (Part 1 and 2) demonstrate PCA at an introductory level to provide an appreciation for this powerful tool used in big data applications.
A recent announcement appearing in MIT News, “Machine learning branches out,” highlights new research in probabilistic graphical models. In a paper being presented in December at the annual conference of the Neural Information Processing Systems Foundation, MIT researchers describe a new technique that expands the class of data sets whose structure can be efficiently deduced. […]
Apparently the MOOC (Massive Open Online Course) ecosystem is practicing what they preach. As part of the Artificial Intelligence in Education (AIED 2013) conference this past July, a special workshop was held – the “moocshop” – that included representatives from many of the high-flying MOOCs, Coursera, Edx, and researchers from top universities.
The corporate finance community is specifically tailored to gain benefit from the growth of big data technology. For finance it’s the name of the game to utilize analytics to gain a strategic advantage. A recent Forbes interview, Emerging Big Data Opportunities for CFOs, with Carlos Passi, assistant controller of IBM, discussed emerging types of data […]