Jeremy Howard gave a presentation to the Melbourne R meetup group, in which he offered a brief overview of his “data scientist’s toolbox” (using a few Kaggle competitions as practical examples) and an introduction to ensembles of decision trees (including the well-known Random Forest™ algorithm).
Here is an informative short video presentation from the recent Strata Conference 2014 in Santa Clara, featuring Mike Hendrickson, VP Content Strategy at O’Reilly Media, interviewing Vince Dell’anno, Big Data Lead for Accenture. The discussion centers around big data trends such as what types of organizations tend to need big data technology more than […]
Global education giant Pearson uses data science to improve its products, particularly with regard to student outcomes. The video below features a recent Cloud episode of Google Developers Live, with Felipe Hoffa hosting Pearson’s Director of Data Science Collin Sellman, to celebrate Python Pandas release 0.13 and its Google BigQuery connector. Jacob Schaer and Sean Schaefer join them to demo its capabilities and show how Pearson uses data science to improve education.
Everyone’s talking about hiring data scientists, but most pundits continue to focus on skills rather than the mindset required for this challenging role. Talent Analytics CEO Greta Roberts spoke about this topic to a group of data scientists attending the Boston-area Big Data Analytics Meetup on February 4th, 2014.
The video below comes to us from the Strata Conference 2014: How Companies are Using Spark, and Where the Edge in Big Data Will Be. While the first big data systems made a new class of applications possible, organizations must now compete on the speed and sophistication with which they can draw value from data. […]
Last week saw evidence of the big data industry steamroller effect as the Strata Conference 2014 in Santa Clara came and went. With thousands of attendees, an abundance of informative presentations, and a very healthy exhibitor ecosystem, the show defined the current state of the art for all that is big data. If you missed the big event, O’Reilly Media has graciously made available the slides and videos for some of the presentations.
H2O, the open source in-memory machine learning and predictive analytics company for big data, announced a partnership with Cloudera, a leader in enterprise data management powered by Apache™ Hadoop.