One of the attractions of the Hadoop Summit 2014 was the Big Data & Brews interview series – “Live from Hadoop Summit.” These short, well-focused discussions always provide good insight into important industry trends. In the episode below, the conversation turns to the subject of SQL on Hadoop. Stefan Groschupf, the CEO of Datameer, recorded a special interview with Ovum analyst Tony Baer, who gave his thoughts on the topic.
“With a novel data analytics approach, Mellanox is providing double performance to data Extract, Transform, Load (ETL) solutions based on Hadoop Map Reduce. The code to enable such performance gains is part of the Hadoop community code. With acceleration of the networking stack, Mellanox is quadrupling the number of clients served on a single Memcached server, a dominating key-value caching service for large scale web applications.”
Feeling left out because your boss wouldn’t let you attend the Hadoop Summit 2014 happening this week? Not to worry! Here’s a free alternative, especially attractive if you happen to live or work in Los Angeles. The Big Data Camp LA 2014 is a free, all-day conference on Saturday, June 14, hosted at the DirecTV campus near LAX.
In the video presentation below from the SpringOne 2GX 2012 conference in Washington, DC, Costin Leau looks at the architecture of Big Data pipelines, the challenges ahead, and how to build manageable and robust solutions using open source software such as Apache Hadoop, Hive, Pig, Spring for Apache Hadoop, Spring Batch, and Spring Integration.
“We give Hadoop the predictability it needs, let organizations see what it’s doing (with detailed usage metrics for every user, job, and task, in real time), and help organizations get the most out of their hardware investment. We are not for the organizations that have just entered into their first Hadoop project (because they don’t rely on it… yet). We are here for those who already rely on the business-critical data and functionality Hadoop can deliver.”
Hortonworks and Yahoo! are pleased to host the 7th Annual Hadoop Summit, the leading conference for the Apache Hadoop community. This event, expanded now to three days, June 3-5, will feature many of the Apache Hadoop thought leaders who will showcase successful Hadoop use cases, share development and administration tips and tricks, and educate organizations about how best to leverage Apache Hadoop as a key component in their enterprise data architecture.
“GridBank provides a comprehensive information governance framework to help organizations meet compliance regulations for retention management and disposal, and to mitigate data related risk by using end-to-end data protection. The GridBank Metabase, a distributed metadata repository, enables enterprise search and discovery and provides integration for big data analytics tools for increased data insights.”
The recent Big Ideas for Sustainable Prosperity research conference brought together some of the world’s preeminent environment and economy thinkers for two days to share knowledge and think big about Policy Innovation for Greening Growth. In the video presentation below, Dr. Matthew E. Kahn argues that the combination of Big Data and field experiments can sharply improve urban quality of life.