SQLstream, Inc., the Big Data Stream Processors company, recently announced the availability of its Stream Processor for Apache Storm, the open source distributed stream computing framework, making possible the real-time analysis of machine data in Storm using continuous SQL queries.
MapR Technologies, Inc., the leader in Apache™ Hadoop® technology for Big Data deployments, recently announced the early access release of the new HP Vertica Analytics Platform on MapR. This high-performance, interactive SQL-on-Hadoop solution tightly integrates HP Vertica’s high-performance analytic platform directly on MapR’s enterprise-grade distribution for Hadoop.
Looker, a business analytics software company, today announced its enhanced Looker Datafold Engine, with support for persistent derived tables to deliver faster, more meaningful business insights. Persistent derived tables rely on Looker’s in-database architecture to empower data analysts and reduce their workloads.
DataStax today announced DataStax Enterprise 4.0, the latest version of its always-on database platform based on Apache Cassandra. DataStax Enterprise 4.0 introduces a powerful new in-memory option and enterprise search enhancements for significantly faster performance in use cases where companies must process data at the speed of business.
We sat down with Cristian Borcea, PhD from the New Jersey Institute of Technology to discuss the IoT and Big Data applications. “New machine learning techniques could help us extract knowledge from these data – this happens especially for knowledge that we don’t expect and we don’t even know exists – we cannot search for something that we don’t know exists.”
OneFold believes that the expertise of a data scientist should be used for solving the most complex of analysis and not for the more day-to-day aspects of data extraction. The company uses a plug and play DHQL platform for automated data extraction, transformation, querying and reporting. Navneet Aron and Uday Sandhar, co-founders of OneFold, teamed up to answer our questions about this intriguing technology.
Global education giant Pearson uses data science to make their products better especially for student outcomes. The video below features a recent Cloud episode of Google Developers Live, with Felipe Hoffa hosting Pearson’s Director of Data Science Collin Sellman, to celebrate Python Pandas release 0.13 and its Google BigQuery connector. Jacob Schaer and Sean Schaefer join them to demo its capabilities, and how Pearson uses data science to improve education.
As a practicing data scientist and big data journalist, I often find myself down in the trenches on pursuit of new trends, products, and services. Earlier this week I attended a local machine learning meetup group event and I came away with a real gem. The presenter mentioned in passing a new cloud service called “Domino” and I rushed back to my office to learn more. I wasn’t disappointed.