Data Science 101: Real-time Analytics using Cassandra, Spark and Shark

In the video below, Evan Chan (Software Engineer at Ooyala), describes his experience using the Spark and Shark frameworks for running real-time queries on top of Cassandra data.

Where There’s Spark There’s Fire: The State of Apache Spark in 2014

Matei Zaharia, CTO of Databricks and Creator of Apache Spark

In this special guest feature, Matei Zaharia, CTO of Databricks and Creator of Apache Spark, explores open-source Apache Spark ‘s status in the Hadoop community.

Book Reviews: The Bootstrap Resampling Technique

Bootstrap

In the spirit of the importance of bootstrap methods to contemporary machine learning, I’d like to review several prominent books on the subject. Some of the titles are relatively new, while others can be considered “classics.”

Guavus Enhances its Reflex Operational Intelligence Platform with Apache Spark and Hadoop YARN

guavus logo new

Guavus, a leading provider of big data analytics solutions for operational intelligence, has unveiled Reflex 2.0 with support for Apache Spark and Hadoop YARN. The Guavus Reflex™ Operational Intelligence Platform provides a real-time analysis across business and operations for better quality decision-making.

Data Science 101: Introduction to Deep Learning on Hadoop

As the data world undergoes its Cambrian explosion phase, our data tools need to become more advanced to keep pace. Deep Learning has emerged as a key tool in the non-linear arms race of machine learning. In the video below Josh Patterson and Adam Gibson take a look at how we can parallelize Deep Belief Networks in Deep Learning on Hadoop’s next generation YARN framework with Iterative Reduce.

Revolution R Enterprise vs. SAS Performance Benchmark

RRE vs. SAS Benchmark Results

The debate over which statistical platform sits premiere over the others for data science applications rages on. The discussion often turns to the popular R and SAS environments. But to focus the dialog on performance only, a new benchmark study was just completed by commercial R provider Revolution Analytics.

MicroStrategy Analytics Platform™ Certified on Apache Spark

Spark_logo

MicroStrategy® Incorporated (Nasdaq: MSTR), a leading worldwide provider of enterprise software platforms, has announced the certification of the MicroStrategy Analytics Platform™ on Apache Spark, the in-memory processing engine that is now a part of all major Hadoop distributions.

TIBCO Announces Jaspersoft for AWS Surpasses 1,000 Active Customers

TIBCO_logo

TIBCO Software Inc. (NASDAQ: TIBX) today announced that more than 1,000 active customers have subscribed to the hourly-priced, TIBCO Jaspersoft® for AWS. The tremendous growth since launch in February 2013 illustrates the market’s high demand for pay-as-you-go Business Intelligence (BI).

Building a Better Brain: Saffron Cognitive Computing Platform Replicates How We Associate Facts

Saffron_logo

Saffron Technology has been on a quest since 1999 to replicate the way the human brain learns using associative memory. Saffron is now commercially available as a cognitive computing platform following beta testing for real-time operational risk intelligence and decision support in defense, energy, healthcare and manufacturing applications.

In-Memory Database vs. In-Memory Data Grid

Nikita Ivanov, CTO of GridGain

In-memory computing is comprised of two main categories: In-Memory Databases and In-Memory Data Grids. Nikita Ivanov, CTO of GridGain delves into the differences between the two and when to apply each technology.