How to Become a Big Data Superhero

Cloudera Logo April 2014

Capgemini and Cloudera have created the new infographic below on how organizations can maximize their Big Data potential. Capgemini and Cloudera recently announced an extended partnership to help organizations accelerate their Big Data initiatives.

Developing an Application that Can Display Millions of Data Points on a Map

Million_Tweet_Map

Maptimize, the company behind the ‘One Million Tweet Map’, provides a clustering application to display massive amounts of information – such as Tweets, users or points of interest – on a map.

Where There’s Spark There’s Fire: The State of Apache Spark in 2014

Matei Zaharia, CTO of Databricks and Creator of Apache Spark

In this special guest feature, Matei Zaharia, CTO of Databricks and Creator of Apache Spark, explores open-source Apache Spark ‘s status in the Hadoop community.

Accenture and Hortonworks Join Forces to Help Businesses Manage Big Data

hortonworks

Accenture (NYSE: ACN) has entered into an alliance agreement with Hortonworks, a leading contributor to and provider of enterprise Apache™ Hadoop®, in a further strategic move to build its big data and digital capabilities and bring big outcomes from big data and analytics to its clients.

The userR!2014 Conference in Review

useR_JohnChambers_summary

FIELD REPORT Last week I attended the long-anticipated useR!2014 international conference at the UCLA campus, my alma mater. The four day event had something for everyone in attendance – all the brain cycles centered around the use of the R statistical environment. Since R is a primary tool for my work in data science and […]

Interview: Dataguise Perspectives on Big Data Security

Dataguise_logo

Big Data security all too often is an afterthought when deploying solutions like Hadoop, and companies slowly are discovering that security is just as important as any other aspect of the project. In the interview below, I was able to catch up with officials at a leading big data security vendor Dataguise to talk about […]

Production Deployment Environments for R

Machine_Learning

To help our audience leverage the power of machine learning, the editors of insideBIGDATA have created this weekly article series called “The insideBIGDATA Guide to Machine Learning.” This is our eighth and final installment, “Production Deployment Environments for R.”

Leaving Data on the Table: Data Scientists Reveal Obstacles to Big Data

Paradigm4-data-scientist-survey-Infographic-FINAL

The huge volume of Big Data produced by sensors, genomic sequencers, electronic exchanges, and connected devices continues to generate headlines but it’s the diverse types of data, not the volume, that’s a bigger challenge to data scientists and is causing them to “leave data on the table.”

Production Deployment with R

Machine_Learning2

To help our audience leverage the power of machine learning, the editors of insideBIGDATA have created this weekly article series called “The insideBIGDATA Guide to Machine Learning.” This is our seventh installment, “Production Deployment with R.”

Teradata Lifts the Limitations on Open Source R Analytics

teradata_logo_mi

Teradata (NYSE: TDC), the analytic data platforms, marketing applications, and services company, today introduced Teradata® Aster® R, which extends the power of open source R analytics by lifting the memory and processing limitations. Teradata Aster R offers the R analyst an enterprise-ready business analytics solution that is massively scalable, reliable, and easy-to-use.