RapidMiner Moves Predictive Analytics, Data Mining and Machine Learning into the Cloud

Pioneering predictive analytics leader RapidMiner has announced the general availability of RapidMiner Cloud.

Apache Spark Beats the World Record for Fastest Processing of Big Data

Databricks, the company founded by the creators of popular open-source Big Data processing engine Apache Spark, announced today that it has broken the world record for the GraySort, a third-party, industry benchmarking competition for sorting large on-disk datasets.

Types of In-Memory Computing

In this installment we’ll set the stage for in-memory computing technology in terms of its current state as well as its next stage of evolution. We’ll begin with a discussion of the capabilities of in-memory databases (IMDBs) and in-memory data grids (IMDGs), and show how they differ. We’ll finish up the section by demonstrating how neither one is sufficient for a company’s strategic move to IMC; instead, we will explain why a comprehensive in-memory data platform is needed.

Splunk Introduces Splunk Enterprise 6.2

Splunk Inc. (NASDAQ:SPLK), provider of a leading software platform for real-time Operational Intelligence, has announced Splunk® Enterprise 6.2, the latest version of the award-winning platform for machine data. Splunk Enterprise 6.2 delivers simplified analysis and powerful pattern detection that enable more users across IT and the business to discover relationships in their data and build advanced analytics.

Spark Panel Discussion with Cloudera, MapR & Pivotal

The panel discussion video below comes from the Los Angeles Spark Users Group. The talk fosters a lively discussion on Spark’s initial goals, where it came from and what the future holds for Spark. Many leading Big Data vendors are responding by introducing Spark’s capabilities into their architectures. The panel discussion is between the top Hadoop distribution vendors – Cloudera, MapR, and Pivotal.

Sematext Announces Monitoring Support for Apache Spark

Sematext Group, Inc., a Brooklyn-based Performance Monitoring and Log Management solution provider, has announced the first dedicated monitoring support for Apache Spark in the latest release of its SPM performance monitoring solution.

Lustre 101

This week’s Lustre 101 article looks at the history of Lustre and the typical configuration of this high-performance, scalable storage solution for big data applications.

Alteryx Secures $60 Million in Funding for Data Blending and Advanced Analytics

Alteryx, Inc., the leader in data blending and advanced analytics, today announced a $60 million investment led by Insight Venture Partners, with participation from existing investors SAP Ventures and Toba Capital. The new investment responds to the significant growth Alteryx has experienced in the last year, including over 200 percent growth in its customer base.

Credit Scoring and Back Trading/Testing

This article is the third in an editorial series that aims to give enterprise thought leaders direction on leveraging big data technologies in support of analytics proficiencies designed to work more independently and effectively in today’s climate of increasing the value of corporate data assets.

Pepperdata Enables Opower to Rely on Hadoop for Real-time Big Data Analytics

Pepperdata, a leader in Apache Hadoop performance optimization, has announced that Opower, a leader in cloud-based software for the utility industry, has deployed Pepperdata Supervisor, a real-time cluster optimizer, to ensure its production Hadoop environment can scale to hundreds of billions of data points reliably, efficiently and on time.