The State of Big Data: What the Surveys Say

survey

As a data scientist, I should believe in the value of surveys and other data collection mechanisms. In the case IT industry surveys, I’m not convinced how accurately the respondents report their reality while rushing through online surveys. So taking the results with a grain of salt, I found an intriguing article appearing in Forbes: The State of Big Data: What The Surveys Say.

SciDB – How Linear Algebra Operations Scale

SciDB

I found a very interesting technical report that shows SciDB’s usefulness for machine learning applications: “SciDB – How Linear Algebra Operations Scale.”

Paper Shows Big Data Fostering Serendipity

architecture

Can Big Data be used to foster serendipity? That’s the premise of an award-winning paper in the 2013 Semantic Web Challenge. Entitled “Fostering Serendipity through Big Linked Data” the paper was written by Muhammad Saleem, Maulik R. Kamdar, Aftab Iqbal, Shanmukha Sampath, Helena F. Deus and Axel-Cyrille Ngonga Ngomo. The amount of bio-medical data available […]

Latest Trends in High Performance Computing Usage and Spending Identified by IDC

images

The latest International Data Corporation (IDC) worldwide study of high performance computing (HPC) end-user sites, is now available. The 2013 study included sites representing 905 HPC systems, nearly double the 488 systems profiled in the previous version of the study.

RESEARCH: MLI – An API for Distributed Machine Learning

Machine_Learning

MLI is an Application Programming Interface (API) designed to address the challenges of building machine learning algorithms in a distributed setting based on data-centric computing. Its primary goal is to simplify the development of high-performance, scalable, distributed algorithms. A new research paper is available on the arXiv pre-print server which describes the new API. MLI […]

Big Data Success – Substance Behind the Hype

All the important indicators point to greater Big Data spending by businesses, but the question remains: what steps are you taking to get the most out of your investment? To insure a reasonable ROI, experts say businesses shouldn’t just invest on Big Data technologies, but on Big Data staffing as well. The latter is getting […]

2013 is the Year of Experimentation for Big Data

It’s not surprising that big data, data science and machine learning are hugely dominant topics in the IT and business world. For the past few years, it seems that there’s been more articles, blogs and op-ed pieces about big data than you could keep track of. And with good reason – plenty of organizations across […]

Video: Large Scale Storage for Data Intensive Science at KIT

In this video from the DDN User Group Meeting at ISC’13, Jos van Wezel from the Karlsruhe Institute of Technology (KIT) presents: Large Scale Storage for Data Intensive Science at KIT. In building a high-speed Big Data grid with other leading research institutions, we have demonstrated the power of leading-edge technologies including DDN | WOS […]

Video: Advancing Research at London's Global University

In this video from the DDN User Group Meeting at ISC’13, Dr. Daniel Hanlon from the University College of London presents: Advancing Research at London’s Global University. As UCL’s storage demands grow, the university expects to build a storage foundation that will scale up to 100PB. Looking for a storage solution that was massively scalable […]

TACC's Hadoop Cluster Makes Big Data Research More Accessible

Over at the Texas Advanced Computing Center, Aaron Dubrow writes that researchers are using a specialized cluster at TACC to do experimental Hadoop-style studies on a current production system. This system offers researchers a total of 48, eight-processor nodes on TACC’s Longhorn cluster to run Hadoop in a coordinated way with accompanying large-memory processors. A […]