As a data scientist, I should believe in the value of surveys and other data collection mechanisms. In the case of IT industry surveys, though, I’m not convinced that respondents report their reality accurately while rushing through online surveys. So, taking the results with a grain of salt, I found an intriguing article in Forbes: The State of Big Data: What The Surveys Say.
Can Big Data be used to foster serendipity? That’s the premise of an award-winning paper in the 2013 Semantic Web Challenge. Entitled “Fostering Serendipity through Big Linked Data,” the paper was written by Muhammad Saleem, Maulik R. Kamdar, Aftab Iqbal, Shanmukha Sampath, Helena F. Deus and Axel-Cyrille Ngonga Ngomo. The amount of bio-medical data available […]
MLI is an Application Programming Interface (API) designed to address the challenges of building machine learning algorithms in a distributed setting based on data-centric computing. Its primary goal is to simplify the development of high-performance, scalable, distributed algorithms. A new research paper describing the API is available on the arXiv pre-print server. MLI […]
All the important indicators point to greater Big Data spending by businesses, but the question remains: what steps are you taking to get the most out of your investment? To ensure a reasonable ROI, experts say businesses shouldn’t just invest in Big Data technologies, but in Big Data staffing as well. The latter is getting […]
It’s not surprising that big data, data science and machine learning are hugely dominant topics in the IT and business world. For the past few years, it seems that there have been more articles, blogs and op-ed pieces about big data than you could keep track of. And with good reason: plenty of organizations across […]
In this video from the DDN User Group Meeting at ISC’13, Jos van Wezel from the Karlsruhe Institute of Technology (KIT) presents: Large Scale Storage for Data Intensive Science at KIT. In building a high-speed Big Data grid with other leading research institutions, we have demonstrated the power of leading-edge technologies including DDN | WOS […]
In this video from the DDN User Group Meeting at ISC’13, Dr. Daniel Hanlon from University College London presents: Advancing Research at London’s Global University. As UCL’s storage demands grow, the university expects to build a storage foundation that will scale up to 100PB. Looking for a storage solution that was massively scalable […]
Over at the Texas Advanced Computing Center, Aaron Dubrow writes that researchers are using a specialized cluster at TACC to do experimental Hadoop-style studies on a current production system. This system offers researchers a total of 48 eight-processor nodes on TACC’s Longhorn cluster to run Hadoop in a coordinated way with accompanying large-memory processors. A […]