I recently ran across a blog post that discusses a very important characteristic for machine learning solutions – Generalization. If you’ve ever wondered about the primary reason why machines can learn, generalization is the concept you need to understand. It is the premise underlying all statistical learning and it goes something like this. We start […]
“For customers who need to retain and access hundreds of terabytes of unstructured data, Quantum Lattus Object Storage is a self-healing, self-protecting private cloud solution that enables more efficient primary storage usage, delivers extreme archive data resiliency and protection, and offers low latency disk access to archive data. Compared to RAID or tape storage, Lattus Object Storage provides the most effective solution on a cost/performance basis for active access, retention and protection of unstructured data in large archive environments.”
Machine learning has resulted in the development of solutions that are getting exponentially better each year. Already, algorithms using machine learning can drive cars, grade essays, write magazine articles, and read and understand newspapers. In the video presentation below from the recent TEDxSF conference, data scientist Jeremy Howard explains what the state of machine learning […]
Cloudera, a major player in enterprise analytic data management powered by Apache™ Hadoop®, and WANdisco, a provider of continuous availability software for global enterprises to meet the challenges of big data, announced that WANdisco’s Non-Stop Hadoop technology is certified to run on Cloudera’s Distribution for Hadoop version 4 (CDH4) providing 100% uptime for global multi-data center deployments.
On Tuesday Facebook announced it hired machine learning pioneer Yann LeCun to run its newly created artificial intelligence lab. Scooping up one of the biggest names in the field is a major move for the company, but it’s not a surprising one. If anything, Facebook is late to enter to the data science arms race that’s underway in Silicon Valley and the country as a whole.
For all you Rubyists out there, here is a great talk from the recent Ruby Conf 2013 that took place in Miami Beach on Nov. 8-10 – Thinking About Machine Learning with Ruby by Bryan Liles. Not sure where to cluster or where to classify? Have you seen a linear regression lately? Every wanted to […]
As a data scientist, I should believe in the value of surveys and other data collection mechanisms. In the case IT industry surveys, I’m not convinced how accurately the respondents report their reality while rushing through online surveys. So taking the results with a grain of salt, I found an intriguing article appearing in Forbes: The State of Big Data: What The Surveys Say.
“DataRPM is a revolutionary business intelligence and data analytics solution that provides a natural language question answering and search interface to analyze and visualize any data residing anywhere in corporate databases, big data systems, files, applications, 3rd party systems, data warehouses and even other business intelligence tools. Available on the cloud, on premises and embeddable in SaaS/ISV applications.”
“With data growing exponentially, one of the greatest data management challenges is end-to-end protection, governance, discovery and access, no matter the file type, device or social origination.” said Steve Duplessie, founder and senior analyst, Enterprise Strategy Group. “Tarmin’s latest release of GridBank is tailored to meet all the needs of organizations facing massive growth with unstructured data.”