Last night I attended the Los Angeles Hadoop users Group (LA-HUG) meeting hosted by Shopzilla. The topic for the evening was “An Overview of Hulu’s Data Platform” presented by Prasan Samtani and Tristan Reid of Hulu. From all indications, Hulu is a significant player in the Hadoop user community and this talk documented the team’s command of big data technology.
“Datameer is all about providing a self-service, end-to-end experience for big data analytics on Hadoop. From data integration to analytics to visualization, we are wizard-led, point-and-click. Most recently we announced our Smart Analytics module, which allows business users to use data mining algorithms through a drag and drop UI. These new capabilities complement what data scientists are doing and enable business analysts to take advantage of advanced algorithms without involving IT.”
Big data applications represent a fast-growing category of high-value applications that are increasingly employed by business and technical computing users. However, they have exposed an inconvenient dichotomy in the way resources are utilized in data centers. A new white paper that focuses on these issues is available here on insideBIGDATA.
“Machine logs contain simple and complex data – some logs contain time stamped data (i.e. syslogs) that are tactical events or errors used by sys admins to troubleshoot IT infrastructure. But other logs have more complex, unstructured or multi-structured text with sections on configuration info, statistics and other non-time stamped data. To make sense of the data in these logs, one needs a powerful language and processing engine to provide meaning and structure to the information. Once structure is defined, complex analytics and trend reporting can be performed.”
“Splunk Enterprise is a platform for machine data. The technology delivers powerful and fast analytics to quickly unlock the value of machine data to IT and other users throughout an organization. In short, it’s a simple, effective way to collect, analyze and secure the massive streams of machine data generated by all IT systems and technology infrastructure.”
“Intel’s goal is to encourage more innovative and creative uses for data as well as to demonstrate how big data and analytics technologies are impacting many facets of our daily lives, including sports. For example, coaches and their staffs are using real-time statistics to adjust games on-the-fly and throughout the season. From intelligent cameras to wearable sensors, a massive amount of data is being produced that, if analyzed in real-time, can provide a significant competitive advantage. Intel is among those making big data technologies more affordable, available, and easier to use for everything from helping develop new scientific discoveries and business models to even gaining the upper hand on good-natured predictions of sporting events.”
“Active archives are ideal for organizations that face exponential data growth or regularly manage high-volume unstructured data or digital assets. Target markets include life sciences, media and entertainment, education, research, government, financial services, oil and gas, and telecommunications, as well as general IT organizations requiring online data archive options.”
“Data has become the foundation upon which all smart business decisions are made. Archives have traditionally been inaccessible, offline and practically invisible. The new business culture demands more. The Dternity S unlocks the potential of your data, providing easy online access to all of your archived content.”