Discovering Gold with Big Data Analytics and Data-Intensive Computing

Entries filed under “Analytics”

Five Big Business Opportunities for Big Data

Over at Information Week, Doug Henschen writes that Big data project leaders are seeking key technology ingredients starting with SQL analysis.

There’s a big push to deliver SQL-analysis capabilities on top of Hadoop, and the talent shortage is just one reason. The second reason for the trend is that Apache Hive, Hadoop’s incumbent data warehousing infrastructure, offers a limited subset of SQL-like query capabilities and suffers from slow performance tied to behind-the-scenes MapReduce processing. Answering the call for broader, faster SQL querying on Hadoop are projects and initiatives including Cloudera Impala, EMC’s HAWQ query feature on the Pivotal HD distribution, Hortonworks Stinger, IBM Big SQL, MapR-supported Apache Drill, and Teradata SQL-H.

This is great stuff if you want to better understand the business opportunities out there in Big Data. Read the Full Story.


Also posted in Business of Big Data | Leave a comment

Slidecast: Get 2know – Big Data For The Shared Economy

In this slidecast, Guy Fraker presents: get2know – Big Data for the Shared Economy.

With shared rides, cars, bikes, and even rooms, the issue of trust is huge. The folks at get2know have a developed a “Trust Engine” that uses Big Data to help you decide who you trust to share your stuff. Amazing!

As we build out to scale, we’ll provide a playground for alliance partners to reward consumers who utilize shared services in postive ways. We will deliver a searchable aggregated view of shared economy providers WITH utilization incentives. By doing both in a single view, using single sign-on, we provide an economic reason to be scored. We believe that by partnering with the Collaborative Consumption community, a market is created where no user asks, “ok- I got my score- now what?” get2kno is about creating a market, not building a platform.”

Learn more at the get2know BlogView the slidesDownload the MP3Subscribe on iTunesSubscribe to RSS


Also posted in Podcasts, Startups, Video | Leave a comment

Slidecast: Teradata Rolls Out Intelligent Memory Technology

In this slidecast, Scott Gnau from Teradata Labs presents: Teradata Intelligent Memory.

The introduction of Teradata Intelligent Memory allows our customers to exploit the performance of memory within Teradata Platforms, which extends our leadership position as the best performing data warehouse technology at the most competitive price,” said Scott Gnau, president, Teradata Labs. “Teradata Intelligent Memory technology is built into the data warehouse and customers don’t have to buy a separate appliance. Additionally, Teradata enables its customers to buy and configure the exact amount of in-memory capability needed for critical workloads. It is unnecessary and impractical to keep all data in memory, because all data do not have the same value to justify being placed in expensive memory.”

 

How does Intelligent Memory work? This animation video does a good job of making this advanced technology look simple.

Read the Full Story * View the slides * Download the MP3Subscribe on iTunesSubscribe to RSS


Also posted in Business of Big Data, Hardware, I/O, Software, Video | Leave a comment

SAS Adds to its Big Data Analytics Product Lineup

SAS this week announced six new high-performance analytics products to further strengthen the company’s leadership role as a provider of advanced analytics for Big Data.

According to a recent IDC report, SAS analytics account for 35.2 percent worldwide market share, exhibiting steady growth over the past three years.  The company’s next four closest competitors combined hold only 21 percent of the market.

The new SAS products, available in June, are focused on a variety of analytic techniques, including data mining, text mining, optimization, forecasting, statistics and econometrics.


 
According to the press release, users can choose the relevant SAS offerings that will allow them to perform analytical tasks in-memory across Big Data to meet specific business challenges.

The new SAS High-Performance Analytics products help organizations overcome bottlenecks to answer the really tough questions, those problems that often involve large amounts of data,” said Henry Morris, IDC Senior Vice President of Worldwide Software and Services. “These new functionally specific analytic packages differ significantly from other in-memory products because they are broad and useful across any industry. Instead of being limited to a particular problem, or lacking real predictive capabilities, the new SAS analytics are useful for virtually any analytic purpose.”

The six new products include:

  • SAS High-Performance Statistics
  • SAS High-Performance Data Mining
  • SAS High-Performance Text Mining
  • SAS High-Performance Optimization
  • SAS High-Performance Econometrics
  • SAS High-Performance Forecasting.

The new software packages operate in massively parallel processing (MPP) environments, distributing complex analytic tasks across numerous server blades to perform computations in parallel. Because each blade has its own memory, execution is rapid; jobs that once took hours now take only minutes or even seconds.

These new offerings extend options for new configurations with Teradata, Greenplum (now part of Pivotal) and Hadoop, as well as adding Oracle support.

The announcement was made at SAS Global Forum 2013, underway this week in San Francisco.  Key sessions are being live streamed at the SAS video portal.

Read the Full Story.


Also posted in Business of Big Data | Leave a comment

Algorithm Predicts Whether Startups Will Succeed

Over at Oregon Business, Linda Baker writes that Thomas Thurston from Growth Science has created a model that accurately predicts whether or not a Startup will succeed.

How does Thurston’s model work? It’s rooted in the mountains of data he has collected on market and corporate dynamics, including the anticipation of future changes in the marketplace. Patterns of success or failure then emerge depending on these different market and business behavior factors. “The key is identifying variables that are predictive of success and failure,” says Thurston, who is very hush-hush about revealing those variables. It’s a process that involves “lots of hard, hard work,” he says. “You go through a whole haystack to find one needle.”

Read the Full Story.


Also posted in Business of Big Data, Startups | Leave a comment

Video: Real-Time Big Data Analytics from Deployment to Production

In this video from the Strata 2013 Conference, David Smith from Revolution Analytics describes the five stages of real-time analytics deployment, and the technologies supporting each stage, including Hadoop, R, and database warehousing systems. He also shares some best practices for setting up a the technology stack and processes for model deployment, based on some real-life case studies.


Also posted in Events, Software, Video | Leave a comment

Video: How CERN Handles Big Data from the LHC

This video looks at how Big Data is handled from the Large Hadron Collider (LHC).

The LHC produces millions of collisions every second in each detector, generating approximately one petabyte of data per second. None of today’s computing systems are capable of recording such rates, so sophisticated selection systems are used for a first fast electronic pre-selection, only passing one out of 10,000 events. Tens of thousands of processor cores then select 1% of the remaining events for analysis. Even after such drastic data reduction, the four big experiments, ALICE, ATLAS, CMS and LHCb, together need to store over 25 petabytes per year. The LHC data are aggregated in the CERN Data Centre, which performs initial data reconstruction is performed, and a copy is archived to long-term tape storage. Another copy is sent to several large data centres around the world.


Also posted in HPC, Research, Video | Leave a comment

ScaleOut hServer Opens Up Hadoop Analysis for Live Data

Today ScaleOut announced their new hServer, the first in a series of products from ScaleOut Software that is filling in the need for real-time analytics with Hadoop.

While it’s a powerful platform for analyzing large, static data sets, Hadoop has always been limited by its inability to perform analytics on live data,” said Bill Bain, ScaleOut Software CEO. “There is an increasing drumbeat for real-time analytics using Hadoop, and we’re excited to take an important step towards meeting that need with this release.”

ScaleOut hServer will be available in both a free community edition and in several commercial editions. The community edition enables up to a four-server combined Hadoop/hServer grid for analyzing memory-based data sets of up to 256GB. Read the Full Story or check out our podcast interview with Bill Bain.


Also posted in Business of Big Data, Podcasts | Leave a comment

Teradata Streamlines Hadoop Access for the Enterprise

Today Teradata announced that the new Enterprise Access for Hadoop and Unified Data Architecture enable business analysts to reach through Teradata directly into Hadoop to find new business value from the analysis of big, diverse data.

Today’s announcement of Teradata Enterprise Access for Hadoop is another example of our aggressive commitment to building out the Teradata Unified Data Architecture™,” said Scott Gnau, president, Teradata Labs. “Teradata Enterprise Access for Hadoop empowers organizations to dig deeply into files and data residing in Hadoop and combine the data with production business data for analyses – and action.”

Teradata Enterprise Access for Hadoop includes two new, innovative features that make access to data in Hadoop easy and secure for business analysts across the enterprise:

  • Teradata smart loader for Hadoop. For the first time, business analysts have point-and-click convenience to easily browse and move data between Teradata and Hadoop for analysis and self-service business intelligence.
  • Teradata SQL-H. The new Teradata SQL-H gives any user or application across the enterprise direct, on-the-fly access to data stored within Hadoop through standard ANSI SQL, leveraging the security, workload management, and performance of the Teradata data warehouse.

Read the Full Story or check out our RichReport Podcast interview with Scott Gnau.


Also posted in Hadoop, Hardware, Software, Storage | Leave a comment

Video: The CIA’s Grand Challenges with Big Data

In this video from the Structure: Data 2013 conference, Central Intelligence Agency CTO Ira “Gus” Hunt presents: The CIA’s Grand Challenges with Big Data.

Sensors, agents and an Internet of Things are all producing data, all of the time. It would be a vast understatement to say that the CIA has experience in acquiring, handling and analyzing big quantities of data. In this talk, the CTO of the CIA will talk about the scale of the problems his team deals with now, the coming inflection point in the increase in data, the grand challenges we face and why an emphasis on analytics is critical for the future. This is a talk not to be missed.


Also posted in Events, Public sector, Video | Leave a comment

Advertisement

NAS for Dummies Ad

View All Videos

inside-bigdata.com is a production of insideHPC, LLC. © 2011-2013 Sitemap