Discovering Gold with Big Data Analytics and Data-Intensive Computing

Entries filed under “Cloud”

Video: Cassandra Performance and Scalability on AWS

In this video, Adrian Cockcroft from Netflix presents: Cassandra Performance and Scalability on AWS.

Netflix published a Cassandra scalability benchmark in 2011 that showed linear scalability as the number of nodes in the cluster was increased from 48 to 288, reaching over a million triple-replicated writes per second. This talk will summarize a range of new benchmarks that take advantage of more powerful EC2 instances and improvements in Cassandra itself.

Recorded at the Cassandra 2012 conference in Santa Clara.


Also posted in Analytics, Cassandra, Events, Software, Video | 1 Comment

Video: Using Puppet to Maintain Hundreds of OpenStack Deployments

In this video, Christopher Aedo, Sr. Director of Technology Operations at Morphlabs presents: Using Puppet as a service to maintain hundreds of OpenStack deployments.

In this session, Morphlabs will explain how we use Puppet as a global service to monitor hundreds of disparate OpenStack deployments around the world. mCloud Optimum, Morphlabs’ global remote service, is powered by Puppet Enterprise. It maintains configurations and redefines what customers expect from a converged cloud infrastructure company.

Recorded at the PuppetConf 2012 user group event in San Francisco. See more videos from the conference at the Puppet Labs Channel.


Also posted in Software, Storage, Video | Leave a comment

Ziembicki: Times Datacenter Story on Wasted Energy is Idiotic

Microsoft Cloud Architect David Ziembicki has nothing good to say about a recent New York Times article that describes Big Datacenters as wasteful, polluting energy hogs.

The New York Times has an idiotic “analysis” of data center power efficiency on the front page today, no less. The headline, “Power, Pollution and the Internet,” should make clear where it’s going. What starts as a reasonable assertion, that many data centers waste a large amount of energy, quickly degrades into an unbelievable amount of disinformation and poor analysis.

Read the Full Story.

Ziembicki is not alone in this. Charles Babcock from InformationWeek says that the Times story misses the point and that datacenters are actually getting more efficient as a result of virtualization.


Also posted in Business of Big Data | Leave a comment

Shuttleworth Invests $1 Million in Inktank, a Ceph Storage Startup

Canonical founder Mark Shuttleworth has reportedly invested $1 million in Inktank, a startup providing enterprise-level support for the Ceph distributed storage system.

“Ceph redefines storage as an open source service, meeting one of the key challenges for enterprises and cloud providers who need scalable and cost-effective storage,” said Mark Shuttleworth, founder of Canonical. “The distinctive architecture of Ceph gives it natural advantages over both proprietary and open source competitors. We found Ceph to be mature enough to warrant its inclusion in Ubuntu 12.10 as a storage option for our OpenStack reference architecture, widely deployed in both telco and corporate environments. I’m also delighted to support the team behind Ceph in their goal of building a commercial success story around this tremendous technology, as an investor in Inktank. Today Ceph is in the Linux kernel and Ubuntu, soon it will be everywhere.”

As a long-time champion of open source and industry-transforming technologies, Shuttleworth was eager to get involved with the Ceph storage platform and its worldwide community of developers. The Ceph project has been very active in the OpenStack community alongside Canonical, and the distributed storage system was integrated into Ubuntu 12.04 LTS, released in April 2012. Ubuntu is the popular Linux desktop distribution that was created by Shuttleworth and is backed by Canonical.

Read the Full Story.


Also posted in Business of Big Data, Storage | 1 Comment

Big Data has a Date with the Cloud

Over at Forbes, Holger Kisker from Forrester Research writes that Big Data has a date with the Cloud.

There are three good reasons why (depending on the use case scenario) big data can make a lot of sense in the cloud:

  1. Big data requires a spectrum of advanced technologies, skills, and investments. Do you really need/want this all in-house?
  2. Big data includes huge amounts of external data. Does it make sense to move and manage all this data behind your firewall?
  3. Big data needs a lot of data services. Focus on the value of your differentiated data analysis instead of big data management.

Read the Full Story.


Also posted in Business of Big Data | Leave a comment

Tap into Big Data with Google BigQuery on Demand

Ian Hardenburgh writes that Google’s BigQuery service allows you to run SQL-like queries against large datasets on a subscription basis.

Google BigQuery is essentially an on-demand big data storage and querying service. One can store as much data as he/she feels necessary, and only have to pay for what he/she uses (to the extent of certain storage limits). Furthermore, one can scale his/her data to the amount of hundreds of terabytes of data, with no additional management needed.

Read the Full Story.


Also posted in Software | Leave a comment

Slidecast: Garantia Data – The in-memory NoSQL Cloud

In this slidecast, Ofer Bengal from Garantia Data presents an overview of the company’s fully-automated cloud service for hosting Memcached and Redis.

The Garantia Data Memcached Cloud was built from the ground up over a true Memcached cluster architecture and with a highly reliable infrastructure. We distribute your dataset across multiple shards in multiple nodes of our Memcached cluster and constantly monitor your shards to ensure optimal performance. When needed, we add more shards and nodes to your dataset so it can continuously and limitlessly scale. We have also added to our Memcached cluster replication, data persistence, backup and auto-failover capabilities that guarantee your dataset is always up and running.
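
The idea of spreading a dataset across shards that live on multiple nodes can be illustrated with a minimal sketch. This is not Garantia Data's implementation, which the slidecast does not detail; the function names, the MD5-based hashing, and the round-robin placement below are all assumptions chosen for illustration.

```python
import hashlib
from typing import Dict, List


def shard_for_key(key: str, num_shards: int) -> int:
    """Map a cache key to a shard index with a stable hash (illustrative scheme)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    # Use the first 8 bytes as an integer so the result is stable across runs.
    return int.from_bytes(digest[:8], "big") % num_shards


def assign_shards_to_nodes(num_shards: int, nodes: List[str]) -> Dict[int, str]:
    """Place shards on nodes round-robin (hypothetical placement policy)."""
    return {shard: nodes[shard % len(nodes)] for shard in range(num_shards)}


def node_for_key(key: str, num_shards: int, nodes: List[str]) -> str:
    """Find the node that currently hosts the shard owning this key."""
    placement = assign_shards_to_nodes(num_shards, nodes)
    return placement[shard_for_key(key, num_shards)]


if __name__ == "__main__":
    nodes = ["node-a", "node-b", "node-c"]  # hypothetical node names
    for key in ("user:42", "session:abc", "cart:7"):
        print(key, "-> shard", shard_for_key(key, 16), "on", node_for_key(key, 16, nodes))
```

In this sketch the key-to-shard mapping depends only on the shard count, so adding a node changes where shards are placed without changing which shard a key belongs to. Growing the number of shards themselves would remap keys under plain modulo hashing; a real service would need a scheme such as consistent hashing or explicit shard splitting for that.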

Download the MP3 * Subscribe on iTunes * If Dropbox is blocked, download from this Google page.


Also posted in Podcasts, Software, Video | Leave a comment

Panel: Marrying Big Data and Cloud

In this video, Jeff Kaplan from ThinkStrategies leads a panel discussion on Big Data. Recorded at the All About the Cloud summit 2012 in Mountain View, California.

 

Data, data, data! Data is everywhere, yet very few companies are using that data to gain an edge on their competitors. We are no longer referring only to internal data; in our increasingly social world, we have many additional data pools to leverage. The combination of internal data with these external data pools gives companies an opportunity to engage customers, partners, and employees more efficiently. More and more companies are leveraging data-driven strategies to compete and innovate. This session will explore the opportunities available with Big Data stacks. How are companies implementing these Big Data stacks? What are the barriers to becoming a more data-driven company and how do you overcome those hurdles?

 

Panelists:

  • Chris Baker, Senior Vice President, Global ISV/OEM Sales, Oracle Corporation
  • Anjul Bhambhri, Vice President, Big Data, IBM Corporation
  • Shailesh Rao, Director, New Products & Solutions, Google Enterprise

Also posted in Analytics, Business of Big Data, Events, Video | Leave a comment

Slidecast: Top Benefits of Cloud Databases

In this slidecast, Karen Tegan Padir from EnterpriseDB presents: Top Benefits of Cloud Databases.

PostgreSQL is the #1 enterprise-class open source database with a feature set comparable to the major proprietary RDBMS vendors and a customer list that spans every industry. EnterpriseDB’s Postgres Plus solutions let you confidently develop and deploy PostgreSQL-backed applications that scale all the way from embedded solutions to massive OLTP and data warehouse systems that serve thousands of users.

Download the MP3 * Subscribe on iTunes * If Dropbox is blocked, download from this Google page.


Also posted in Data Mining, Podcasts, Video | Leave a comment

Yahoo Genome Brings Big Data Service to the Cloud

Yahoo is now offering big data analytics as a service with its new Genome offering, which is designed to let companies deliver highly targeted online advertising and marketing campaigns.

With the overwhelming amount of Big Data, businesses struggle to transform consumer information into insights and action. Genome from Yahoo! utilizes the most advanced campaign and user analytics to understand consumer needs, anticipate audiences’ future performance, and enable you to develop efficient media buys.

Read the Full Story.


Also posted in Analytics, Business of Big Data | Leave a comment

inside-bigdata.com is a production of insideHPC, LLC. © 2011-2013