Case Study: IDEXX Scales Business Using Hadoop

Big data storageBIG DATA CASE STUDY

As a compelling case study illustrating the power and cost-effectiveness of Apache Hadoop, IDEXX Laboratories, Inc., a provider of diagnostics and information technology solutions for animal health, is using the MapR Distribution for Hadoop via Amazon Web Services (AWS) to flexibly scale its business at lower cost, and gain access to critical customer data instantly for rapid response times.

IDEXX aggregates and analyzes data from participating practices and offers a free industry benchmark report to the participating practices. IDEXX also utilizes the aggregated and de-identified data to create syndicated reports for pharmaceutical and nutrition companies to identify industry trends.

Looking for a system that would scale
As the IDEXX businesses continued to grow, its primary data store, a relational database hosted on Amazon Web Services (AWS), could not keep pace. The growing size of this database meant that daily jobs to aggregate and summarize data were taking too long to run and consuming database resources that were impacting online operations. The new solution had to be compatible with existing systems, which included the AWS infrastructure and Java 7.

Our primary reason for choosing MapR M3 on Amazon Elastic Compute Cloud (Amazon EC2) was the ability to run Hadoop under Java 7 against Java 7 compiled applications,” said Terry Schutte, IDEXX senior systems administrator for software R&D. “It was hard to find support for Java 7 in the MapReduce ecosystem. MapR’s performance and architectural improvements stood out.  From an operational perspective, MapR is easier to use than other distributions we tested and higher performing, and we benefit from the optimizations made to Hadoop.”

Increased flexibility, immediate access to data, ease of experimentation

IDEXX has realized multiple benefits from its MapR and AWS solution, including increased flexibility and control to scale its business, faster customer response times, the ability to retain all its data and ease of experimentation, and support for additional lines of business.

By running MapR clusters on EC2, we retain full control over the configuration and operations of the MapR cluster, while continuing to have the benefits of EC2 hosting which means no capital expenditures and the flexibility to scale our environment based on demand. We pay only for capacity that we use and it lets us easily scale to support growth of the business,” said Schutte.

The new MapR/AWS solution improves the company’s ability to respond to customer requests. For example, if a customer asks a specific question related to the marketplace or trends, IDEXX can respond immediately. The new system removes prior constraints on how much data IDEXX can store and process.

 

 

 

 

 

Resource Links: