Discovering Gold with Big Data Analytics and Data-Intensive Computing

Entries filed under “Graph Computing”

YarcData Launches uRIKA Big Data Appliance

Cray launched its new YarcData division only a few weeks ago, but in a surprise move today, the company announced its first Big Data appliance and a bevy of beta customers. Available today, the YarcData uRiKA graph appliance beging called a purpose-built solution for Big Data relationship analytics.

The uRiKA graph appliance fills an unmet need in the rapidly-growing Big Data market. While many critical Big Data problems are based on graphs, most current Big Data solutions are based on partitioned data structures that scale out on clusters. Current Big Data approaches, including graph databases, result in low performance on graphs since graphs are hard to partition across cluster nodes, are non-deterministic, and are highly dynamic. The launch of the uRiKA solution addresses the challenge of delivering insightful analytics on graphs, not only in terms of its ability to handle size and complexity of relationships, but also in terms of its response time and speed of processing.

Ok, so what exactly is inside this appliance, you might ask? With “massively-multithreaded graph processors supporting 128 threads/processor, and highly scalable I/O with data ingest rates of up to 350 terabytes per hour,” it looks to be a spinoff of the Cray XMT, the third generation of the Cray MTA supercomputer architecture originally developed by Tera. The Cray XMT has vanished from the company site, so the company is now down to three compute product lines: Cray XK6 (AMD/Nvidia hybrid), Cray XE6 (AMD), and now uRIKA.

Early adopters for the uRiKA graph appliance include the Institute of Systems Biology (ISB), Mayo Clinic, Noblis, Swiss CSCS, and an unamed US government organization. Read the Full Story.


Also posted in Hardware | Leave a comment

Whitepaper: Hybrid-core: The “Big Data” Computing Architecture

Convey Computer is out with a new whitepaper that explains how their hybrid-core architecture is well suited to Big Data and Graph Computing:

As we have reviewed, solving graph problems takes a different approach to computing. One such approach is the Convey HC (hybrid-core) family of computer systems. The Convey systems offer a balanced architecture: reconfigurable (via Field Programmable Gate Arrays—FPGAs) compute elements, and a supercomputing-inspired memory subsystem (Figure 3).Figure 3. Overview of the Convey hybrid-core computing architecture.The benefit of hybrid-core computing is that the compute-intensive kernel of the Graph500 breadth-first search is implemented in hardware on the FPGAs in the coprocessor. The FPGA implementation allows much more parallelism than a commodity system (the Convey memory subsystem allows up to 8,192 outstanding concurrent memory references). The increase in parallelism combined with the hardware implementation of the logic portions of the algorithm allow for increased overall performance with much less hardware.

Download the PDF on this summary page.

Coming to SC11? This year, the Convey exhibit will include a ‘Graph Corner’ where you learn about the Graph500 benchmark and the company’s GraphConstructor. In addition, Bob Masson and Kirby Collins will present: “Heterogeneous Computing Architecture Supporting Applications in Data-intensive Sciences” on Thursday, November 17th, at 2:30 p.m. as part of the SC11 Exhibitor Forum.


Also posted in HPC, Whitepapers | 1 Comment

Interview: Russian Supercomputer Outperforms All Others on Graph500

This week T-Platforms announced that Russia’s most powerful supercomputer at Moscow State University outperformed all competitors during the recent Graph500 benchmark tests. To learn more, I caught up with Anton Korzh, a systems architect at T-Platforms.

insideHPC: How is it that computer system from T-Platforms installed at the M.V. Lomonosov Moscow State University was able to achieve performance leadership in the Graph500 test?

Korzh: The results shown by the ”Lomonosov” system in the Graph500 test are primarily evidence of the versatility of the T-Platforms systems in various modes. This test is designed to identify the actual system performance when processing large amounts of data, where the outcome is influenced not only, and not even so much, by processor power, but by access time and data processing speed. Therefore, the only solutions able to show outstanding results in this test will optimally combine CPU speed, system interconnect, memory performance, and many other factors. An example of such a system is the “Lomonosov” supercomputer, the power of which, as a result of recent modernization, has been increased to a record-breaking for Russia 1.3Pflops.

insideHPC: Is T-Platforms solutions architecture optimized for handling of large volumes of data?

Korzh: Our solutions architecture is best suited to address a broad range of tasks, including, as shown by the Graph500 test results, processing of large data sets. For example, a lot of scientific groups are now solving different problems using the “Lomonosov” supercomputer complex: from the development of new nanomaterials and modeling protein molecule structure to analysis of global climate processes on the planet. Some of these tasks require a lot of computing power from processors and graphics cards, and others require processing of huge amounts of data, where the processor speed is secondary. In our work, we pay great attention not only to technology excellence, but also to the versatility of our solutions. Therefore, the systems from T-Platforms demonstrate consistently high performances in different tests.

insideHPC: Over the past year, we have seen leadership of Asian computing systems in the Top500 rating, but there was a Russian system that won the latest Graph500 test. Will, in your opinion, the trend of winning high positions in such benchmarks by Russian computer systems continue?

Korzh: If we analyze the Top500 ratings of recent years, the trend of strengthening Russia’s position in the global supercomputer industry becomes apparent. Our solutions are repeatedly included in the first hundred of the Top500 List, and the “Lomonosov” computer system, after a recent upgrade, has risen from the 17th to 13th position in the rating. Today, we are developing technologies that in the foreseeable future will be the basis of new, more powerful computing systems. So there is every reason to expect that domestic systems will continue to show ever-increasing performance, ranking higher and higher in the Top500 List. Leadership in the Top500 ranking is determined by the capacity of the system, which depends on the amount of funds invested in it. Thus, the Asian leadership in the Top500 List is determined only by the large amounts of funding allocated. By some estimates, creation of the Japanese K computer cost about one billion U.S. dollars. At the same time, the leadership in Graph500 can be achieved without such large investments – new original ideas, as well as new models and programming paradigms can give a huge effect, as we have seen in “Lomonosov’s” Graph500 results.

insideHPC: How much do you think the benchmark system used by the Graph500 creators influences the development of the supercomputer industry?

Korzh: This test conveniently illustrates the effectiveness of current systems for processing o data sets. It allows developers to optimize their solutions for such problems and increase their flexibility. In addition, the possible results of this test will contribute to the emergence of highly specialized systems focusing on high-speed data processing.

insideHPC: Big Data is said to be the next frontier of Cloud Computing and HPC. Do you agree with that? What is T-Platforms doing to optimize its position in this area going forward?

Korzh: Perhaps, this will really happen in the future, but so far we have not observed the obvious trends towards this.  At present, HPC, Cloud Computing and Parallel Computing lines are developing independently, complementing each other in some areas.


Also posted in Hardware | Leave a comment

Supercomputing the Semantic Web

The Semantic Web is a group of methods to allow machines to understand the meaning – or “semantics” – of information on the Internet. And while high performance computing has largely moved on to massively parallel clusters, there are still problems like the semantic web that just don’t map well onto distributed architectures.

As described in this SemanticWeb article by Paul Miler, the mulithreading Cray XMT is tailor-made to solve the data-intensive problems of this semantic web.

Everything about the hardware is optimised to churn through large quantities of data, very quickly, with vital statistics that soon become silly. A single processor “can sustain 128 simultaneous threads and is connected with up to 8 GB of memory.” The Cray XMT comes with at least 16 of those processors, and can scale to over 8,000 of them in order to handle over 1 million simultaneous threads with 64 TB of shared system memory. Should you want to, you could easily hold the entire Linked Data Cloud in main memory for rapid analysis without the usual performance bottleneck introduced by swapping data on and off disks.

A descendant of the multithreading MTA architecture invented by Burton Smith at Tera Computer, the Cray XMT uses custom chips that plug into AMD HyperTransport slots. In this way, the Cray XMT is a clever application of custom engineering that leverages economies of scale.

When I started my career back in the 80′s, monolithic, big Cray systems were the only game in town when you wanted to crunch big data. Now that notion seems to have come full circle. Graph computing may just have found the right tool at the right time.


Also posted in HPC | Leave a comment

View All Videos

inside-bigdata.com is a production of insideHPC, LLC. © 2011-2013 Sitemap