<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Inside-BigData &#187; MapReduce</title>
	<atom:link href="http://inside-bigdata.com/category/mapreduce/feed/" rel="self" type="application/rss+xml" />
	<link>http://inside-bigdata.com</link>
	<description>Discovering Gold with Big Data Analytics and Data-Intensive Computing</description>
	<lastBuildDate>Fri, 24 May 2013 15:43:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.1</generator>
		<item>
		<title>Radio Free HPC Fireside Chat &#8211; HPC Embraces Big Data</title>
		<link>http://inside-bigdata.com/radio-free-hpc-fireside-chat-hpc-embraces-big-data/</link>
		<comments>http://inside-bigdata.com/radio-free-hpc-fireside-chat-hpc-embraces-big-data/#comments</comments>
		<pubDate>Wed, 08 May 2013 13:43:54 +0000</pubDate>
		<dc:creator>Rich</dc:creator>
				<category><![CDATA[Business of Big Data]]></category>
		<category><![CDATA[Graph Computing]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Podcasts]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://inside-bigdata.com/?p=2941</guid>
		<description><![CDATA[<p>In this slidecast, the Radio Free HPC team interviews Fritz Ferstl, CTO of Univa. Topics include Big Data, HPC, and the continuing convergence of both. While what we think of as traditional HPC may differ greatly from Big Data analytics, that seems to be changing. With a long history in high performance computing and customers [...]</p><p>The post <a href="http://inside-bigdata.com/radio-free-hpc-fireside-chat-hpc-embraces-big-data/">Radio Free HPC Fireside Chat &#8211; HPC Embraces Big Data</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></description>
			<content:encoded><![CDATA[<p><iframe width="510" height="287" src="http://www.youtube.com/embed/mK5IV4I3uKM?rel=0" frameborder="0" allowfullscreen></iframe></p>
<p><a href="http://www.univa.com/about/management.php"><img alt="" src="http://www.isc-events.com/cloud10/var/cloud/storage/images/media/images/ferstl_95x98px/5577-1-eng-GB/Ferstl_95x98px.jpg" title="Fritz Ferstl" class="alignright" width="95" height="98" /></a>In this slidecast, the Radio Free HPC team interviews Fritz Ferstl, CTO of <a href="http://univa.com">Univa</a>. Topics include Big Data, HPC, and the continuing convergence of both.</p>
<blockquote><p>While what we think of as traditional HPC may differ greatly from Big Data analytics, that seems to be changing. With a long history in high performance computing and customers in both worlds, Ferstl shares his unique perspective on where the two worlds overlap and where the potential is greatest for synergy in the future.</p></blockquote>
<p>This has to be our best show yet, so be sure to check it out.</p>
<p><a href="http://slidesha.re/109YEDx">View the slides on Slideshare</a> * <a href="http://bit.ly/VpbZIf">Download the MP3</a> * <a href="http://bit.ly/18sFTPg">Download the mobile video</a> * <a href="http://bit.ly/13vmcmA">Download 1024p Video</a> * <a href="http://bit.ly/WgEZzd">Subscribe on iTunes</a> * <a title="RSS" href="http://bit.ly/QXKy3V">RSS Feed</a></p>
<br /><div class="linkedInShareButton"><script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://inside-bigdata.com/radio-free-hpc-fireside-chat-hpc-embraces-big-data/"></script></div><div class="ad" style="padding-top: 10px; border-top: 1px dotted gray; padding-bottom: 5px; font-size: .95em;">&nbsp;</div><p>The post <a href="http://inside-bigdata.com/radio-free-hpc-fireside-chat-hpc-embraces-big-data/">Radio Free HPC Fireside Chat &#8211; HPC Embraces Big Data</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://inside-bigdata.com/radio-free-hpc-fireside-chat-hpc-embraces-big-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Video: Building Big Data Pipelines with OSS</title>
		<link>http://inside-bigdata.com/video-building-big-data-pipelines-with-oss/</link>
		<comments>http://inside-bigdata.com/video-building-big-data-pipelines-with-oss/#comments</comments>
		<pubDate>Sun, 24 Feb 2013 16:33:40 +0000</pubDate>
		<dc:creator>Rich</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Video]]></category>

		<guid isPermaLink="false">http://inside-bigdata.com/?p=2405</guid>
		<description><![CDATA[<p>In this video from the SpringOne 2012 event, Costin Leau presents: Building Big Data Pipelines with OSS. Hadoop is not an island. To deliver a complete Big Data solution, a data pipeline needs to be developed that incorporates and orchestrates many diverse technologies. A Hadoop focused data pipeline not only needs to coordinate the running [...]</p><p>The post <a href="http://inside-bigdata.com/video-building-big-data-pipelines-with-oss/">Video: Building Big Data Pipelines with OSS</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></description>
			<content:encoded><![CDATA[<p><iframe width="510" height="287" src="http://www.youtube.com/embed/6Tg8vdXkqD4?rel=0" frameborder="0" allowfullscreen></iframe></p>
<p>In this video from the <a href="http://www.springone2gx.com/">SpringOne 2012</a> event, <a href="http://www.springone2gx.com/conference/washington/2012/10/speakers/costin_leau">Costin Leau</a> presents: <em>Building Big Data Pipelines with OSS</em>.</p>
<blockquote><p>Hadoop is not an island. To deliver a complete Big Data solution, a data pipeline needs to be developed that incorporates and orchestrates many diverse technologies. A Hadoop focused data pipeline not only needs to coordinate the running of multiple Hadoop jobs (MapReduce, Hive, Pig or Cascading), but also encompass real-time data acquisition and the analysis of reduced data sets extracted into relational/NoSQL databases or dedicated analytical engines.</p></blockquote>
<br /><div class="linkedInShareButton"><script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://inside-bigdata.com/video-building-big-data-pipelines-with-oss/"></script></div><div class="ad" style="padding-top: 10px; border-top: 1px dotted gray; padding-bottom: 5px; font-size: .95em;">&nbsp;</div><p>The post <a href="http://inside-bigdata.com/video-building-big-data-pipelines-with-oss/">Video: Building Big Data Pipelines with OSS</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://inside-bigdata.com/video-building-big-data-pipelines-with-oss/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Paper: A Map-Reduce-Like System for Emerging Parallel Architectures</title>
		<link>http://inside-bigdata.com/paper-a-map-reduce-like-system-for-emerging-parallel-architectures/</link>
		<comments>http://inside-bigdata.com/paper-a-map-reduce-like-system-for-emerging-parallel-architectures/#comments</comments>
		<pubDate>Sat, 01 Sep 2012 16:14:39 +0000</pubDate>
		<dc:creator>Rich</dc:creator>
				<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[MapReduce]]></category>

		<guid isPermaLink="false">http://inside-bigdata.com/?p=1855</guid>
		<description><![CDATA[<p>Can MapReduce be used as an effective means of processing data-intensive HPC workloads? In his dissertation from Ohio State University, Wei Jiang writes that one first needs to overcome with performance scaling, fault tolerance, and GPU acceleration support. We performed a comparative study showing that the map-reduce processing style could cause signiﬁcant overheads for a [...]</p><p>The post <a href="http://inside-bigdata.com/paper-a-map-reduce-like-system-for-emerging-parallel-architectures/">Paper: A Map-Reduce-Like System for Emerging Parallel Architectures</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></description>
			<content:encoded><![CDATA[<p><img alt="" src="http://bc.tech.coop/blog/images/mapreduce-beer.jpg" title="MapReduce" class="alignright" width="191" height="136" />Can <a href="http://en.wikipedia.org/wiki/MapReduce">MapReduce</a> be used as an effective means of processing data-intensive HPC workloads? In his dissertation from Ohio State University, Wei Jiang writes that one first needs to overcome with performance scaling, fault tolerance, and GPU acceleration support.</p>
<blockquote><p>We performed a comparative study showing that the map-reduce processing style could cause signiﬁcant overheads for a set of data mining applications. Based on the observation, we developed a map-reduce system with an alternate API (MATE) using a user-declaredreduction-object to be able to further improve the performance of map-reduce programs in multi-core environments. To address the limitation in MATE that the reduction object must ﬁt in memory, we extended the MATE system to support the reduction object ofarbitrary sizes in distributed environments and apply it to a set of graph mining applications, obtaining better performance than the original graph mining library based on map-reduce.</p></blockquote>
<p><a href="http://etd.ohiolink.edu/send-pdf.cgi/Jiang%20Wei.pdf?osu1343677821">Download the paper (PDF)</a>.</p>
<br /><div class="linkedInShareButton"><script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://inside-bigdata.com/paper-a-map-reduce-like-system-for-emerging-parallel-architectures/"></script></div><div class="ad" style="padding-top: 10px; border-top: 1px dotted gray; padding-bottom: 5px; font-size: .95em;">&nbsp;</div><p>The post <a href="http://inside-bigdata.com/paper-a-map-reduce-like-system-for-emerging-parallel-architectures/">Paper: A Map-Reduce-Like System for Emerging Parallel Architectures</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://inside-bigdata.com/paper-a-map-reduce-like-system-for-emerging-parallel-architectures/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is It Time for CDOs (Chief Data Officers)?</title>
		<link>http://inside-bigdata.com/is-it-time-for-cdos-chief-data-officers/</link>
		<comments>http://inside-bigdata.com/is-it-time-for-cdos-chief-data-officers/#comments</comments>
		<pubDate>Sat, 08 Oct 2011 13:00:00 +0000</pubDate>
		<dc:creator>Ralph</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Business of Big Data]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[MapReduce]]></category>

		<guid isPermaLink="false">http://inside-bigdata.com/?p=594</guid>
		<description><![CDATA[<p>Michael Vizard explains that current IT culture is used to giving people access to only a finite amount of data. But new data management frameworks such as MapReduce and Hadoop make it possible to cost-effectively analyze large amounts of data. Many IT organizations don’t have the skills in place to master those technologies. This gap [...]</p><p>The post <a href="http://inside-bigdata.com/is-it-time-for-cdos-chief-data-officers/">Is It Time for CDOs (Chief Data Officers)?</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.crunchbase.com/person/michael-vizard"><img class="alignright" title="Michael Vizard" src="http://ww1.prweb.com/prfiles/2009/09/15/176351/gI_0_1_MVizard.jpg" alt="" width="167" height="250" /></a>Michael Vizard explains that current IT culture is used to  giving people access to only a finite amount of data. But new data  management frameworks such as MapReduce and Hadoop make it possible to cost-effectively analyze large amounts of data. Many IT organizations don’t have the skills in place to master those  technologies. This gap between the IT skills at hand and the desires of the business   community is starting to create some tension, which could be resolved   with the appointment of someone who will function as chief data scientist or officer.</p>
<blockquote><p>One might argue that because chief information officers are  theoretically in charge of information, this task would fall under their  purview. But there is a world of difference between managing data and  understanding the business value of that data; hence the need for a new  class of business data specialists.</p></blockquote>
<p>Read the <a href="http://www.itbusinessedge.com/cm/blogs/vizard/changing-the-analytics-culture/?cs=48737">Full Story</a></p>
<br /><div class="linkedInShareButton"><script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://inside-bigdata.com/is-it-time-for-cdos-chief-data-officers/"></script></div><div class="ad" style="padding-top: 10px; border-top: 1px dotted gray; padding-bottom: 5px; font-size: .95em;">&nbsp;</div><p>The post <a href="http://inside-bigdata.com/is-it-time-for-cdos-chief-data-officers/">Is It Time for CDOs (Chief Data Officers)?</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://inside-bigdata.com/is-it-time-for-cdos-chief-data-officers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Platform MapReduce Crosses Big Data and High-Performance Computing</title>
		<link>http://inside-bigdata.com/platform-mapreduce-crosses-big-data-and-high-performance-computing/</link>
		<comments>http://inside-bigdata.com/platform-mapreduce-crosses-big-data-and-high-performance-computing/#comments</comments>
		<pubDate>Sat, 27 Aug 2011 15:38:48 +0000</pubDate>
		<dc:creator>Rich</dc:creator>
				<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://inside-bigdata.com/?p=26</guid>
		<description><![CDATA[<p>On June 29, 2011, Platform Computing Platform announced the availability of Platform MapReduce, the industry’s first enterprise-class, distributed runtime engine for MapReduce applications. Built on the company’s core technologies, LSF and Symphony, Platform MapReduce enables businesses to focus on moving MapReduce applications into production by providing enterprise-class manageability and scale, high resource utilization and availability, [...]</p><p>The post <a href="http://inside-bigdata.com/platform-mapreduce-crosses-big-data-and-high-performance-computing/">Platform MapReduce Crosses Big Data and High-Performance Computing</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.platform.com/press-releases/2011/PlatformBringsMapReducetotheEnterprise"><img class="alignright" title="Platform logo" src="http://www.platform.com/press-releases/2011/PlatformBringsMapReducetotheEnterprise/++resource++platformstyle.images/logo.gif" alt="" width="118" height="53" /></a>On June 29, 2011, Platform Computing Platform announced the availability of Platform MapReduce, the industry’s first enterprise-class, distributed runtime engine for MapReduce applications. Built on the company’s core technologies, LSF and Symphony, Platform MapReduce enables businesses to focus on moving MapReduce applications into production by providing enterprise-class manageability and scale, high resource utilization and availability, ease of operation, multiple application support and an open distributed file system architecture, including immediate support for Hadoop Distributed File System (HDFS) and Appistry Cloud IQ.</p>
<p style="padding-left: 30px;"><em>“High-Performance Analytics – a SAS specialty – happens at the intersection of Big Data and High-Performance Computing. Our mutual customers have benefited from Platform’s expertise and unique capabilities to manage and support these complex, distributed clusters,” said Paul Kent, SAS Vice President of Platform Research and Development. “Platform MapReduce is a welcome addition to the rapidly evolving Hadoop ecosystem. Platform Computing can play a critical role in the evolution and adoption of Hadoop in the Enterprise.”</em></p>
<p>Read the <a href="http://www.platform.com/press-releases/2011/PlatformBringsMapReducetotheEnterprise">Full Story</a>.</p>
<br /><div class="linkedInShareButton"><script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://inside-bigdata.com/platform-mapreduce-crosses-big-data-and-high-performance-computing/"></script></div><div class="ad" style="padding-top: 10px; border-top: 1px dotted gray; padding-bottom: 5px; font-size: .95em;">&nbsp;</div><p>The post <a href="http://inside-bigdata.com/platform-mapreduce-crosses-big-data-and-high-performance-computing/">Platform MapReduce Crosses Big Data and High-Performance Computing</a> appeared first on <a href="http://inside-bigdata.com">Inside-BigData</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://inside-bigdata.com/platform-mapreduce-crosses-big-data-and-high-performance-computing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
