<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Blueroo Interactive &#187; Uncategorized</title>
	<atom:link href="http://blueroo-interactive.com/category/uncategorized/feed/" rel="self" type="application/rss+xml" />
	<link>http://blueroo-interactive.com</link>
	<description>dont let your website get lost in cyberspace</description>
	<lastBuildDate>Mon, 25 Apr 2011 04:22:43 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>MapReduce: Now Google Invents New Ways to Manage Data!</title>
		<link>http://blueroo-interactive.com/uncategorized/google-data-basenow-google-invents-new-ways-to-manage-data/</link>
		<comments>http://blueroo-interactive.com/uncategorized/google-data-basenow-google-invents-new-ways-to-manage-data/#comments</comments>
		<pubDate>Wed, 03 Feb 2010 05:11:55 +0000</pubDate>
		<dc:creator>BlueRoo</dc:creator>
				<category><![CDATA[Search Engine Optimization]]></category>
		<category><![CDATA[Search Marketing News]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[blended search]]></category>
		<category><![CDATA[data storage]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google base data]]></category>
		<category><![CDATA[Google data]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[search engine]]></category>
		<category><![CDATA[search evolution]]></category>

		<guid isPermaLink="false">http://blueroo-interactive.com/?p=1135</guid>
		<description><![CDATA[Once opon a time, if you wished to make order of large sets of data, you would need to do two things: Firstly, create a meticulously manganed and maintained database, using tags and catagories as data land-marks. Secondly you would need a very large computer, to sift through your data using complex query.

This is all [...]]]></description>
			<content:encoded><![CDATA[<p>Once opon a time, if you wished to make order of large sets of data, you would need to do two things: Firstly, create a meticulously manganed and maintained database, using tags and catagories as data land-marks. Secondly you would need a very large computer, to sift through your data using complex query.</p>
<p><span id="more-1135"></span></p>
<p>This is all fine, untill your data grows in size to petabyte scale, then old way simply isn&#8217;t feasible. Tagging, sorting, and categorizing, would take an emmence amount of time. A single computer, no matter how large, just can&#8217;t crunch that many numbers at once.</p>
<div id="attachment_1134" class="wp-caption alignleft" style="width: 577px"><a href="http://blueroo-interactive.com/wp-content/uploads/2010/02/pb_sorting_f.jpg"><img class="size-full wp-image-1134" title="Google Data Structure" src="http://blueroo-interactive.com/wp-content/uploads/2010/02/pb_sorting_f.jpg" alt="Google Data Structure" width="567" height="296" /></a><p class="wp-caption-text">Google Data Structure</p></div>
<p>Google use a very different approach, when sifting and ordering the world wide web. Thier solution for working with colossal data sets, is an approach called <strong>MapReduce</strong>.</p>
<p>It works like this:</p>
<h3>1. Collect</h3>
<p>MapReduce does not depend on traditional databases, where information is collected then categorized. We&#8217;ll just gather up the full text of every book Google has scanned.</p>
<h3>2. Map</h3>
<p>You then write a function to map the data: &#8220;Count every use of every word in Google Books.&#8221; The request then splits among all the computers within your army, each is assigned a chunk of data to work with.</p>
<h3>3. Save</h3>
<p>Each PC doing a map, writes the results to its local hard drive, cutting down on data transfer time. Then the computers that have been assigned a &#8220;reduce&#8221; function grab the lists from the mappers.</p>
<h4>4. Reduce</h4>
<p>Then the Reduce computers correlate the lists of words. Now you would know the frequency of a particular word that is used, and in which books.</p>
<h5>5. Solve</h5>
<p>The system finally creates a data set about your data! In my example, the final list of words is stored as separate sets, so it can be quickly referenced or queried. So then you don&#8217;t have to plow through unrelated data to get your answer.</p>
<p>Please drop us a comment, and share our lovely and insightfull post&#8230;thanks! <img src='http://blueroo-interactive.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
         ]]></content:encoded>
			<wfw:commentRss>http://blueroo-interactive.com/uncategorized/google-data-basenow-google-invents-new-ways-to-manage-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

