<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Analytics Team &#187; data warehousing</title>
	<atom:link href="http://www.analyticsteam.com/category/data-warehousing/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.analyticsteam.com</link>
	<description>Predictive analytics, data mining, business intelligence and more.  Information useful to analysts and data people of all kinds.</description>
	<lastBuildDate>Mon, 06 Feb 2012 05:07:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Managing mountains of data</title>
		<link>http://www.analyticsteam.com/2011/10/18/managing-mountains-of-data/</link>
		<comments>http://www.analyticsteam.com/2011/10/18/managing-mountains-of-data/#comments</comments>
		<pubDate>Wed, 19 Oct 2011 03:58:38 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[computer world]]></category>
		<category><![CDATA[library of congress]]></category>
		<category><![CDATA[mazda]]></category>
		<category><![CDATA[nielsen]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=1240</guid>
		<description><![CDATA[Check out this Computer World article. They&#8217;ve got some stories about the following companies/organizations and their usage of data: Library of Congress, Amazon.com, Mazda Motor Corp., and The Nielsen Company.]]></description>
			<content:encoded><![CDATA[<p>Check out this <a href="http://www.computerworld.com/s/article/9220504/Really_big_data_The_challenges_of_managing_mountains_of_information?taxonomyId=9&#038;pageNumber=2" title="Computer World" target="_blank">Computer World</a> article.  They&#8217;ve got some stories about the following companies/organizations and their usage of data:</p>
<ul>
<li>Library of Congress</li>
<li>Amazon.com</li>
<li>Mazda Motor Corp.</li>
<li>The Nielsen Company</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2011/10/18/managing-mountains-of-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Orbitz chooses cloud-based data warehouse solution</title>
		<link>http://www.analyticsteam.com/2011/10/11/orbitz-chooses-cloud-based-data-warehouse-solution/</link>
		<comments>http://www.analyticsteam.com/2011/10/11/orbitz-chooses-cloud-based-data-warehouse-solution/#comments</comments>
		<pubDate>Wed, 12 Oct 2011 04:26:02 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[kognitio]]></category>
		<category><![CDATA[orbitz]]></category>
		<category><![CDATA[slides]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=1197</guid>
		<description><![CDATA[Travel site Orbitz has outsourced part of its data infrastructure to Kognitio. Check out the GigaOm article. Orbitz is also a big user of Hadoop, as shown in these slides: Hadoop and Hive at Orbitz, Hadoop World 2010 View more presentations from Jonathan Seidman]]></description>
			<content:encoded><![CDATA[<p>Travel site Orbitz has outsourced part of its data infrastructure to <a href="http://www.kognitio.com/" title="Kognitio" target="_blank">Kognitio</a>.  Check out the <a href="http://gigaom.com/cloud/orbitz-outsources-analytics-to-the-cloud/" title="GigaOm" target="_blank">GigaOm</a> article.  Orbitz is also a big user of Hadoop, as shown in these slides:</p>
<center>
<div style="width:425px" id="__ss_5867188"> <strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/jseidman/hadoop-and-hive-at-orbitz" title="Hadoop and Hive at Orbitz, Hadoop World 2010" target="_blank">Hadoop and Hive at Orbitz, Hadoop World 2010</a></strong> <iframe src="http://www.slideshare.net/slideshow/embed_code/5867188" width="425" height="355" frameborder="0" marginwidth="0" marginheight="0" scrolling="no"></iframe>
<div style="padding:5px 0 12px"> View more <a href="http://www.slideshare.net/" target="_blank">presentations</a> from <a href="http://www.slideshare.net/jseidman" target="_blank">Jonathan Seidman</a> </div>
</div>
</center>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2011/10/11/orbitz-chooses-cloud-based-data-warehouse-solution/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM builds 120PB cluster of 200,000 hard disks</title>
		<link>http://www.analyticsteam.com/2011/09/03/ibm-builds-120pb-cluster-of-200000-hard-disks/</link>
		<comments>http://www.analyticsteam.com/2011/09/03/ibm-builds-120pb-cluster-of-200000-hard-disks/#comments</comments>
		<pubDate>Sun, 04 Sep 2011 02:23:08 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[vendors]]></category>
		<category><![CDATA[extremetech]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[o'reilly radar]]></category>
		<category><![CDATA[storage]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=1019</guid>
		<description><![CDATA[IBM Research has developed a hardware and software solution to join 200,000 hard disks together into a single 120 petabyte storage cluster.  Here&#8217;s an article from ExtremeTech and here&#8217;s one from O&#8217;Reilly Radar with more details.  As of last year, Facebook had the world&#8217;s largest Hadoop cluster at 21 petabytes.  This IBM cluster is for [...]]]></description>
			<content:encoded><![CDATA[<p>IBM Research has developed a hardware and software solution to join 200,000 hard disks together into a single 120 petabyte storage cluster.  Here&#8217;s an article from <a title="ExtremeTech" href="http://www.extremetech.com/computing/94082-ibm-builds-120-petabyte-cluster-made-out-of-200000-hard-drives" target="_blank">ExtremeTech</a> and here&#8217;s one from <a title="O'Reilly Radar" href="http://radar.oreilly.com/2011/09/ibm-data-array-infochimps-api-hurricane-irene.html" target="_blank">O&#8217;Reilly Radar</a> with more details.  As of last year, Facebook had the <a title="World's largest Hadoop cluster" href="http://hadoopblog.blogspot.com/2010/05/facebook-has-worlds-largest-hadoop.html" target="_blank">world&#8217;s largest Hadoop cluster</a> at 21 petabytes.  This IBM cluster is for a customer, likely a government agency.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2011/09/03/ibm-builds-120pb-cluster-of-200000-hard-disks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Open hardware</title>
		<link>http://www.analyticsteam.com/2011/08/26/open-hardware/</link>
		<comments>http://www.analyticsteam.com/2011/08/26/open-hardware/#comments</comments>
		<pubDate>Fri, 26 Aug 2011 15:21:33 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[vendors]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[hardware]]></category>
		<category><![CDATA[open compute project]]></category>
		<category><![CDATA[venture beat]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=991</guid>
		<description><![CDATA[Check out this VentureBeat article about Facebook open sourcing the hardware in their data center through the Open Compute Project.]]></description>
			<content:encoded><![CDATA[<p>Check out this <a href="http://venturebeat.com/2011/08/25/facebook-open-source-hardware/" title="VentureBeat">VentureBeat</a> article about Facebook open sourcing the hardware in their data center through the <a href="http://opencompute.org/" title="Open Compute Project">Open Compute Project</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2011/08/26/open-hardware/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Facebook moves 30 petabytes of Hadoop data</title>
		<link>http://www.analyticsteam.com/2011/07/28/facebook-moves-30-petabytes-of-hadoop-data/</link>
		<comments>http://www.analyticsteam.com/2011/07/28/facebook-moves-30-petabytes-of-hadoop-data/#comments</comments>
		<pubDate>Fri, 29 Jul 2011 01:05:35 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=979</guid>
		<description><![CDATA[Strata describes the process Facebook recently went through to move 30 petabytes of Hadoop data from one data center to another.]]></description>
			<content:encoded><![CDATA[<p><a href="http://radar.oreilly.com/2011/07/facebook-hadoop-nebula-libraries.html">Strata</a> describes the process Facebook recently went through to move 30 petabytes of Hadoop data from one data center to another.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2011/07/28/facebook-moves-30-petabytes-of-hadoop-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hadoop hardware</title>
		<link>http://www.analyticsteam.com/2011/06/05/hadoop-hardware/</link>
		<comments>http://www.analyticsteam.com/2011/06/05/hadoop-hardware/#comments</comments>
		<pubDate>Mon, 06 Jun 2011 03:08:49 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[vendors]]></category>
		<category><![CDATA[dbms2]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[monash research]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=954</guid>
		<description><![CDATA[If you&#8217;re interested in using Hadoop as a tool within your enterprise, it can be quite an endeavor &#8211; figuring out what software components you need, what configuration you need, and what hardware it should run on. Lots of people are running different configurations, and while the community does share a lot of information, there [...]]]></description>
			<content:encoded><![CDATA[<p>If you&#8217;re interested in using Hadoop as a tool within your enterprise, it can be quite an endeavor &#8211; figuring out what software components you need, what configuration you need, and what hardware it should run on.  Lots of people are running different configurations, and while the community does share a lot of information, there aren&#8217;t many good recaps of the hardware being used.  <a href="http://www.dbms2.com/2011/06/04/hardware-for-hadoop/">Monash Research has a good writeup</a> that also compares how Hadoop hardware has changed over the past couple of years.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2011/06/05/hadoop-hardware/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sum of world knowledge is 295 exabytes</title>
		<link>http://www.analyticsteam.com/2011/02/12/sum-of-world-knowledge-is-295-exabytes/</link>
		<comments>http://www.analyticsteam.com/2011/02/12/sum-of-world-knowledge-is-295-exabytes/#comments</comments>
		<pubDate>Sun, 13 Feb 2011 05:39:12 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[bbc]]></category>
		<category><![CDATA[data]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=823</guid>
		<description><![CDATA[Scientists have estimated that storing the sum of all the world&#8217;s knowledge would take 295 exabytes. Check out the BBC article.]]></description>
			<content:encoded><![CDATA[<p>Scientists have estimated that storing the sum of all the world&#8217;s knowledge would take 295 exabytes.  Check out the <a href="http://www.bbc.co.uk/news/technology-12419672">BBC article</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2011/02/12/sum-of-world-knowledge-is-295-exabytes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Planning for derived data</title>
		<link>http://www.analyticsteam.com/2010/12/31/planning-for-derived-data/</link>
		<comments>http://www.analyticsteam.com/2010/12/31/planning-for-derived-data/#comments</comments>
		<pubDate>Fri, 31 Dec 2010 18:16:44 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[dbms2]]></category>
		<category><![CDATA[derived data]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=755</guid>
		<description><![CDATA[Since derived data is, by definition, based on lower-level raw data, it could be derived again as needed. DBMS2 takes us through some thinking about how better to handle derived data.]]></description>
			<content:encoded><![CDATA[<p>Since derived data is, by definition, based on lower-level raw data, it could be derived again as needed.  <a href="http://www.dbms2.com/2010/11/29/data-that-is-derived-augmented-enhanced-adjusted-or-cooked">DBMS2</a> takes us through some thinking about how better to handle derived data.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2010/12/31/planning-for-derived-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Cloudera and EMC Greenplum form data warehouse alliance</title>
		<link>http://www.analyticsteam.com/2010/09/22/cloudera-and-emc-greenplum-form-data-warehouse-alliance/</link>
		<comments>http://www.analyticsteam.com/2010/09/22/cloudera-and-emc-greenplum-form-data-warehouse-alliance/#comments</comments>
		<pubDate>Thu, 23 Sep 2010 03:30:00 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[vendors]]></category>
		<category><![CDATA[cloudera]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[emc]]></category>
		<category><![CDATA[greenplum]]></category>
		<category><![CDATA[hadoop]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=692</guid>
		<description><![CDATA[Cloudera has formed an integration alliance with Greenplum. Cloudera will integrate their distribution of Hadoop with Greenplum&#8217;s Chorus product. Read more at ZDNet.]]></description>
			<content:encoded><![CDATA[<p>Cloudera has formed an integration alliance with Greenplum.  Cloudera will integrate their distribution of Hadoop with Greenplum&#8217;s Chorus product.  Read more at <a href="http://www.zdnet.com/blog/btl/cloudera-emc-greenplum-form-data-warehousing-alliance/39483">ZDNet</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2010/09/22/cloudera-and-emc-greenplum-form-data-warehouse-alliance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM to acquire Netezza</title>
		<link>http://www.analyticsteam.com/2010/09/20/ibm-to-acquired-netezza/</link>
		<comments>http://www.analyticsteam.com/2010/09/20/ibm-to-acquired-netezza/#comments</comments>
		<pubDate>Tue, 21 Sep 2010 01:09:58 +0000</pubDate>
		<dc:creator></dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[data warehousing]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[vendors]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[netezza]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.analyticsteam.com/?p=684</guid>
		<description><![CDATA[IBM has just announced that it intends to purchase Netezza. Check out the press release. And check out DBMS2 and TechCrunch for some comments.]]></description>
			<content:encoded><![CDATA[<p>IBM has just announced that it intends to purchase Netezza.  Check out the <a href="http://www-03.ibm.com/press/us/en/pressrelease/32514.wss">press release</a>.</p>
<p><object width="640" height="385"><param name="movie" value="http://www.youtube.com/v/k1aMepDa79Q&#038;color1=0xb1b1b1&#038;color2=0xd0d0d0&#038;hl=en_US&#038;feature=player_embedded&#038;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowScriptAccess" value="always"></param><embed src="http://www.youtube.com/v/k1aMepDa79Q&#038;color1=0xb1b1b1&#038;color2=0xd0d0d0&#038;hl=en_US&#038;feature=player_embedded&#038;fs=1" type="application/x-shockwave-flash" allowfullscreen="true" allowScriptAccess="always" width="640" height="385"></embed></object></p>
<p>And check out <a href="http://www.dbms2.com/2010/09/20/ibm-netezza-acquisition/">DBMS2</a> and <a href="http://techcrunch.com/2010/09/20/ibm-buys-data-warehousing-appliance-maker-netezza-for-1-7-billion/">TechCrunch</a> for some comments.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.analyticsteam.com/2010/09/20/ibm-to-acquired-netezza/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

