Here’s a great article from DataWrangling about using Amazon’s Cloud services to analyze traffic data from Wikipedia. The 320 GB data set is available to the public here.
Check out what Eric Peterson, founder of Twitalyzer.com, has to say about web analytics. Eric is the author of the book: Web Analytics Demystified: A Marketer’s Guide to Understanding How Your Web Site Affects Your Business
Who has the biggest database? Due to the increasing amount of behavioral information tracked during a web browsing session, some internet properties are starting to rack up some pretty hefty databases.
Ebay has a 6.5 petabyte Greenplum warehouse and a 2.5 petabyte Teradata warehouse. This system ingests hundreds of billions of new rows of data every day.
Facebook has a 2.5 petabyte Hadoop system
Yahoo has more than 1 petabyte running on their homemade system
Information Architects have finalized their webtrend visualization showing current Internet trends and how companies and individuals fit in. In addition to the interactive online version, they are also producing 1000 limited edition prints. Get yours before they’re gone.
Techcrunch reports that the Google Analytics API is now in public Beta. The solution has been in private beta for about a year. Developers will now be able to leverage the Google Analytics data to build new web analytics solutions available both online and offline. One of these new tools is Polaris, built by Desktop Reporting.
According to Google search trends for soup, springtime must be here.
Here’s a similar article from last fall talking about predicting flu outbreaks based on google searches.
In an article from Hitwise today, Hitwise used their data to determine that Craigslist had made it to the top of the weekly search results list beating 3 year champ, MySpace.
However, Techcrunch has a very different view based on a look at Google Trends. They rank Craigslist third with Facebook having a huge lead over MySpace. With different sites using different tracking methods, it’s no wonder we’re seeing such varied results. Keep your eyes open.
Techcrunch has taken a look at the value of the traffic it is getting from Twitter. While not as valuable as the initial set of users it gained from Twitter, these new users are still driving traffic.
Here’s a similar look for another website.
Twitter has been getting a lot of publicity recently. So where do all those users go after Twitter? Hitwise has the answers. The majority of traffic was directed to social media sites like Facebook and MySpace.


