Amazon has been recommending products based on past purchase and browse data for some time. More recently, Facebook has been suggesting users who you may want to add as friends. Now twitter is adding a suggestions for you feature based on the people you follow. A few more details from Techcrunch.
There are some analytics products that receive high degrees of notoriety. Palantir is not one of them. Primarily used by the government, their technology allows non-technical users to see relationships between disparate data. Until this Techcrunch article I hadn’t heard of them at all.
Part of BMW Oracle’s upper hand in the most recent America’s Cup may have come from the use of data mining. The boat and all its sensors can generate 2,500 data points 10 times per second. Check out this article from the Oracle Data Mining and Analytics blog to read the rest.
This Boing Boing article questions whether the US military may be gathering data from unsuspecting teens and using it for data mining exercises to improve recruiting.
With the first Netflix Prize coming to a conclusion, Netflix has announced Netflix Prize 2. This one will be shorter than the first with prizes being awarded after 6 and 18 months. Here’s an article from O’Reilly with more info.
In an effort to increase user engagement, satisfaction, and profitability, many websites are offering their users various types of recommendations. Many people are familiar with Amazon.com’s people who bought also bought or people who browsed also browsed. But how do they do that?
Darren Vengroff, chief scientist from RichRelevance, explains some of the components of a recommendation system in a GigaOM article.
In the past, most executives have viewed data management as a necessary but boring cost they must tolerate. The newest DM News Data Management Survey indicates that this is starting to change.
Here’s the article. And here’s the full survey.
Pig is an open source platform for analyzing large data sets that works in conjunction with Hadoop clusters and Map-Reduce jobs. They recently announced their 0.20 release featuring a 5X performance gain over the previous version. Check out the details.
Rexer Analytics is just about to close their data mining survey and could use your input. The link from the Oracle data mining blog will give you the code you need to participate. After the survey, you can access previous versions of the compiled survey results.

