Jul 31

SETI (Search for Extra-Terrestrial Intelligence) is fairly well known for its use of distributed computing. For decades, it has allowed home computer users to donate time and computing resources to analyze radio signals in the hope of identifying signs of extra-terrestrials. This article from O’Reilly Radar details some of the changes SETI has made.

Jul 31

Amazon has been recommending products based on customers’ purchase and browsing history for some time. More recently, Facebook has been suggesting users you may want to add as friends. Now Twitter is adding a “Suggestions for You” feature based on the people you follow. A few more details from TechCrunch.

Jul 01

Check out this GigaOm article talking about the challenges of finding patterns within the mass of social data being generated.

Jun 27

Some analytics products receive a great deal of publicity. Palantir is not one of them. Used primarily by the government, its technology lets non-technical users see relationships between disparate data. Until this TechCrunch article, I hadn’t heard of the company at all.

Feb 21

Part of BMW Oracle’s upper hand in the most recent America’s Cup may have come from the use of data mining. The boat and all its sensors can generate 2,500 data points 10 times per second. Check out this article from the Oracle Data Mining and Analytics blog to read the rest.

Oct 24

This Boing Boing article questions whether the US military may be gathering data from unsuspecting teens and using it for data mining exercises to improve recruiting.

Aug 10

With the first Netflix Prize coming to a conclusion, Netflix has announced Netflix Prize 2. This one will be shorter than the first, with prizes awarded after 6 and 18 months. Here’s an article from O’Reilly with more info.

May 09

In the past, most executives have viewed data management as a necessary but boring cost they must tolerate. The newest DM News Data Management Survey indicates that this is starting to change.

Here’s the article. And here’s the full survey.

Apr 15

Pig is an open source platform for analyzing large data sets that works in conjunction with Hadoop clusters and MapReduce jobs. The project recently announced its 0.2.0 release, featuring a 5x performance gain over the previous version. Check out the details.

Mar 25

Rexer Analytics is just about to close its data mining survey and could use your input. The link from the Oracle Data Mining blog will give you the code you need to participate. After the survey, you can access previous versions of the compiled survey results.
