DBMS2 reports on the launch of mapreduce.org by Aster Data. The site contains lots of good Mapreduce information and links.
Here’s a post from Joydeep Sen Sarma about the combo of Hbase and Mapreduce.
DBMS2 takes a look at these three myths about mapreduce…
* MapReduce is something very new
* MapReduce involves strict adherence to the Map-Reduce programming paradigm
* MapReduce is a single technology
Database vendor, Greenplum, is now offering a free download of the single node version of its database. It is available for several different operating systems. Here’s a DBMS2 article with some more information.
In another DBMS2 article, there’s some information about the pace of Greenplum’s recent customer acquisitions which bring it to 100+ as of this quarter.
And finally, here’s some information about Greenplum’s pricing: either subscription or perpetual.
MapReduce has been facing some criticism based on some recent performance tests. Don’t worry. The outcome is basically not to compete with DBMS in areas where DBMS is already good. The article suggests MapReduce should be used to solve the following problem types:
- text tokenization, indexing, and search
- Creation of other kinds of data structures (e.g., graphs)
- Data mining and machine learning
- Data transformation
Cloudera put together this list of 10 MapReduce tips.
You might also want to check out their list of 5 common Hadoop questions.
Amazon has announced the public beta of their hosted Hadoop framework. Using Elastic MapReduce, you can quickly launch as much processing power as needed for your analytics task. Data can be stored on the S3 platform. Sign in to the AWS Management Console to kick things off.

