Cassandra is an open source distributed database that’s now an Apache project. It was originally developed at Facebook. Twitter has announced that it will continue to use MySQL to store tweets but will use Cassandra to build a real-time analytics capability. Read the rest in the TechCrunch article.
Some analytics products get a lot of publicity. Palantir is not one of them. Used primarily by the government, its technology lets non-technical users see relationships between disparate data. Until this TechCrunch article, I hadn’t heard of the company at all.
Revolution Analytics offers a commercial version of the open source R analytics platform. Read more at FlowingData.
Here’s a post from Joydeep Sen Sarma about the combination of HBase and MapReduce.
Don’t miss the Big Data Workshop coming up on April 23rd from 9am – 5pm at the Computer History Museum in Mountain View, CA.
A lot is happening these days with open source solutions to data problems, and PostgreSQL and Hadoop are both at the center of it. Each offers unique capabilities. Tim Sell from Last.fm has put together some information about how the two can be used together. Check out the slides and video.
Analytics means different things to different people and companies, but one thing most companies have in common is that analytics is significantly underutilized in driving the business. The range of capabilities runs from basic reporting to optimization:
- Standard Reports
- Ad hoc Reports
- Drill down – OLAP
- Alert notifications
- Statistical Analysis
- Forecasting
- Modeling
- Optimization
1 to 1 Media provides a bit more color.
Over at the Process Trends site, they’ve collected a list of R scripts and data file links related to analyzing climate data.
The First Coffee blog reports that SAS and Netezza have expanded their partnership to allow SAS model code to run in parallel on Netezza’s TwinFin appliance.
IBM has acquired SPSS and, more recently, business analytics firm Red Pill. Now it is announcing an internal analytics product called Blue Insight, billed as the largest private cloud computing business analytics environment in the world. Check out the TechCrunch article.

