Category: HBase
It's hard to keep track of all the database-related terms you hear these days. What constitutes "big data"? What is NoSQL, and why are your developers so interested in it? And now "NewSQL"? Where do in-memory databases fit into all of this? In this series, we'll untangle the mess of terms and tell you what you need to know.
A few months ago we told you about a paper by Microsoft researchers Erik Meijer and Gavin Bierman, which argued that non-relational data stores will need to create a standardized database query language in order to achieve widespread adoption.
Today a new potential standard for document databases (and possibly other NoSQL databases) was announced: UnQL.
Sometimes best practices and practical tips can obscure mistakes that you should avoid. Sometimes what you really need is a list of worst practices.
Iwona Bialynicka-Birula has written a post organizing things not to do in Apache Hadoop into three categories: efficiency, scalability and reliability.
Bialynicka-Birula writes:
Dhruba Borthakur, a Hadoop Engineer at Facebook, has published part of a paper he co-authored with several of his engineering co-workers on Apache Hadoop. The first part of the paper explains Facebook's requirements and non-requirements for a data store for its revamped Facebook Messages application and the reasons it chose Apache Hadoop to power it. The paper will be published at SIGMOD 2011.
EMC World is taking place in Las Vegas today. In addition to the announcement of EMC's own Apache Hadoop appliance and distribution, several other companies have announced new products ranging from software integration tools to storage appliances.
We've covered the increasing competition and innovation in the Hadoop market, and those trends show no signs of slowing down.
First of all, let's leave aside the issue of whether we're in a bubble or not, and just assume that we are. Ashlee Vance has an excellent piece in Business Week looking at one tragic aspect of this bubble: too many mathematicians are flocking to Silicon Valley to work on advertising platforms at companies like Google, Facebook and Zynga.
Earlier today we told you about how the Hadoop business is heating up. Hadoop isn't a database, but HBase and its alternatives are a core part of the platform. IBM is offering Hadoop products, and EMC is making an announcement next month. Microsoft has its own alternative, Azure Table Storage.
Cloudera, one of the primary contributors to Apache Hadoop, has released a new version of its Hadoop distribution: Cloudera's Distribution including Apache Hadoop v3 (CDH3).
The new version contains over 1,000 patches and changes, many of which will be contributed back to the open source project. CDH3 includes a full stack of software, from the operating system to tools for working with Hadoop, such as Pig and Hive. CDH3 is free and open source; Cloudera makes its money selling enterprise support and management tools.
Facebook will follow up on its Open Compute initiative with a new initiative to improve its data storage.
According to Fast Company, the emphasis on storage is needed because of the sheer volume of data that Facebook's users continue to produce. Those hundreds of millions of users take a lot of pictures. They use Facebook to post their video.