NoSQLBenchmarking.com

NoSQL benchmarking and analysis

Entries for the ‘Benchmarking’ Category

Updated benchmark methodology

As my work on the benchmark progressed, I needed to update my methodology to fit my new needs and to take into account the various feedback I got. First here is a little reminder of the basis of my benchmark. It was inspired by Wikipedia because they can provide me with a lot of real [...]

New results for Cassandra 0.7.2

As I updated my benchmark to work with a more up to date HBase version, I thought that I had to do the same for the other databases if I wanted to be fair. Moreover I had some problems with the Cassandra implementation of MapReduce on the 0.6.10 version (you can read this post to [...]

New results for HBase 0.90.0

The first results I published were not in favor of HBase, the performances for both read/update and MapReduce were decreasing with the size of the cluster and were very instable. I spent a lot of time trying to figure out what was the problem and I think I finally have found what could be the [...]

A scalable benchmark architecture

The first version of my benchmark simply consisted in a single client that started as many thread as there was node in the cluster. It is kind of trivial to see that this approach cannot scale to a large number of nodes, the client would be overloaded very fast. There was two way to use [...]

Easy build and dependency support

A few people asked me to add support for an easiest way to build the benchmark, this is now the case. Instead of having to download yourself all the dependencies  and set them up in Eclipse, Maven will now do all the work for you. Concretely, if you want to test the benchmark all you [...]

Wikipedia benchmark code

As promised when I did my presentation at FOSDEM 2011, here is the code of my simple benchmark inspired by Wikipedia. The code is hosted at github : https://github.com/toflames/Wikipedia-noSQL-Benchmark You can now build this project using Maven. As a few dependencies where not available in any maven repository, I have decided to provide them myself [...]

About my presentation at FOSDEM 2011 (2)

This post is the second part of this previous post, so please start with this one. If you have already read the first post, this one will be about the details and my experiences with the various MapReduce implementations. The first working implementation was the mongoDB one. There is not much to tell about it as [...]

About my presentation at FOSDEM 2011

After my presentation in the Data Analytics devroom at the FOSDEM this year, I felt that I needed to complete the information that I have provided there. This post will try to explain in more details my methodology and the results. But before I start, I would like to say that all of this benchmark [...]