This is not open source, but we use Vertica and it works very well for us. There is a community edition limited to 1 TB; above that it costs money. It has really advanced SQL (analytic functions, etc.), works like an RDBMS, has R/Java/C++ SDKs, and scales nicely. Redshift is a similar option, but Vertica has more features (pattern matching functions, etc.).
Again, not open source, so I would be interested to know what you end up going with and what your experience is.

On Mon, Feb 2, 2015 at 12:08 AM, Samuel Marks <samuelma...@gmail.com> wrote:
> Well what I am seeking is a Big Data database that can work with Small
> Data also. I.e.: scaleable from one node to vast clusters; whilst
> maintaining relatively low latency throughout.
>
> Which fit into this category?
>
> Samuel Marks
> http://linkedin.com/in/samuelmarks