On Mon, Feb 1, 2010 at 1:50 PM, Santi Saez <[email protected]> wrote: > El 01/02/10 17:56, Paul Davis escribió: > > Dear Paul, > >> Well, 2^32 of anything is 4GiB per byte stored. So, minimum of four >> bytes and you're at 16GiB. Even with just 1KiB overhead you're at >> 4TiB. >> >> I'm left wondering why you would want to store a list of numbers in >> the first place. > > Imagine a service like Netcraft. > > I know that there aren't 2^32 active servers, but I wanted to test it with > 4.3 billion documents and stress/benchamark CouchDB with other DBs. > > Regards, > > -- > Santi Saez > http://woop.es >
If you're looking for benchmark data I'd also suggest something like the Enron email dataset. I can't imagine 4.3 billion integer documents is going to be very informative about real world usage. The Wikipedia abstracts data set another candidate as well. HTH, Paul Davis
