Re: [Pytables-users] Data organization

2010-12-09 Thread Francesc Alted
On Thursday 09 December 2010 10:19:22 Nicholas Potter wrote: > Hello everyone, > > I am working with economic data for 3140 counties and the 50 states > as well as 500 industries, and trying to figure out the best way to > store and access the data. The two options seem to be to have one > tabl

[Pytables-users] Data organization

2010-12-09 Thread Nicholas Potter
Hello everyone, I am working with economic data for 3140 counties and the 50 states as well as 500 industries, and trying to figure out the best way to store and access the data. The two options seem to be to have one table of ~32 million rows, like this: Region | Industry | variable | value **d

Re: [Pytables-users] Data organization question

2010-03-11 Thread Francesc Alted
On Thursday 11 March 2010 12:51:21 Jorge Scandaliaris wrote: > Francesc Alted pytables.org> writes: > > Yes, having everything in a single table and using compression to reduce > > unused space is the simplest option. If your maximum length for variable > > length field is high (>1000 bytes), y

Re: [Pytables-users] Data organization question

2010-03-11 Thread Jorge Scandaliaris
Francesc Alted pytables.org> writes: > Yes, having everything in a single table and using compression to reduce > unused space is the simplest option. If your maximum length for variable > length field is high (>1000 bytes), you can still use a VLArray for keeping > them, and add another lev

Re: [Pytables-users] Data organization question

2010-03-11 Thread Francesc Alted
On Wednesday 10 March 2010 08:48:33 Jorge Scandaliaris wrote: > Hi, > Some time ago I started using pytables for organizing large amounts of > data, and it has worked really nicely until now. Up to now I had an append- > only usage pattern. Because of the nature of the data, for each > measurement

[Pytables-users] Data organization question

2010-03-09 Thread Jorge Scandaliaris
Hi, Some time ago I started using pytables for organizing large amounts of data, and it has worked really nicely until now. Up to now I had an append-only usage pattern. Because of the nature of the data, for each measurement there are a number of fixed fields plus a variable-length array, I was using