[Pytables-users] Best way to store sequences of arbitrary objects containing arrays

Anand Patil Mon, 19 May 2008 10:02:31 -0700

Hi all,


I'd like to store a long sequence of Python objects with PyTables. The
only things I know about the objects are:

- Their memory footprint is dominated by a big numpy array, and
- The big array has the same attribute name on every object:
x1.big_array, x2.big_array, etc.

I would rather not require the array to be the same shape for each  
object.


I think I'd want to make a group with a single ObjectAtom array and
a whole bunch of arrays whose atoms correspond to big_array.dtype. To
store an object, I'd destroy all its references to its big_array,
pickle it into the ObjectAtom array, and store its big_array in one of
the other arrays.
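
To make the scheme concrete, here is a rough sketch of the idea using
only the standard library. It is not PyTables code: a plain list stands
in for the ObjectAtom array, a dict stands in for the per-object array
nodes, a Python list stands in for the numpy array, and SimpleNamespace
stands in for my objects. The names store, load, pickled_objects and
arrays are made up for illustration. Note that it only removes the
direct big_array attribute, so any other reference held inside the
object would still get pickled.

```python
import copy
import pickle
from types import SimpleNamespace


def store(obj, pickled_objects, arrays):
    """Pickle obj without its big_array; keep the array under the same index."""
    index = len(pickled_objects)
    stripped = copy.copy(obj)      # shallow copy, so the original stays intact
    del stripped.big_array         # removes only the direct attribute reference
    pickled_objects.append(pickle.dumps(stripped))
    arrays[index] = obj.big_array  # stand-in for a separate array node
    return index


def load(index, pickled_objects, arrays):
    """Unpickle the small part of the object and reattach its array."""
    obj = pickle.loads(pickled_objects[index])
    obj.big_array = arrays[index]
    return obj


pickled_objects, arrays = [], {}

# Stand-in object: one small attribute plus the dominant big_array.
x1 = SimpleNamespace(label="first", big_array=[0.0] * 1000)

i = store(x1, pickled_objects, arrays)
x1_back = load(i, pickled_objects, arrays)
```

Keeping the pickled object and its array under the same index is just
one way to pair them up again on load; the arrays can have different
shapes because each one is stored separately.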

My questions are:
- Is this the best way to go?
- What kind of performance penalty am I incurring by storing each of
the big_array attributes in its own PyTables array, rather than making
them cells in a table? How can I mitigate it?
- How can I make sure that all of an object's references to its  
big_array get destroyed, so that the latter doesn't get pickled with  
the object?


Thanks in advance for any help,

Anand

