I have only glanced at the problem, so I may have missed something, but my approach to a large matrix would be to realise it as a flat file and mmap it. Your program would then treat it as a memory-resident structure, and the virtual memory system of the OS would perform paging as necessary to keep a working set of the matrix in real memory.
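For example, in Python this approach is essentially a one-liner with numpy.memmap, which wraps mmap for exactly this use case. A minimal sketch, assuming a 20000x20000 float64 matrix in a file called matrix.dat (the name, dtype, and dimensions are placeholders):

    import numpy as np

    ROWS, COLS = 20000, 20000  # placeholder dimensions

    # Create (or reopen with mode='r+') the flat file as a memory-mapped matrix.
    m = np.memmap('matrix.dat', dtype=np.float64, mode='w+', shape=(ROWS, COLS))

    # Touch only what you need; the OS pages in just those bytes.
    m[0, :100] = 1.0
    first_col_sum = m[:1000, 0].sum()

    m.flush()  # push dirty pages back to disk

Only the pages you actually touch are read into memory, so the full ~3 GB file never has to fit in RAM at once.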

Andreas wrote:

Hello Benilton,

Some years ago I came across PyTables (http://www.pytables.org), a wrapper for the HDF5 format. PyTables claims to handle high data throughput very well, and it supports the matrix/array formats typically used in scientific projects. It does not provide any form of relational model, but it sounds to me that this is probably not what you need in the first place. You may be able to boost the performance of your calculations once you can load and store arrays/matrices in one piece.

I used it for document clustering and was very happy to be able to store compressed arrays generated with the Numeric/NumArray packages. The performance on ~1000 documents in the cluster was fine, though my requirements were not as critical as yours. I appreciated the ease of use and how easily metadata can be added to the dataset. A Java GUI is also available. I don't know whether your data sets and data types fit this scenario, but you may want to take a look at the FAQ [http://www.pytables.org/moin/FAQ#head-b32537aba805dac2a1bf9cd6606c4fddcd964f96].

Good luck, Andreas
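As a concrete illustration of what Andreas describes, here is a minimal sketch of storing and reading back a compressed array with PyTables. It uses the current NumPy-based API rather than Numeric/NumArray, and the file name, node name, and compression settings are placeholders:

    import numpy as np
    import tables

    # Chunked, compressed on-disk array (zlib level 5 here; blosc is another option).
    filters = tables.Filters(complevel=5, complib='zlib')

    with tables.open_file('docs.h5', mode='w') as f:
        arr = np.random.rand(1000, 500)           # stand-in for document vectors
        carray = f.create_carray(f.root, 'vectors', obj=arr, filters=filters)
        carray.attrs.source = 'clustering run 1'  # metadata travels with the dataset

    with tables.open_file('docs.h5', mode='r') as f:
        block = f.root.vectors[:100, :]  # reads only the chunks overlapping the slice

The slice in the last line decompresses only the chunks it needs, which is what makes whole-matrix storage practical even when the array is larger than RAM.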



