On Tuesday 12 May 2009 10:02:53 Armando Serrano Lombillo wrote:
> Hello list. I have a (potentially very big) table in PyTables. I now want
> to extract all the unique values of each column. I have tried doing:
>
> dict((name, set(t.colinstances[ind])) for name in t.colnames)
>
> (where t is of course the table), but it is VERY slow.
>
> Is there a faster way?

We need a bit more info.  How exactly large your table is in comparison with 
your available memory?  Are you using compression?  If yes, which compressor 
exactly?  Finally, have you specified the `expectedrows` parameter in the 
Table constructor?

Cheers,

-- 
Francesc Alted

------------------------------------------------------------------------------
The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your
production scanning environment may not be a perfect world - but thanks to
Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700
Series Scanner you'll get full speed at 300 dpi even with all image 
processing features enabled. http://p.sf.net/sfu/kodak-com
_______________________________________________
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users

Reply via email to