On Tuesday 12 May 2009 10:02:53 Armando Serrano Lombillo wrote: > Hello list. I have a (potentially very big) table in PyTables. I now want > to extract all the unique values of each column. I have tried doing: > > dict((name, set(t.colinstances[ind])) for name in t.colnames) > > (where t is of course the table), but it is VERY slow. > > Is there a faster way?
We need a bit more info. How exactly large your table is in comparison with your available memory? Are you using compression? If yes, which compressor exactly? Finally, have you specified the `expectedrows` parameter in the Table constructor? Cheers, -- Francesc Alted ------------------------------------------------------------------------------ The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com _______________________________________________ Pytables-users mailing list Pytables-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/pytables-users