Dinesh B Vadhia (el 2008-05-10 a les 10:10:29 -0700) va dir::

> I'm using the OS filesystem to store 32,000 images files.  I'm now
> going to move them into a datastore and the choices are pysqlite or
> MySQL or PyTables.  The number of images will grow rapidly (to the
> millions and more) and hence performance is critical.  Multiple images
> will be accessed from the data strore at a time.  There are no write
> operations just read only.
> 
> The data schema is: image index (on the image filename), image
> filename, image (jpg initially but will be other formats in the
> future).
> 
> Any and all suggestions would be appreciated.

Well, I don't quite understand the data schema (are you describing a row
of three fields in a table), but you may have a look at the
``tables.nodes.filenode`` module, which contains a ``FileNode`` class
which offers a Python file-like interface to a PyTables dataset (a one-
dimensional ``EArray`` ) holding the bytes of the file.  I should be
specially useful if you keep images stored with a file format like JPEG,
PNG and the like.

Also, I'd recommend not cramming all images under a single group to
avoid performance problems when opening the group, but to pack them in
groups of at most 4096 (see ``tables.parameters.MAX_GROUP_WIDTH``)
images per group.

Hope that helps,

::

        Ivan Vilata i Balaguer   >qo<   http://www.carabos.com/
               Cárabos Coop. V.  V  V   Enjoy Data
                                  ""

Attachment: signature.asc
Description: Digital signature

-------------------------------------------------------------------------
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
_______________________________________________
Pytables-users mailing list
Pytables-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/pytables-users

Reply via email to