In the SVN there are some (preliminary and not tested) improvements to 
the support of CSV files (used by the imdbpy2sql.py script to speed up
the insertion of the data in a SQL database).

Basically, you can use the --csv-only-write and --csv-only-load to
decouple two phases: in the first run of the imdbpy2sql.py script,
the old database is scrapped and the CSV files are created; in the
second run, these CSV files are used to (re)populate the database.

This should be useful under some circumstances.

Moreover, in the 'goodies' directory under 'docs' you'll find two
shell (bash) scripts to handle the plain text data files: applydiffs.sh
applies one or more 'diffs files' to your set of plain text data files;
reduce.sh takes a set of plain text data files and create a "slimmed
down" version of them, considering only 1% of any file.
This is mostly useful to debug and test the imdbpy2sql.py script.


-- 
Davide Alberani <davide.alber...@gmail.com> [GPG KeyID: 0x465BFD47]
http://www.mimante.net/

------------------------------------------------------------------------------
The Planet: dedicated and managed hosting, cloud storage, colocation
Stay online with enterprise data centers and the best network in the business
Choose flexible plans and management services without long-term contracts
Personal 24x7 support from experience hosting pros just a phone call away.
http://p.sf.net/sfu/theplanet-com
_______________________________________________
Imdbpy-devel mailing list
Imdbpy-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-devel

Reply via email to