Hi,
I've been attempting to import the IMDb database into a SQLite database,
and have found and fixed a couple of problems that were preventing me.
The first, and most serious, is that soundex() appears to have a length
limit in SQLite (at least in Windows). There are some character entries
in the "VG EverQuest 2" entry that are extremely long, over 600
characters. When soundex() was called on them, it crashed python (not a
python exception, a GPF). By adding the following line of code before
soundex() is called in name_soundexes(), the crash is avoided:
name = name[:min(512,len(name))]
title_soundex() probably needs to have something similar added also. I
experienced this on Windows XP with the ActivePython installation
(Python 2.5.1.1) and also with Python 2.5.1.2. I used SQLObject 0.9 and
0.10.0b3, and pysqlite as bundled in Python 2.5.1.1 and also pysqlite
2.4.1. (I was updating everything I could to newer versions trying to
solve the problem).
The second is that the CharactersCache class is not allowed for custom
queries and thus performance in SQLite is _slow_ when flushing the
characters cache. This is sovled by adding 'AFTER_CHARACTERS_TODB' and
'BEFORE_CHARACTERS_TODB' to ALLOWED_TIMES, and adding them to
CUSTOM_QUERIES during option scanning along with the other
'BEFORE_XXX_TODB' and 'AFTER_XXX_TODB' strings.
Hope this helps anyone else having these problems. Currently waiting for
indexing.... ;)
Clark Bassett
-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
Imdbpy-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/imdbpy-devel