On 10/12/2008 05:17 PM, Davide Alberani wrote:
> In short: I've introduced a new step between preprocess_string
> and parse_dom.
> 

I wonder how I never noticed that the reference gathering parser caused
the same page to be parsed twice.

Nice idea to manipulate the dom object to get to the series info for
episodes without touching the xpaths. I think there is an unnecessary
-and hazardous- lxml import in the preprocess_dom method of the
DOMHTMLMovieParser.

> For _fix_rowspans it will be probably required to write some other
> "adapter functions" to add/replace specific nodes, but I don't
> think it would be too difficult.
> 

I was going to look at this today, but man you're fast :-)

Turgut


-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Imdbpy-devel mailing list
Imdbpy-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-devel

Reply via email to