Re: [Imdbpy-devel] First patch for DOM parser

H. Turgut Uyar Wed, 25 Jun 2008 08:26:18 -0700

> I'm sure that some (many?) complex parser will require some (a lot
> of?) code to be written, but there are also many simpler code.
> Do you think these can be handled by a generic "feed_dom" code,
> using a set of provided parameters (like the one you've put in
> the _k dictionary of the HTMLSearchMovieParser class)?
>


I've tried to implement a more generic dom parser (parse_dom in the 
ParserBase class). It will use the rules as defined by the _paths 
attribute in the current parser. I've adapted the search parsers to this 
method and also written part of the person main details parser. I hope 
it holds up for other parsers as well.

Turgut

> By the way: I think it's time to give up to the _ModuleProxy approach:
> let's leave it in place to be used by ParserBase-based parsers, but
> it was just a bad idea.
> 
> Actually I'm studing the bases of XPath: once a stable design is in
> place, I hope I'll be ready to start writing some parsers.
> 
> 
> Thanks!
> 
> +++
> [1] so far I won't touch the IMDbURLopener, even if it's guessing
>     for encodings can be handled better by BeautifoulSoup and friends.


-------------------------------------------------------------------------
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://sourceforge.net/services/buy/index.php
_______________________________________________
Imdbpy-devel mailing list
Imdbpy-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-devel

Re: [Imdbpy-devel] First patch for DOM parser

Reply via email to