Re: [Imdbpy-help] imdbpy 4.6 can't fetch person head shot (and who add IMDb to the names ?)

Davide Alberani Wed, 29 Sep 2010 02:45:33 -0700

On Tue, Sep 28, 2010 at 10:50 PM, Israel Fruchter
<[email protected]> wrote:
> I fixed my two issues (Person name & headshot),


Great!  Thank you very much.

I'll have time to check the patch and commit it to Mercurial only
tomorrow.  By the way, anyone should feel free to fork the IMDbPY
repository on Bitbucket (specifically
http://bitbucket.org/alberanid/imdbpy_new_search_parsers/ ),
commit his changes and ask for a pull.

> BTW, I really like the parser, it's a good scraper,

Most of the credit should go to H. Turgut Uyar, who wrote that
wonderful DOM/XPath-based parser.

> tried building it myself, imdb page killed lxml, elementtree and
> BeautifulSoup,

Strange: we're based on lxml (falling back to BeautifulSoup if
lxml is not installed).

> you think they are writing an ill-formed html deliberately to keep us
> scrapers away ??

Hmmm... I fear that, like every big portal, they have to cope with
so many browsers and environment that a nice and standard HTML
can't work. :-)


Thanks again!
-- 
Davide Alberani <[email protected]>  [PGP KeyID: 0x465BFD47]
http://www.mimante.net/

------------------------------------------------------------------------------
Start uncovering the many advantages of virtual appliances
and start using them to simplify application deployment and
accelerate your shift to cloud computing.
http://p.sf.net/sfu/novell-sfdev2dev
_______________________________________________
Imdbpy-help mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/imdbpy-help

Re: [Imdbpy-help] imdbpy 4.6 can't fetch person head shot (and who add IMDb to the names ?)

Reply via email to