I fixed my two issues (Person name & headshot),
moved headshot into the bio page,
and taking the name from <h1 name='header'> and not from <title>, not that
solid

attached the diff and my test of the fix. ( I hope you don't mind my
git-patch-format :) )
didn't had time to go over all the attributes of a person. (too many)

if any other people has issues, I'd love to take a look

BTW, I really like the parser, it's a good scraper,
tried building it myself, imdb page killed lxml, elementtree and
BeautifulSoup,
you think they are writing an ill-formed html deliberately to keep us
scrapers away ??

Fruch

On Mon, Sep 27, 2010 at 9:05 PM, Davide Alberani
<davide.alber...@gmail.com>wrote:

> On Mon, Sep 27, 2010 at 11:42 AM, Fruch <israel.fruch...@gmail.com> wrote:
> >
> > I don't believe Hugh Laurie doesn't have a head shot in imdb
>
> Hi!
>
> Unfortunately IMDbPY is badly broken since the introduction of a new set
> of web pages.
> See:
> http://groups.google.com/group/imdbpy-devel/browse_thread/thread/b1226f91645fd209
>
> Currently IMDbPY is using an account which uses the old set of web pages,
> but this only applies to movies' information, and so anything related to
> persons, characters and maybe companies is broken.
>
> I'll try to fix the main problems ASAP, but unfortunately right now my
> time is _extremely_ limited, and it may takes some weeks, to solve these
> problems.
>
> If anyone is willing to help, feel free to ask any kind of question. :-)
>
>
>
> --
> Davide Alberani <davide.alber...@gmail.com>  [PGP KeyID: 0x465BFD47]
> http://www.mimante.net/
>

Attachment: 0002-name-headshot-fix.patch
Description: Binary data

Attachment: 0003-added-strip.patch
Description: Binary data

Attachment: fix.py
Description: Binary data

------------------------------------------------------------------------------
Start uncovering the many advantages of virtual appliances
and start using them to simplify application deployment and
accelerate your shift to cloud computing.
http://p.sf.net/sfu/novell-sfdev2dev
_______________________________________________
Imdbpy-help mailing list
Imdbpy-help@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-help

Reply via email to