I fixed my two issues (person name and headshot): I moved the headshot to the bio page, and I now take the name from <h1 name='header'> instead of from <title>. It's not that solid, though.
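To illustrate the name change described above, here is a minimal sketch of the idea. It is not the attached patch and not IMDbPY's own parser code: it uses only the standard library's html.parser, and the class name, the function name and the sample HTML are made up for the example.

```python
from html.parser import HTMLParser


class HeaderNameParser(HTMLParser):
    """Hypothetical helper: collect the text of <title> and of the
    first <h1 name="header"> element."""

    def __init__(self):
        super().__init__()
        self._target = None  # field currently being collected, if any
        self.title = ""
        self.header = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._target = "title"
        elif tag == "h1" and dict(attrs).get("name") == "header" and not self.header:
            self._target = "header"

    def handle_endtag(self, tag):
        if tag in ("title", "h1"):
            self._target = None

    def handle_data(self, data):
        if self._target == "title":
            self.title += data
        elif self._target == "header":
            self.header += data


def person_name(html):
    """Prefer the <h1 name="header"> text; fall back to <title>."""
    parser = HeaderNameParser()
    parser.feed(html)
    parser.close()
    # strip() removes the whitespace and newlines around the name
    return parser.header.strip() or parser.title.strip() or None


# Made-up sample page, only for illustration
SAMPLE = """<html><head><title>Hugh Laurie - IMDb</title></head>
<body><h1 name="header">  Hugh Laurie
</h1></body></html>"""

print(person_name(SAMPLE))
```

The fallback to <title> is only there so the sketch still returns something when the header element is missing; the title text usually carries extra site decoration, which is why the header is tried first.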
I attached the diff and my test of the fix. (I hope you don't mind my git patch format :) )

I didn't have time to go over all the attributes of a person (there are too many). If anyone else has issues, I'd love to take a look.

BTW, I really like the parser, it's a good scraper. I tried building one myself, and the imdb page killed lxml, elementtree and BeautifulSoup. Do you think they are deliberately writing ill-formed HTML to keep us scrapers away?

Fruch

On Mon, Sep 27, 2010 at 9:05 PM, Davide Alberani <davide.alber...@gmail.com> wrote:

> On Mon, Sep 27, 2010 at 11:42 AM, Fruch <israel.fruch...@gmail.com> wrote:
> >
> > I don't believe Hugh Laurie doesn't have a head shot in imdb
>
> Hi!
>
> Unfortunately IMDbPY is badly broken since the introduction of a new set
> of web pages.
> See:
> http://groups.google.com/group/imdbpy-devel/browse_thread/thread/b1226f91645fd209
>
> Currently IMDbPY is using an account which uses the old set of web pages,
> but this only applies to movies' information, and so anything related to
> persons, characters and maybe companies is broken.
>
> I'll try to fix the main problems ASAP, but unfortunately right now my
> time is _extremely_ limited, and it may take some weeks to solve these
> problems.
>
> If anyone is willing to help, feel free to ask any kind of question. :-)
>
> --
> Davide Alberani <davide.alber...@gmail.com> [PGP KeyID: 0x465BFD47]
> http://www.mimante.net/
0002-name-headshot-fix.patch
0003-added-strip.patch
fix.py
_______________________________________________ Imdbpy-help mailing list Imdbpy-help@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/imdbpy-help