[Imdbpy-devel] Bug Report about movieParser.py

Boran Basar Thu, 01 Jan 2009 06:34:14 -0800

Hi,
Got problem fetching information about a movie.It's named Disaster
Movie. The code I used and the output is below.


Traceback (most recent call last):
  File "main.py", line 115, in <module>
    parse_imdb(parse_movie_feeds())
  File "main.py", line 44, in parse_imdb
    access.update(movie)
  File "/home/wolf/Torrent-X/backend/imdb/__init__.py", line 609, in
update
    ret = method(mopID)
  File "/home/wolf/Torrent-X/backend/imdb/parser/http/__init__.py", line
421, in get_movie_main
    return self.mProxy.movie_parser.parse(cont, mdparse=self._mdparse)
  File "/home/wolf/Torrent-X/backend/imdb/parser/http/utils.py", line
699, in parse
    data = self.parse_dom(dom)
  File "/home/wolf/Torrent-X/backend/imdb/parser/http/utils.py", line
796, in parse_dom
    data = attr_postprocess(data)
  File "/home/wolf/Torrent-X/backend/imdb/parser/http/movieParser.py",
line 807, in <lambda>
    int(x.replace('Top 250: #', '')))),
ValueError: invalid literal for int() with base 10: 'Bottom 100: #2'

I simply commented out the lines below in movieParser.py, because I
didn't need that information, but this needs to be fix I think.

801#                Extractor(label='top 250',
802#
path="//d...@class='left']/a[starts-with(@href, " \
803#                                    "'/chart/')]",
804#                            attrs=Attribute(key='top 250 rank',
805#                                            path="./text()",
806#                                            postprocess=lambda x: \
807#                                            int(x.replace('Top 250:
#', '')))),


-- 
Boran Basar <boranba...@e-kolay.net>


------------------------------------------------------------------------------
_______________________________________________
Imdbpy-devel mailing list
Imdbpy-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/imdbpy-devel

[Imdbpy-devel] Bug Report about movieParser.py

Reply via email to