Hi,
Got problem fetching information about a movie.It's named Disaster
Movie. The code I used and the output is below.
Traceback (most recent call last):
File "main.py", line 115, in <module>
parse_imdb(parse_movie_feeds())
File "main.py", line 44, in parse_imdb
access.update(movie)
File "/home/wolf/Torrent-X/backend/imdb/__init__.py", line 609, in
update
ret = method(mopID)
File "/home/wolf/Torrent-X/backend/imdb/parser/http/__init__.py", line
421, in get_movie_main
return self.mProxy.movie_parser.parse(cont, mdparse=self._mdparse)
File "/home/wolf/Torrent-X/backend/imdb/parser/http/utils.py", line
699, in parse
data = self.parse_dom(dom)
File "/home/wolf/Torrent-X/backend/imdb/parser/http/utils.py", line
796, in parse_dom
data = attr_postprocess(data)
File "/home/wolf/Torrent-X/backend/imdb/parser/http/movieParser.py",
line 807, in <lambda>
int(x.replace('Top 250: #', '')))),
ValueError: invalid literal for int() with base 10: 'Bottom 100: #2'
I simply commented out the lines below in movieParser.py, because I
didn't need that information, but this needs to be fix I think.
801# Extractor(label='top 250',
802#
path="//d...@class='left']/a[starts-with(@href, " \
803# "'/chart/')]",
804# attrs=Attribute(key='top 250 rank',
805# path="./text()",
806# postprocess=lambda x: \
807# int(x.replace('Top 250:
#', '')))),
--
Boran Basar <[email protected]>
------------------------------------------------------------------------------
_______________________________________________
Imdbpy-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/imdbpy-devel