Hi, See this repository: https://github.com/aivarsk/scrapy-proxies
Merci. --------- Lhassan Baazzi | Web Developer PHP - Symfony - JS - Scrapy Email/Gtalk: [email protected] - Skype: baazzilhassan - Twitter: @baazzilhassan <http://twitter.com/baazzilhassan> Blog: http://blog.jbinfo.io/ Donate - PayPal - <https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=BR744DG33RAGN> 2014-07-09 11:25 GMT+00:00 bing <[email protected]>: > During my crawling, some pages return a response with partial html body > and status 200, after I compare the response body with the one I open in > browser, the former one miss something. How can I catch this unexpected > partial response body case in spider or in download middleware? > > Below is about the log example: > > 2014-01-23 16:31:53+0100 [filmweb_multi] DEBUG: Crawled (408) > http://www.filmweb.pl/film/Labirynt-2013-507169/photos> (referer: > http://www.filmweb.pl/film/Labirynt-2013-507169) ['*partial*'] > > -- > You received this message because you are subscribed to the Google Groups > "scrapy-users" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at http://groups.google.com/group/scrapy-users. > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "scrapy-users" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/scrapy-users. For more options, visit https://groups.google.com/d/optout.
