Hi,

See this repository: https://github.com/aivarsk/scrapy-proxies



Merci.
---------
Lhassan Baazzi | Web Developer PHP - Symfony - JS - Scrapy
Email/Gtalk: [email protected] - Skype: baazzilhassan - Twitter:
@baazzilhassan <http://twitter.com/baazzilhassan>
Blog: http://blog.jbinfo.io/
Donate - PayPal -
<https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=BR744DG33RAGN>


2014-07-09 11:25 GMT+00:00 bing <[email protected]>:

> During my crawling, some pages return a response with partial html body
> and status 200, after I compare the response body with the one I open in
> browser, the former one miss something. How can I catch this unexpected
> partial response body case in spider or in download middleware?
>
> Below is about the log example:
>
> 2014-01-23 16:31:53+0100 [filmweb_multi] DEBUG: Crawled (408)
> http://www.filmweb.pl/film/Labirynt-2013-507169/photos> (referer:
> http://www.filmweb.pl/film/Labirynt-2013-507169) ['*partial*']
>
> --
> You received this message because you are subscribed to the Google Groups
> "scrapy-users" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at http://groups.google.com/group/scrapy-users.
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/scrapy-users.
For more options, visit https://groups.google.com/d/optout.

Reply via email to