Hi Jun,

This can be a banning strategy from the site. You are crawling it fast and
from one single IP, they might be detecting this and what a better way of
banning you than returning bad responses? I'm not completely sure about
this, but I think it's very likely that this is happening.

2014-11-13 18:32 GMT-02:00 <j...@borderxlab.com>:

> (I've posted this on github but because I couldn't attach non-images on
> github so I repost here.)
>
> Hi Scrapy experts,
> I am new to Scrapy. But it's really a cool tool. I like it very much!
> Recently I came across a problem. I am not sure if this is an existing
> issue -- I searched ON Google, github and didn't find anything similar.
> Here is my question. I wrote a crawler to crawl products on mango.com. I
> set CONCURRENT_REQUESTS = 16 in settings.py. However, I found sometimes the
> response passed to my parser function is ANOTHER page. I used
> inspect_response(response, self) to stop in my parser function to debug. I
> found response.url is correct. But if I type view(response), it shows a
> completely different page from response.url. Later if I set
> CONCURRENT_REQUESTS = 1 and reran the crawler, the problem seems gone.
>
> I wrote a very simple crawler (mango_test) and pasted my debug results in
> this thread. The first couple of responses are correct. But if you look at
> the last one in the results. The response.url is
> http://shop.mango.com/US/p0/women/accessories/crisscross-leather-sandals/?id=33080334_02&ident=0__0_1415909108906&n=1&s=accesorios&ts=1415909108906.
> However, if I run a command to get product name. It extracts a total
> different product  "Wedge suede ankle boots", which does not exist on the
> web page at all. If I run view(response), it also shows a totally different
> page.
>
> I also attached my crawler (mango_test) in this thread. Hopefully it will
> help reproduce the issue.
>
> Thanks,
> Jun
>
> ======Debug results
>
> scrapy crawl spider_mt
>
> 2014-11-13 12:19:11-0800 [scrapy] INFO: Scrapy 0.24.4 started (bot:
> mango_test)
>
> 2014-11-13 12:19:11-0800 [scrapy] INFO: Optional features available: ssl,
> http11
>
> 2014-11-13 12:19:11-0800 [scrapy] INFO: Overridden settings:
> {'NEWSPIDER_MODULE': 'mango_test.spiders', 'SPIDER_MODULES':
> ['mango_test.spiders'], 'BOT_NAME': 'mango_test'}
>
> 2014-11-13 12:19:11-0800 [scrapy] INFO: Enabled extensions: LogStats,
> TelnetConsole, CloseSpider, WebService, CoreStats, SpiderState
>
> 2014-11-13 12:19:12-0800 [scrapy] INFO: Enabled downloader middlewares:
> HttpAuthMiddleware, DownloadTimeoutMiddleware, UserAgentMiddleware,
> RetryMiddleware, DefaultHeadersMiddleware, MetaRefreshMiddleware,
> HttpCompressionMiddleware, RedirectMiddleware, CookiesMiddleware,
> ChunkedTransferMiddleware, DownloaderStats
>
> 2014-11-13 12:19:12-0800 [scrapy] INFO: Enabled spider middlewares:
> HttpErrorMiddleware, OffsiteMiddleware, RefererMiddleware,
> UrlLengthMiddleware, DepthMiddleware
>
> 2014-11-13 12:19:12-0800 [scrapy] INFO: Enabled item pipelines:
>
> 2014-11-13 12:19:12-0800 [spider_mt] INFO: Spider opened
>
> 2014-11-13 12:19:12-0800 [spider_mt] INFO: Crawled 0 pages (at 0
> pages/min), scraped 0 items (at 0 items/min)
>
> 2014-11-13 12:19:12-0800 [scrapy] DEBUG: Telnet console listening on
> 127.0.0.1:6025
>
> 2014-11-13 12:19:12-0800 [scrapy] DEBUG: Web service listening on
> 127.0.0.1:6082
>
> 2014-11-13 12:19:12-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women> (referer: None)
>
> 2014-11-13 12:19:12-0800 [spider_mt] DEBUG: Filtered offsite request to '
> www.facebook.com': <GET http://www.facebook.com/mango.com>
>
> 2014-11-13 12:19:12-0800 [spider_mt] DEBUG: Filtered offsite request to '
> www.twitter.com': <GET http://www.twitter.com/mango>
>
> 2014-11-13 12:19:12-0800 [spider_mt] DEBUG: Filtered offsite request to '
> plus.google.com': <GET https://plus.google.com/117396569580388428996/posts
> >
>
> 2014-11-13 12:19:12-0800 [spider_mt] DEBUG: Filtered offsite request to '
> instagram.com': <GET http://instagram.com/mango>
>
> 2014-11-13 12:19:12-0800 [spider_mt] DEBUG: Filtered offsite request to '
> pinterest.com': <GET http://pinterest.com/mangofashion/>
>
> 2014-11-13 12:19:12-0800 [spider_mt] DEBUG: Filtered offsite request to '
> www.youtube.com': <GET http://www.youtube.com/MANGOFASHION>
>
> 2014-11-13 12:19:13-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/brooklyn-girl> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:13-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/men/new> (referer: http://shop.mango.com/US/women
> )
>
> 2014-11-13 12:19:13-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/catalogue/2014-autumn> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:13-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/accessories/foulards-and-scarves>
> (referer: http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:13-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/catalogue> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:13-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/lookbooks> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:13-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/new> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:13-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/lookbooks/fw-2014-preview> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/accessories/bags> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/accessories/hats-and-caps> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/accessories/leather-goods> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/accessories> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/accessories/belts> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/clothing/jumpsuits> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/accessories/jewellery> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Redirecting (302) to <GET
> http://shop.mango.com/preHome.faces?state=violeta_400_US> from <GET
> http://shop.mango.com/countries/women>
>
> 2014-11-13 12:19:14-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/violeta/accessories/shoes> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/social> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/social/tv> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Filtered duplicate request:
> <GET http://shop.mango.com/US/violeta/new> - no more duplicates will be
> shown (see DUPEFILTER_DEBUG to show all duplicates)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/catalogue/2014-autumn> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/lookbooks/september> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/lookbooks> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/the-magazine> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/lookbooks/november> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:15-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/lookbooks/mango-premium> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:16-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/preHome.faces?state=violeta_400_US> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:16-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/catalogue> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:16-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/catalogue/touch-2014> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:16-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/catalogue/winter-2014> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:16-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/other-accessories> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:16-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/gloves> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:16-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/sunglasses> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:17-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/foulards-and-scarves>
> (referer: http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:17-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/hats-and-caps> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:17-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/belts> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:17-0800 [spider_mt] DEBUG: Redirecting (302) to <GET
> http://shop.mango.com/index.faces?state=she_400_US> from <GET
> http://shop.mango.com/entradaPaises.faces?pais=>
>
> 2014-11-13 12:19:17-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/jewellery> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:17-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/leather-goods> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:17-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/bags> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:18-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories/shoes> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:18-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/accessories> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:18-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/chequeRegalo.faces> (referer:
> http://shop.mango.com/US/women/social)
>
> 2014-11-13 12:19:19-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/p0/women/accessories/check-scarf/?id=33035678_92&ident=0_accesorio52%2C452_0_1415909108906&n=1&s=accesorios.fulares&ts=1415909108906>
> (referer: http://shop.mango.com/US/women/accessories/gloves)
>
> 2014-11-13 12:19:19-0800 [spider_mt] DEBUG: Now parsing
> http://shop.mango.com/US/p0/women/accessories/check-scarf/?id=33035678_92&ident=0_accesorio52%2C452_0_1415909108906&n=1&s=accesorios.fulares&ts=1415909108906
>
> [u'\n\t\t\t\t\t\t\t\tCheck scarf\n\t\t\t\t\t\t\t']
>
>
> http://shop.mango.com/US/p0/women/accessories/check-scarf/?id=33035678_92&ident=0_accesorio52%2C452_0_1415909108906&n=1&s=accesorios.fulares&ts=1415909108906
>
> [s] Available Scrapy objects:
>
> [s]   crawler    <scrapy.crawler.Crawler object at 0x10d351950>
>
> [s]   item       {}
>
> [s]   request    <GET
> http://shop.mango.com/US/p0/women/accessories/check-scarf/?id=33035678_92&ident=0_accesorio52%2C452_0_1415909108906&n=1&s=accesorios.fulares&ts=1415909108906
> >
>
> [s]   response   <200
> http://shop.mango.com/US/p0/women/accessories/check-scarf/?id=33035678_92&ident=0_accesorio52%2C452_0_1415909108906&n=1&s=accesorios.fulares&ts=1415909108906
> >
>
> [s]   settings   <scrapy.settings.Settings object at 0x10c79aa50>
>
> [s]   spider     <MangoTestSpider 'spider_mt' at 0x10d360250>
>
> [s] Useful shortcuts:
>
> [s]   shelp()           Shell help (print this help)
>
> [s]   view(response)    View response in a browser
>
>
> In [*1*]:
>
> Do you really want to exit ([y]/n)? y
>
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Scraped from <200
> http://shop.mango.com/US/p0/women/accessories/check-scarf/?id=33035678_92&ident=0_accesorio52%2C452_0_1415909108906&n=1&s=accesorios.fulares&ts=1415909108906
> >
>
> {'name': [u'\n\t\t\t\t\t\t\t\tCheck scarf\n\t\t\t\t\t\t\t']}
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/clothing/premium> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/clothing/jumpsuits> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/clothing/shorts> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/m/women/accessories/black/?m=color&v=9>
> (referer: http://shop.mango.com/US/women/accessories/other-accessories)
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/p0/women/accessories/knit-wool-blend-scarf/?id=33025533_02&ident=0_accesorio45%2C445_0_1415909108906&n=1&s=accesorios.gorros&ts=1415909108906>
> (referer: http://shop.mango.com/US/women/accessories/foulards-and-scarves)
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/m/women/accessories/sale/?m=coleccion&v=true>
> (referer: http://shop.mango.com/US/women/accessories/sunglasses)
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/p0/women/accessories/heel-lace-up-ankle-boots/?id=33065662_02&ident=0_accesorio48%2C448_0_1415909108906&n=1&s=accesorios.bisuteria&ts=1415909108906>
> (referer: http://shop.mango.com/US/women/accessories/hats-and-caps)
>
> 2014-11-13 12:19:28-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/women/clothing/essentials> (referer:
> http://shop.mango.com/US/women)
>
> 2014-11-13 12:19:29-0800 [spider_mt] DEBUG: Now parsing
> http://shop.mango.com/US/p0/women/accessories/knit-wool-blend-scarf/?id=33025533_02&ident=0_accesorio45%2C445_0_1415909108906&n=1&s=accesorios.gorros&ts=1415909108906
>
> [u'\n\t\t\t\t\t\t\t\tKnit wool-blend scarf\n\t\t\t\t\t\t\t']
>
>
> http://shop.mango.com/US/p0/women/accessories/knit-wool-blend-scarf/?id=33025533_02&ident=0_accesorio45%2C445_0_1415909108906&n=1&s=accesorios.gorros&ts=1415909108906
>
> [s] Available Scrapy objects:
>
> [s]   crawler    <scrapy.crawler.Crawler object at 0x10d351950>
>
> [s]   item       {}
>
> [s]   request    <GET
> http://shop.mango.com/US/p0/women/accessories/knit-wool-blend-scarf/?id=33025533_02&ident=0_accesorio45%2C445_0_1415909108906&n=1&s=accesorios.gorros&ts=1415909108906
> >
>
> [s]   response   <200
> http://shop.mango.com/US/p0/women/accessories/knit-wool-blend-scarf/?id=33025533_02&ident=0_accesorio45%2C445_0_1415909108906&n=1&s=accesorios.gorros&ts=1415909108906
> >
>
> [s]   settings   <scrapy.settings.Settings object at 0x10c79aa50>
>
> [s]   spider     <MangoTestSpider 'spider_mt' at 0x10d360250>
>
> [s] Useful shortcuts:
>
> [s]   shelp()           Shell help (print this help)
>
> [s]   view(response)    View response in a browser
>
>
> In [*1*]:
>
> Do you really want to exit ([y]/n)? y
>
>
> 2014-11-13 12:19:39-0800 [spider_mt] DEBUG: Scraped from <200
> http://shop.mango.com/US/p0/women/accessories/knit-wool-blend-scarf/?id=33025533_02&ident=0_accesorio45%2C445_0_1415909108906&n=1&s=accesorios.gorros&ts=1415909108906
> >
>
> {'name': [u'\n\t\t\t\t\t\t\t\tKnit wool-blend scarf\n\t\t\t\t\t\t\t']}
>
> 2014-11-13 12:19:39-0800 [spider_mt] DEBUG: Now parsing
> http://shop.mango.com/US/p0/women/accessories/heel-lace-up-ankle-boots/?id=33065662_02&ident=0_accesorio48%2C448_0_1415909108906&n=1&s=accesorios.bisuteria&ts=1415909108906
>
> [u'\n\t\t\t\t\t\t\t\tHeel lace-up ankle boots\n\t\t\t\t\t\t\t']
>
>
> http://shop.mango.com/US/p0/women/accessories/heel-lace-up-ankle-boots/?id=33065662_02&ident=0_accesorio48%2C448_0_1415909108906&n=1&s=accesorios.bisuteria&ts=1415909108906
>
> [s] Available Scrapy objects:
>
> [s]   crawler    <scrapy.crawler.Crawler object at 0x10d351950>
>
> [s]   item       {}
>
> [s]   request    <GET
> http://shop.mango.com/US/p0/women/accessories/heel-lace-up-ankle-boots/?id=33065662_02&ident=0_accesorio48%2C448_0_1415909108906&n=1&s=accesorios.bisuteria&ts=1415909108906
> >
>
> [s]   response   <200
> http://shop.mango.com/US/p0/women/accessories/heel-lace-up-ankle-boots/?id=33065662_02&ident=0_accesorio48%2C448_0_1415909108906&n=1&s=accesorios.bisuteria&ts=1415909108906
> >
>
> [s]   settings   <scrapy.settings.Settings object at 0x10c79aa50>
>
> [s]   spider     <MangoTestSpider 'spider_mt' at 0x10d360250>
>
> [s] Useful shortcuts:
>
> [s]   shelp()           Shell help (print this help)
>
> [s]   view(response)    View response in a browser
>
>
> In [*1*]:
>
> Do you really want to exit ([y]/n)? y
>
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Scraped from <200
> http://shop.mango.com/US/p0/women/accessories/heel-lace-up-ankle-boots/?id=33065662_02&ident=0_accesorio48%2C448_0_1415909108906&n=1&s=accesorios.bisuteria&ts=1415909108906
> >
>
> {'name': [u'\n\t\t\t\t\t\t\t\tHeel lace-up ankle boots\n\t\t\t\t\t\t\t']}
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Redirecting (302) to <GET
> http://shop.mango.com/preHome.faces?state=she_400_US> from <GET
> http://shop.mango.com/index.faces?state=she_400_US>
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/desc/women/accessories/belts/> (referer:
> http://shop.mango.com/US/women/accessories/belts)
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Redirecting (302) to <GET
> http://shop.mango.com/index.faces?state=she_400_US> from <GET
> http://shop.mango.com/US/p0/women/accessories/wedge-suede-ankle-boots/?id=33083656_74&ident=0__0_1415909108906&n=1&s=accesorios&ts=1415909108906
> >
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/p0/women/accessories/crisscross-leather-sandals/?id=33080334_02&ident=0__0_1415909108906&n=1&s=accesorios&ts=1415909108906>
> (referer: http://shop.mango.com/US/women/accessories/jewellery)
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Redirecting (302) to <GET
> http://shop.mango.com/index.faces?state=she_400_US> from <GET
> http://shop.mango.com/US/m/women/accessories/shoes/curve/?m=zapatos42%2C442%2C342&v=Violeta
> >
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/p0/women/clothing/wedge-suede-ankle-boots/?id=33083656_74&ident=0__0_1415909108906&n=1&s=prendas.basicos&ts=1415909108906>
> (referer: http://shop.mango.com/US/women/accessories/shoes)
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/desc/women/clothing/essentials/> (referer:
> http://shop.mango.com/US/women/accessories/shoes)
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Crawled (200) <GET
> http://shop.mango.com/US/asc/women/clothing/essentials/> (referer:
> http://shop.mango.com/US/women/accessories/shoes)
>
> 2014-11-13 12:19:51-0800 [spider_mt] DEBUG: Now parsing
> http://shop.mango.com/US/p0/women/accessories/crisscross-leather-sandals/?id=33080334_02&ident=0__0_1415909108906&n=1&s=accesorios&ts=1415909108906
>
> [u'\n\t\t\t\t\t\t\t\tWedge suede ankle boots\n\t\t\t\t\t\t\t']
>
>
> http://shop.mango.com/US/p0/women/accessories/crisscross-leather-sandals/?id=33080334_02&ident=0__0_1415909108906&n=1&s=accesorios&ts=1415909108906
>
> [s] Available Scrapy objects:
>
> [s]   crawler    <scrapy.crawler.Crawler object at 0x10d351950>
>
> [s]   item       {}
>
> [s]   request    <GET
> http://shop.mango.com/US/p0/women/accessories/crisscross-leather-sandals/?id=33080334_02&ident=0__0_1415909108906&n=1&s=accesorios&ts=1415909108906
> >
>
> [s]   response   <200
> http://shop.mango.com/US/p0/women/accessories/crisscross-leather-sandals/?id=33080334_02&ident=0__0_1415909108906&n=1&s=accesorios&ts=1415909108906
> >
>
> [s]   settings   <scrapy.settings.Settings object at 0x10c79aa50>
>
> [s]   spider     <MangoTestSpider 'spider_mt' at 0x10d360250>
>
> [s] Useful shortcuts:
>
> [s]   shelp()           Shell help (print this help)
>
> [s]   view(response)    View response in a browser
>
>
> In [*1*]: view(response)
>
> Out[*1*]: True
>
>
> In [*2*]: response.xpath('//div[@itemprop="name"]/h1/text()').extract()
>
> Out[*2*]: [u'\n\t\t\t\t\t\t\t\tWedge suede ankle boots\n\t\t\t\t\t\t\t']
>
> --
> You received this message because you are subscribed to the Google Groups
> "scrapy-users" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to scrapy-users+unsubscr...@googlegroups.com.
> To post to this group, send email to scrapy-users@googlegroups.com.
> Visit this group at http://groups.google.com/group/scrapy-users.
> For more options, visit https://groups.google.com/d/optout.
>



-- 
Rocío Aramberri

-- 
You received this message because you are subscribed to the Google Groups 
"scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to scrapy-users+unsubscr...@googlegroups.com.
To post to this group, send email to scrapy-users@googlegroups.com.
Visit this group at http://groups.google.com/group/scrapy-users.
For more options, visit https://groups.google.com/d/optout.

Reply via email to