Hello,

I noticed that nutch cannot retrieve title and inlinks of one of the domains in 
the seed list. However, if I run identical code from the server where this 
domain is hosted then it correctly parses it. The surprising thing is that in 
both cases this urls has

status: 2 (status_fetched)
parseStatus:    success/ok (1/0), args=[]


I used nutch-2.1 with hbase-0.92.1 and nutch 1.4.


Any ideas why this happens?

Thanks.

Alex. 

Reply via email to