[
https://issues.apache.org/jira/browse/NUTCH-381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12482266 ]
Andrzej Bialecki commented on NUTCH-381:
-
Your last comment confirms my suspicions. After analysis of the
[
http://issues.apache.org/jira/browse/NUTCH-381?page=comments#action_12440351 ]
Andrzej Bialecki commented on NUTCH-381:
-
It would be good to investigate where this problem occurs - is this somewhere
in the redirects? You should have
[
http://issues.apache.org/jira/browse/NUTCH-381?page=comments#action_12440453 ]
Uros Gruber commented on NUTCH-381:
---
I tried to find out what happened from the logs, but because of the threads I
couldn't find any connection. I also tried with linksdb.
[
http://issues.apache.org/jira/browse/NUTCH-381?page=comments#action_12440304 ]
nutch.newbie commented on NUTCH-381:
Yes, I can confirm this. I have a list of 5000+ URLs and it didn't work. I went
back to the regex include/exclude method.