[ https://issues.apache.org/jira/browse/NUTCH-661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Doğacan Güney closed NUTCH-661. ------------------------------- Resolution: Won't Fix Fix Version/s: 1.0.0 Assignee: Doğacan Güney Closing this issue as Won't Fix. This can be fixed with a urlnormalizer plugin as suggested in comments. > errors when the uri contains space characters > ---------------------------------------------- > > Key: NUTCH-661 > URL: https://issues.apache.org/jira/browse/NUTCH-661 > Project: Nutch > Issue Type: Improvement > Components: fetcher > Affects Versions: 0.9.0 > Environment: RedHat 5.1 > Reporter: Christos LAIOS > Assignee: Doğacan Güney > Fix For: 1.0.0 > > > While spidering our intranet, i get the following errors when the uri > contains space characters > fetch of http://intranet-rtd.rtd.cec.eu.int/services/docs/AAR_2007 - > FINAL.doc failed with: java.lang.IllegalArgumentException: Invalid uri > 'http://intranet-rtd.rtd.cec.eu.int/services/docs/AAR_2007 - FINAL.doc': > escaped absolute path not valid -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.