[jira] Resolved: (NUTCH-824) Crawling - File Error 404 when fetching file with an hexadecimal character in the file name.

2010-05-27 Thread Michela Becchi (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michela Becchi resolved NUTCH-824.
Fix Version/s: 1.0.0
Resolution: Fixed
Hi, I fixed (or, at least, circumvented

Re: Crawling - File Error 404 when fetching file with an hexadecimal character in the file name.

2010-05-27 Thread Michela Becchi
Hi, I circumvented this problem by modifying the org.apache.nutch.protocol.file.FileResponse class belonging to the protocol-file plugin. In particular, at line 120, I added String path = "".equals(url.getPath()) ? "/" : url.getPath(); +String decoded_path = path; +try { +
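
The patch quoted above is cut off after the opening "try {", so its body is not visible here. As a rough illustration of the idea it describes (decode the percent-encoded path before looking the file up on disk), here is a minimal standalone sketch. The class name PathDecodeSketch, the method decodePath, and the choice of java.net.URLDecoder with UTF-8 are assumptions made for this example, not the actual Nutch change.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;

// Hypothetical helper, not part of Nutch: mirrors the shape of the quoted
// patch (path, then decoded_path inside a try block) under the assumption
// that the decoding step uses java.net.URLDecoder.
final class PathDecodeSketch {

    static String decodePath(String rawPath) {
        // Same normalization as the quoted line: an empty path means the root.
        String path = "".equals(rawPath) ? "/" : rawPath;
        String decodedPath = path;
        try {
            // Turns escapes such as %28 and %29 back into "(" and ")".
            // Caveat: URLDecoder also maps "+" to a space, which may be
            // wrong for file names that contain a literal plus sign.
            decodedPath = URLDecoder.decode(path, "UTF-8");
        } catch (UnsupportedEncodingException | IllegalArgumentException e) {
            // Unknown charset or malformed escape: keep the undecoded path.
        }
        return decodedPath;
    }

    public static void main(String[] args) {
        // Prints: /data/report(1).txt
        System.out.println(decodePath("/data/report%281%29.txt"));
    }
}
```

With a decoded path, a lookup such as new java.io.File(decodedPath) would point at the file name as it exists on disk rather than at its escaped form, which is consistent with the 404 disappearing when %28 is replaced by hand.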

[jira] Created: (NUTCH-824) Crawling - File Error 404 when fetching file with an hexadecimal character in the file name.

2010-05-20 Thread Michela Becchi (JIRA)
Project: Nutch
Issue Type: Bug
Components: fetcher
Affects Versions: 1.0.0
Environment: Linux nube 2.6.31-20-server #58-Ubuntu SMP x86_64 GNU/Linux
Reporter: Michela Becchi
Priority: Blocker
Hello, I am performing a local file system

Re: Crawling - File Error 404 when fetching file with an hexadecimal character in the file name.

2010-05-20 Thread Michela Becchi
Hi Julien, Thanks a lot. I tried the same test you indicated (bin/nutch plugin protocol-file org.apache.nutch.protocol.file ...) and again got an Error 404. Of course, I don't get this error if, when issuing the command, I replace the hexadecimal representation with the literal character (e.g., %28 with "("). I opened an