[
https://issues.apache.org/jira/browse/NUTCH-1669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995059#comment-13995059
]
Julien Nioche commented on NUTCH-1669:
--------------------------------------
Hi Rafael
Looks like this issue went unnoticed. Could you please submit a patch for it?
See https://wiki.apache.org/nutch/HowToContribute
Thanks
Julien
> FTP crawl does not use FTP's server root folder
> -----------------------------------------------
>
> Key: NUTCH-1669
> URL: https://issues.apache.org/jira/browse/NUTCH-1669
> Project: Nutch
> Issue Type: Bug
> Components: protocol
> Affects Versions: 1.7
> Environment: Linux Ubuntu
> Reporter: Rafael Thomas Goz Coutinho
> Priority: Minor
> Labels: ftp, ftpclient, protocol
>
> Setup an FTP with root folder setup for a user (let's say test) pointing to
> /home/test/ftphome/
> And create a folder under it called target with a test.txt file:
> /home/test/ftphome/target/test.txt
> Configure a URL to crawl as with depth of 1:
> ftp://FTP_SERVER/target/
> It will fail to crawl because the FTP plugin protocol assumes the path is
> always absolute. It will look into /target/ and not /home/test/ftphome/target/
--
This message was sent by Atlassian JIRA
(v6.2#6252)