[
https://issues.apache.org/jira/browse/NUTCH-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2331:
-----------------------------------
Affects Version/s: 1.15
> REST API Fetch fails to retrieve HDFS path on distributed mode
> --------------------------------------------------------------
>
> Key: NUTCH-2331
> URL: https://issues.apache.org/jira/browse/NUTCH-2331
> Project: Nutch
> Issue Type: Bug
> Components: fetcher, REST_api
> Affects Versions: 1.15
> Reporter: Sujen Shah
> Assignee: Sujen Shah
> Priority: Major
>
> Currently in the REST API, if the user does not specify the absolute path of
> the segment to fetch and only the crawlId, then the fetcher would find the
> latest segment generated and use that.
> But as of now, the above functionality will only work in local mode as per
> https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/fetcher/Fetcher.java#L562-L573.
> Need to update these lines to enable fetcher to read the directory and list
> files from an hdfs system.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)