[
https://issues.apache.org/jira/browse/NUTCH-1950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338029#comment-14338029
]
ASF GitHub Bot commented on NUTCH-1950:
---------------------------------------
GitHub user xzjh opened a pull request:
https://github.com/apache/nutch/pull/9
fix for NUTCH-1950 contributed by xzjh
It is the fix for this issue:
https://issues.apache.org/jira/browse/NUTCH-1950
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/xzjh/nutch NUTCH-1950
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nutch/pull/9.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #9
----
commit f77873d621a1c8bae364b08695c6cf8aa25be0e8
Author: xzjh <[email protected]>
Date: 2015-02-26T06:36:01Z
fix for NUTCH-1950 contributed by xzjh
----
> File name too long when bin/nutch dump
> --------------------------------------
>
> Key: NUTCH-1950
> URL: https://issues.apache.org/jira/browse/NUTCH-1950
> Project: Nutch
> Issue Type: Bug
> Components: segment
> Affects Versions: 1.10
> Reporter: Chong Li
> Priority: Minor
> Fix For: 1.10
>
> Original Estimate: 48h
> Remaining Estimate: 48h
>
> When bin/dump in version 1.10-trunk, there will be an exception saying "File
> name too long". When crawling, the length of the url may be longer than 255
> bytes and nutch save the file using the url as file name. It can be saved in
> segments but when dumping the files to local file system, the length of the
> filename can not be longer than 255 bytes.
> The FileDumper.java need to be changed to handle such exception.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)