[
https://issues.apache.org/jira/browse/NUTCH-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14351387#comment-14351387
]
Chris A. Mattmann commented on NUTCH-1954:
------------------------------------------
Here's the error I got running it on the NSF ACADIS/Polar data set from my
class:
{noformat}
[mattmann@nsfpolardata local]$ ./bin/nutch commoncrawldump -outputDir out
-segment /home/mattmann/polar-data/apache-nutch-1.9/bin/AcadisCrawl2/segments/
java.io.FileNotFoundException:
out/redirect.html?link=http%3a%2f%2fdataportal.ucar.edu%2fmetadata%2fcadis%2fTerrestrial_Ecosystems%2fArctic_Ecosystem_Changes%2fBarrow_Atqasuk_ITEX_Detailed_Microclimate%2f1998-20XX%2520Barrow%2520Atqasuk%2520ITEX%2520Detailed%2520Microclimate%2520metadata.doc
(File name too long)
at java.io.FileOutputStream.open(Native Method)
at java.io.FileOutputStream.<init>(FileOutputStream.java:221)
at java.io.FileOutputStream.<init>(FileOutputStream.java:171)
at
org.apache.nutch.tools.CommonCrawlDataDumper.dump(CommonCrawlDataDumper.java:372)
at
org.apache.nutch.tools.CommonCrawlDataDumper.main(CommonCrawlDataDumper.java:235)
{noformat}
> FilenameTooLong error appears in CommonCrawlDumper
> --------------------------------------------------
>
> Key: NUTCH-1954
> URL: https://issues.apache.org/jira/browse/NUTCH-1954
> Project: Nutch
> Issue Type: Bug
> Components: commoncrawl
> Reporter: Chris A. Mattmann
> Assignee: Chris A. Mattmann
> Fix For: 1.10
>
>
> The issue from NUTCH-1950 is appearing in the CommonCrawlDumper tool as well
> (FilenameTooLong). I'm going to apply that fix here as well (based on
> MD5/message digest).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)