Thanks for the help!
Do you happen to know why this is failing?

/cygdrive/c/Users/me/Downloads/nutch/apache-nutch-1.4-bin/apache-nutch-1.4-bin/runtime/local
$ bin/nutch readseg -dump crawl/segments/20120325130007
http://www.imdb.com/title/tt1231460/fullcredits
cygpath: can't convert empty path
SegmentReader: dump segment: crawl/segments/20120325130007
Exception in thread "main" java.lang.IllegalArgumentException: Wrong FS:
http://www.imdb.com/title/tt1231460/fullcredits/dump, expected: file:///
        at org.apache.hadoop.fs.FileSystem.checkPath(FileSystem.java:310)
        at
org.apache.hadoop.fs.RawLocalFileSystem.pathToFile(RawLocalFileSystem.java:47)
        at
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:357)
        at
org.apache.hadoop.fs.ChecksumFileSystem.delete(ChecksumFileSystem.java:430)
        at
org.apache.nutch.segment.SegmentReader.dump(SegmentReader.java:231)
        at
org.apache.nutch.segment.SegmentReader.main(SegmentReader.java:564)


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Out-of-the-box-Nutch-indexing-url-source-to-Solr-tp3855918p3864324.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to