Hi,
I observed that Nutch seems to have problems with certain filenames in a local filesystem crawl.
One of my files had an exclamation mark (!) in its name and was not processed by the Nutch crawl.
After I removed the exclamation mark from the filename, Nutch was able to process the file.
Maybe there are further characters that cause the same problem?
Is this worth an issue in JIRA?
Regards,
Boris
