Tim Allison created TIKA-1754:
---------------------------------
Summary: tika-batch's FileListCrawler truncates the first
character of the fileList if the root is e.g. X:
Key: TIKA-1754
URL: https://issues.apache.org/jira/browse/TIKA-1754
Project: Tika
Issue Type: Bug
Components: batch
Affects Versions: 1.10
Reporter: Tim Allison
Priority: Trivial
The FileListCrawler takes a root directory and a list of relative file paths
and "crawls" that list as if it were a directory crawler. If the root is
specified as, e.g., "X:" on a Windows system, the code that calls substring on
the root's absolute path and adjusts the offset by one character computes the
wrong start index, so the relative path loses its leading character.
With a root of X: and a relative file of "dir1/dir2/file.doc", the output file
is: "X:/ir/dir2/file.doc.txt"
Let's get rid of the substring calculations and move to Java 7! :)
See TIKA-1747.
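
A minimal sketch of the failure mode and of the java.nio.file alternative,
not the actual FileListCrawler code. It assumes the root's absolute path
already ends with a separator (as can happen with a drive root), so skipping
one extra character for the separator eats the first character of the
relative path. The class and method names below are hypothetical.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical illustration; names are not taken from tika-batch.
class RelativePathSketch {

    // Fragile approach: assumes root has no trailing separator, so it
    // skips root.length() characters plus one more for the separator.
    static String naiveRelative(String root, String full) {
        return full.substring(root.length() + 1);
    }

    // Java 7 approach: let java.nio.file work out the relative path.
    static String nioRelative(String root, String full) {
        Path rel = Paths.get(root).relativize(Paths.get(full));
        // Normalize separators so the result reads the same on every OS.
        return rel.toString().replace('\\', '/');
    }

    public static void main(String[] args) {
        String root = "X:/"; // root that already ends with a separator
        String full = "X:/dir1/dir2/file.doc";
        System.out.println(naiveRelative(root, full)); // ir1/dir2/file.doc
        System.out.println(nioRelative(root, full));   // dir1/dir2/file.doc
    }
}
```

With relativize there is no offset arithmetic to get wrong, which is the
point of dropping the substring calculations.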