Tim Allison created TIKA-1754:
---------------------------------

             Summary: tika-batch's FileListCrawler truncates the first 
character of the fileList if the root is e.g. X:
                 Key: TIKA-1754
                 URL: https://issues.apache.org/jira/browse/TIKA-1754
             Project: Tika
          Issue Type: Bug
          Components: batch
    Affects Versions: 1.10
            Reporter: Tim Allison
            Priority: Trivial


The FileListCrawler takes a root directory and a list of relative file paths 
and "crawls" that list as if it were a directory crawler.  If the root is 
specified as, e.g. "X:" on a Windows system, the call to substring on root's 
absolute path and the subtraction of one character is incorrect.

With a root of X: and a relative file of "dir1/dir2/file.doc", the output file 
is: "X:/ir/dir2/file.doc.txt"

Let's get rid of the substring calculations and move to Java 7! :)

See TIKA-1747.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to