Vinay created CONNECTORS-1494:
---------------------------------

             Summary: Error crawling file system with file names having special 
characters.
                 Key: CONNECTORS-1494
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1494
             Project: ManifoldCF
          Issue Type: Bug
          Components: File system connector
    Affects Versions: ManifoldCF 2.9.1
            Reporter: Vinay


I am crawling a file system mounted on linux machine. So the Repository 
Connection is of type "File System". For some files which has some special 
characters, Manifold Cf is not picking such files.

File ex: 2GHz_XY-SCDMA_ABC_Uuޓࠚϯmӣܼ˵Ҫȳ_֚3ҿؖúشԃԫхրҠë.pdf

exception: java.lang.NumberFormatException: For input string: ""
    at 
java.lang.NumberFormatException.forInputString(NumberFormatException.java:65) 
~[?:1.8.0_151]
    at java.lang.Long.parseLong(Long.java:601) ~[?:1.8.0_151]
    at java.lang.Long.<init>(Long.java:965) ~[?:1.8.0_151]
    at 
org.apache.manifoldcf.agents.transformation.documentfilter.DocumentFilter$SpecPacker.<init>(DocumentFilter.java:513)
 ~[?:?]
    at 
org.apache.manifoldcf.agents.transformation.documentfilter.DocumentFilter.getPipelineDescription(DocumentFilter.java:76)
 ~[?:?]
    at 
org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.getTransformationDescription(IncrementalIngester.java:503)
 ~[mcf-agents.jar:?]
    at 
org.apache.manifoldcf.crawler.system.PipelineSpecification.<init>(PipelineSpecification.java:47)
 ~[mcf-pull-agent.jar:?]
    at 
org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:308) 
[mcf-pull-agent.jar:?]
FATAL 2018-02-07T23:47:15,927 (Worker thread '2') - Error tossed: For input 
string: ""



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to