susserj created TIKA-2105:
-----------------------------
Summary: Unable to process documents with french accents in
filenames
Key: TIKA-2105
URL: https://issues.apache.org/jira/browse/TIKA-2105
Project: Tika
Issue Type: Bug
Components: batch
Affects Versions: 1.13
Environment: Windows 7, Java version 1.7.0.111
Reporter: susserj
When I execute the following batch test1.bat script from my command prompt, I
get this error message:
test1.bat
@echo off
"C:\Program Files (x86)\Java\jre7\bin\java" -jar c:\temp\tika-app-1.13.jar -m
"S:\2008-09\2009-10\IC IT Environment 2009\français.docx"
Error:
Exception in thread "main" java.net.MalformedURLException: unknown protocol: s
at java.net.URL.<init>(Unknown Source)
at java.net.URL.<init>(Unknown Source)
at java.net.URL.<init>(Unknown Source)
at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:472)
at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:145)
When the filenames don't have special French characters, it works fine. (I
cannot change the names of all the files that need to be processed).
I apologise, my experience with java and TIKA is very limited.
Thanks
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)