[
https://issues.apache.org/jira/browse/TIKA-4517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18030077#comment-18030077
]
Tilman Hausherr edited comment on TIKA-4517 at 10/15/25 12:41 PM:
------------------------------------------------------------------
Add this in TikaAsyncCLI:
{code:java}
if (!new File(inString).exists() && inString.length() > 2 &&
inString.startsWith("\"") && inString.endsWith("\"")) {
inString = inString.substring(1, inString.length() - 1);
}
if (!new File(outString).exists() && outString.length() > 2 &&
outString.startsWith("\"") && outString.endsWith("\"")) {
outString = outString.substring(1, outString.length() - 1);
}
{code}
and replace this at the end of TikaCLITest.getFilenames():
{code:java}
return (Set<String>) names.stream().map(s -> s.replace('\\',
'/')).collect(Collectors.toSet());
{code}
was (Author: tilman):
Add this in TikaAsyncCLI:
{code:java}
if (!new File(inString).exists() && inString.length() > 2 &&
inString.startsWith("\"") && inString.endsWith("\"")) {
inString = inString.substring(1, inString.length() - 1);
}
if (!new File(outString).exists() && outString.length() > 2 &&
outString.startsWith("\"") && outString.endsWith("\"")) {
outString = outString.substring(1, outString.length() - 1);
}
{code}
and replace this at the end of TikaCLITest.getFilenames():
{code:java}
return (Set<String>) names.stream().map(s -> s.replace('\\',
'/')).collect(Collectors.toSet());
{code}
> Improve async cli
> -----------------
>
> Key: TIKA-4517
> URL: https://issues.apache.org/jira/browse/TIKA-4517
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Trivial
> Fix For: 4.0.0
>
>
> Improve documentation and handling of file names as non-options.
> Add xml vs text for content extraction.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)