[ 
https://issues.apache.org/jira/browse/TIKA-4472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-4472.
-------------------------------
    Fix Version/s: 4.0.0
                   3.3.0
       Resolution: Fixed

Apologies for the two PRs. The second addressed TikaGUI. We had been doing 
different things in the commandline vs the gui with regard to default 
configuration. 

 

These PRs align their behavior and point to a config file that users can now 
use as a baseline to modify their parser settings.

> Extract macros by default in tika-app's cli when run against a single file
> --------------------------------------------------------------------------
>
>                 Key: TIKA-4472
>                 URL: https://issues.apache.org/jira/browse/TIKA-4472
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 4.0.0, 3.3.0
>
>
> In tika-app's cli, when run against a single file, we're turning on some 
> non-default configurations to make things easier for users getting started. 
> For example, we're extracting inline images and incremental updates.
> I think we should also add "extract macros" for PDFs and msoffice files by 
> default.
> This won't affect configurations when running in batch/async mode.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to