[ 
https://issues.apache.org/jira/browse/TIKA-4545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18042325#comment-18042325
 ] 

ASF GitHub Bot commented on TIKA-4545:
--------------------------------------

tballison commented on PR #2418:
URL: https://github.com/apache/tika/pull/2418#issuecomment-3604301680

   Still working on this one. Once it is ready, this should be merged after 
#2417.
   
   There's still quite a bit to be done here.
   
   This integrates the json configs across the parsers, detectors, etc and the 
pipes/plugin components.




> Fully integrate new json based deserializer in 4.x
> --------------------------------------------------
>
>                 Key: TIKA-4545
>                 URL: https://issues.apache.org/jira/browse/TIKA-4545
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>
> Follow on for TIKA-4544.
> Steps:
>  * Add annotations to components (parsers, etc.) and unit tests to confirm 
> they work (finished this today)
>  * Modify components (parsers etc), at least a few of them so that they are 
> actually configurable. We don't have to modify all, just the most important 
> ones PDFParser, tesseract, MSOffice, and others???
>  * Move to tika-config.json in tika-pipes client/server, tika-async-cli, 
> tika-app and tika-server one by one



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to