[ 
https://issues.apache.org/jira/browse/TIKA-4550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18042796#comment-18042796
 ] 

Tilman Hausherr edited comment on TIKA-4550 at 12/4/25 2:19 PM:
----------------------------------------------------------------

0 + conversion tool. Less code is always better.


was (Author: tilman):
0 + conversion tool.

> Determine migration path for xml->json configuration in 4.x
> -----------------------------------------------------------
>
>                 Key: TIKA-4550
>                 URL: https://issues.apache.org/jira/browse/TIKA-4550
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>
> In main, we now have the json configuration operative from 
> tika-parsers-standard-package through the rest of the build. 
> We still have remnants of TikaConfig in tika-app, but tika-server is json 
> only.
> What do we want the user experience to be when migrating from 3.x to 4.x?
> I think most of the conversions should be fairly straightforward – claude was 
> able to do a pretty good job, for example. I worry a bit about the parsers 
> configurations.
> Mechanically at a high level:
> 0) Major version cutover/transition. Rely on documentation. Remove all of 
> TikaConfig from main now.
> 1) Same as option 0, but try to backport the json deserialization into 3.x so 
> that users can get ready.
> 2) Keep TikaConfig in main, and make it optional in tika-app and tika-server 
> for 4.x
> 3) Something else?
> My preference is for option 0. I worry about the amount of code and effort it 
> would take to get 1 and 2 right.
> WDYT?
>  
> Things that would help with the transition:
> 0) documentation, obviously.
> 1) Perhaps some code that would convert at least the parsers section for the 
> tika-parsers-standard parsers from xml to json.
> 2) other things?
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to