[ 
https://issues.apache.org/jira/browse/TIKA-4554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-4554:
------------------------------
    Description: 
In 3.x, we had four different ways to fork a process to handle dangerous files 
(and what file is not dangerous?):

a) tika-batch

b) tika-server 

c) tika-pipes

d) fork parser

 

For 4.x, we should centralize/unify our forking through tika-pipes if possible. 

We've already removed tika-batch on TIKA-4333. We removed the full server 
forking as part of the enormous refactoring on TIKA-4545.

Let's remove the ForkParser from 4.x. I propose that if anyone needs it, we can 
write a light wrapper around tika-pipes that would take an InputStream, write 
it to a temp file and then run tika-pipes against the file.

 

Any objections?

  was:
In 3.x, we had four different ways to fork a process to handle dangerous files:

a) tika-batch

b) tika-server 

c) tika-pipes

d) fork parser

 

For 4.x, we should centralize/unify our forking through tika-pipes if possible. 

We've already removed tika-batch on TIKA-4333. We removed the full server 
forking as part of the enormous refactoring on TIKA-4545.

Let's remove the ForkParser from 4.x. I propose that if anyone needs it, we can 
write a light wrapper around tika-pipes that would take an InputStream, write 
it to a temp file and then run tika-pipes against the file.

 

Any objections?


> Remove the ForkParser in 4.x
> ----------------------------
>
>                 Key: TIKA-4554
>                 URL: https://issues.apache.org/jira/browse/TIKA-4554
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>
> In 3.x, we had four different ways to fork a process to handle dangerous 
> files (and what file is not dangerous?):
> a) tika-batch
> b) tika-server 
> c) tika-pipes
> d) fork parser
>  
> For 4.x, we should centralize/unify our forking through tika-pipes if 
> possible. 
> We've already removed tika-batch on TIKA-4333. We removed the full server 
> forking as part of the enormous refactoring on TIKA-4545.
> Let's remove the ForkParser from 4.x. I propose that if anyone needs it, we 
> can write a light wrapper around tika-pipes that would take an InputStream, 
> write it to a temp file and then run tika-pipes against the file.
>  
> Any objections?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to