[ 
https://issues.apache.org/jira/browse/TIKA-3352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-3352.
-------------------------------
    Fix Version/s: 1.27
       Resolution: Fixed

> Add a handler for json output from the /tika endpoint
> -----------------------------------------------------
>
>                 Key: TIKA-3352
>                 URL: https://issues.apache.org/jira/browse/TIKA-3352
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>             Fix For: 1.27
>
>
> I've been focusing mostly on the {{/rmeta}} endpoint.  However, for many 
> users who aren't enthusiasts of the wild and crazy things that can happen 
> with embedded files (e.g., the rest of the world), it would be useful to have 
> some of the advantages of the /rmeta endpoint without the complexity.
> This would allow text + metadata in the response (for those who don't want to 
> parse the xhtml).  It would include "late metadata", that is metadata that is 
> only added after the content extraction has begun, which does not appear in 
> our usual xhtml output.  This would enable storing the stacktrace (if the 
> -s/--stackTrace commandline option is selected) in a field (as is done in 
> /rmeta) so that users would get what they could from a failed parse and be 
> able to align parse exceptions with the detected mime type.
> Unlike /rmeta, this proposal would not include stacktraces from embedded 
> files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to