Thanks! I checked version 1.27 and it does what is expected. However, the extra handling of the JSON will incur some processing overhead - not strictly necessary for my use case I think. Also, the content in X-TIKA:content is html and I would need plain text. What would be ideal would be an option to /tika (text|body) to essentially do what /remeta provides and concatenate in the output the metadata and the data. Something like `curl -H "Accept: text/plain" -H "X-Tika-meta: recursive" http://localhost:9998/tika` ? What do you think, does it make sense?
Thanks, Cristi On Wed, May 5, 2021 at 9:29 PM Tim Allison <[email protected]> wrote: > All, > I recently added a feature matrix page to our wiki for some of the > content +/- metadata endpoints in tika-server: > > https://cwiki.apache.org/confluence/display/TIKA/TikaServerEndpointsCompared > . > Please take a look and let me know what you think. > > Cheers, > > Tim > > On Wed, May 5, 2021 at 2:15 PM Tim Allison <[email protected]> wrote: > > > > Here’s a recent build if you want to check it out: > > > https://ci-builds.apache.org/job/Tika/job/tika-branch1x-jdk8/128/org.apache.tika$tika-server/artifact/org.apache.tika/tika-server/1.27-20210505.171622-28/tika-server-1.27-20210505.171622-28.jar > > > > On Wed, May 5, 2021 at 8:05 AM Tim Allison <[email protected]> wrote: > >> > >> My guess would be a month(ish)? Depends on what the community > decides... > >> > >> On Wed, May 5, 2021 at 5:59 AM Cristian Zamfir <[email protected]> > wrote: > >> > > >> > Great. When is 1.27 likely to be released? > >> > > >> > Thanks! > >> > Cristi > >> > > >> > On Wed, May 5, 2021 at 11:32 AM Tim Allison <[email protected]> > wrote: > >> >> > >> >> In 1.27, there’s an accept:application/json option for the /tika > endpoint that will do this. If you can build locally or grab a build from > Jenkins, please give it a try before the 1.27 release. > >> >> > >> >> > >> >> See also /rmeta. > >> >> > >> >> On Wed, May 5, 2021 at 5:20 AM Cristian Zamfir < > [email protected]> wrote: > >> >>> > >> >>> Hi! > >> >>> > >> >>> Is there an option to tika-server to concatenate the metadata and > the content in the same call to localhost:9998/tika, in order to avoid a > separate upload of the file just to get the metadata? > >> >>> > >> >>> Thanks! > >> >>> Cristi >
