Hi Kevin, The output seems correct to me. I would guess that meta-data field, in this case, is just very long. But, I'm not familiar with the he5 format. So, there is a chance that field shouldn't be in the metadata at all, but that is unlikely.
You can check out http://wiki.apache.org/tika/TikaJAXRS for some information on tika-server. Tyler On Fri, Jun 27, 2014 at 7:55 AM, kevin slote <[email protected]> wrote: > Hello everyone. I have a question about the expected output for tika. I > am working on integrating my python application with tika-server. One of > the test files for unit test produces this for the metadata. The test file > is test.he5, > and the way I call tika is just to send this file to > http://localhost:9998/meta while tika-serve-1.5 is running. > > Should I expect csv formatted data that occasionally has long strings of > text with many line breaks? >
