All,

  Chris Mattmann spent some time over the break updating tika-python
and closing out a _bunch_ of open issues on the tika-python repo
(https://github.com/chrismattmann/tika-python).  The key updates (from
my perspective):

1) Updated to the 2.x release branch, specifically 2.6.0:
https://github.com/chrismattmann/tika-python/releases/tag/2.6.0

2) Allowed "raw" /rmeta output.  The legacy behavior for tika-python
was to append fields for embedded files into a single metadata object,
which meant, for example, that users couldn't figure out which
embedded file a given "title" belonged to
(https://github.com/chrismattmann/tika-python/issues/375).

  Many thanks, Chris!

            Cheers,

                     Tim

Reply via email to