All,

Chris Mattmann spent some time over the break updating tika-python and closing out a _bunch_ of open issues on the tika-python repo (https://github.com/chrismattmann/tika-python). The key updates (from my perspective):
1) Updated to the 2.x release branch, specifically 2.6.0: https://github.com/chrismattmann/tika-python/releases/tag/2.6.0

2) Allowed "raw" /rmeta output. The legacy behavior for tika-python was to append fields for embedded files into a single metadata object, which meant, for example, that users couldn't figure out which embedded file a given "title" belonged to (https://github.com/chrismattmann/tika-python/issues/375).

Many thanks, Chris!

Cheers,

Tim
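
To make the difference in item 2 concrete, here is a minimal sketch in plain Python. It does not call tika-python or a Tika server. The sample records, the file names, and the helpers merge_like_legacy and titles_by_file are all hypothetical, and the merge is only an approximation of the "append into one object" behavior described above, not the library's actual code. The one thing taken from Tika itself is the shape of /rmeta output: a list with one metadata object per document, container first, then each embedded file.

```python
from collections import defaultdict

# Hypothetical /rmeta-style output: one metadata dict per document,
# the container first, followed by each embedded file.
rmeta = [
    {"resourceName": "report.docx", "dc:title": "Annual Report"},
    {"resourceName": "chart.xlsx", "dc:title": "Q4 Figures",
     "X-TIKA:embedded_resource_path": "/chart.xlsx"},
    {"resourceName": "memo.pdf", "dc:title": "Internal Memo",
     "X-TIKA:embedded_resource_path": "/memo.pdf"},
]


def merge_like_legacy(docs):
    """Approximate the legacy behavior: append every value under its key."""
    merged = defaultdict(list)
    for doc in docs:
        for key, value in doc.items():
            merged[key].append(value)
    # Keys seen only once collapse back to a single value.
    return {k: v[0] if len(v) == 1 else v for k, v in merged.items()}


def titles_by_file(docs):
    """With the raw list, each title stays attached to its own file."""
    return {doc["resourceName"]: doc.get("dc:title") for doc in docs}


merged = merge_like_legacy(rmeta)
per_file = titles_by_file(rmeta)

# Merged view: three titles in one list, with no link back to a file.
print(merged["dc:title"])
# Raw view: the title of each embedded file can be looked up directly.
print(per_file)
```

In the merged view the three titles end up in a single list under "dc:title", so nothing says which one came from chart.xlsx. In the raw list each record carries its own "resourceName" and "X-TIKA:embedded_resource_path", so that question has a direct answer.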
