All,
As always, apologies for the cluelessness the following reveals. I'm
starting to move from embedded Tika to a server option for greater robustness.
Is it by design that the jax-rs server does not handle embedded files
recursively? If so, how are users currently handling documents with multiple
levels of embedding (e.g. an attachment inside an attachment) with the jax-rs
server? Would it be worthwhile to add another service that uses
AutoDetectParser as the embedded parser/extractor instead of
MyEmbeddedDocumentExtractor?
Best,
Tim
Timothy B. Allison, Ph.D.
Lead Artificial Intelligence Engineer
Group Lead
K83A/Human Language Technology
The MITRE Corporation
7515 Colshire Drive, McLean, VA 22102
703-983-2473 (phone); 703-983-1379 (fax)