On Mon, 18 Nov 2013, Daniel Gibby wrote:
Is the PDF conversion a part of a separate project like the MS word document conversion is?
Yup, very similar to the word stuff. Tika uses Apache PDFBox with custom Tika code to call it in the right way. Therefore, some fixes will be direct to Tika, while others will need upstream fixes in PDFBox
Nick
