Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "Troubleshooting Tika" page has been changed by TimothyAllison: https://wiki.apache.org/tika/Troubleshooting%20Tika?action=diff&rev1=11&rev2=12 If that shows the same problem, it's a PDFBox bug. Please [[http://pdfbox.apache.org/support.html|file an Apache PDFBox bug report]] and attach at least one failing file to the bug. When that gets fixed, Tika will pick up the new release and will get the fix - If PDFBox !ExtractText works fine, it's likely a Tika bug. Please [[http://tika.apache.org/contribute.html|report an Apache Tika bug]], attach at least one failing file, and mention that PDFBox !ExtractText doesn't have the issue. + If PDFBox !ExtractText works fine, it may* be a Tika bug. Please [[http://tika.apache.org/contribute.html|report an Apache Tika bug]], attach at least one failing file, and mention that PDFBox !ExtractText doesn't have the issue. + *PDFBox's ExtractText does not pull text from Annotations or Acroforms, so it is possible that a problem not encountered by PDFBox's ExtractText reveals a bug in Annotations or Acroforms; might be a bug in Tika, too. When in doubt, ask. +
