german umlaute are not recognized
---------------------------------
Key: PDFBOX-861
URL: https://issues.apache.org/jira/browse/PDFBOX-861
Project: PDFBox
Issue Type: Bug
Components: Text extraction
Environment: tika-0.8
Reporter: Reinhard Schwab
german umlaute are not recognized in this document
http://www.computing.dcu.ie/~irehbein/SS08/uebung1/stts-guide.pdf
Guidelines f
ur das Tagging deutscher Textcorpora
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.