Special characters in HTML file are not parsed correctly
---------------------------------------------------------

Key: TIKA-208
URL: https://issues.apache.org/jira/browse/TIKA-208
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 0.3
Reporter: Siddharth Gargate

Words containing the characters ä or ö are not parsed correctly when they appear in an HTML document.

Please refer to the discussion: http://markmail.org/message/jgwzbw63o67amqu3

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.