Special characters in HTML file are not parsed correctly 
---------------------------------------------------------

                 Key: TIKA-208
                 URL: https://issues.apache.org/jira/browse/TIKA-208
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 0.3
            Reporter: Siddharth Gargate


Words containing the characters ä or ö are not parsed correctly when they
appear in an HTML document.
Please refer to the discussion:
http://markmail.org/message/jgwzbw63o67amqu3

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
