Grigoriy Alekseev created TIKA-2701:
---------------------------------------

             Summary: Text is not extracted properly from WMF files
                 Key: TIKA-2701
                 URL: https://issues.apache.org/jira/browse/TIKA-2701
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.15
            Reporter: Grigoriy Alekseev
             Fix For: 2.0.0
         Attachments: thumbnail_1.wmf

Text in WMF files is always extracted on the assumption that it is encoded in cp-1252. The attached thumbnail_1.wmf contains text in Shift JIS, which is therefore extracted incorrectly. The expected text is 普林斯.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
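
The mismatch described in this report can be illustrated outside Tika with a minimal sketch. It is not the parser's actual code: the class name WmfCharsetSketch and the helpers decode and sampleBytes are hypothetical. It also assumes the JDK in use ships the Shift_JIS charset, which standard JDK builds do. The bytes are generated from the expected string rather than read from thumbnail_1.wmf.

```java
import java.nio.charset.Charset;

public class WmfCharsetSketch {

    // Hypothetical helper: decodes raw text-record bytes with the named charset.
    static String decode(byte[] raw, String charsetName) {
        return new String(raw, Charset.forName(charsetName));
    }

    // Hypothetical helper: the expected text from the report, encoded as Shift JIS.
    // Unicode escapes are used so the source compiles regardless of file encoding.
    static byte[] sampleBytes() {
        return "\u666E\u6797\u65AF".getBytes(Charset.forName("Shift_JIS"));
    }

    public static void main(String[] args) {
        byte[] raw = sampleBytes();
        // Decoding with the wrong charset yields unrelated characters.
        System.out.println("as windows-1252: " + decode(raw, "windows-1252"));
        // Decoding with the charset the text was written in restores it.
        System.out.println("as Shift_JIS:    " + decode(raw, "Shift_JIS"));
    }
}
```

The same bytes only decode to the expected text when the charset matches the one the file was written with, so a fix would presumably need to pick the charset per file or per font rather than hard-code cp-1252.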