[ https://issues.apache.org/jira/browse/TIKA-3123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17142921#comment-17142921 ]
Kenneth William Krugler commented on TIKA-3123:
-----------------------------------------------

This looks like a character encoding issue. Can you provide the complete HTML that you sent to tika-server?

> request to parse Chinese, but return Russian
> --------------------------------------------
>
>                 Key: TIKA-3123
>                 URL: https://issues.apache.org/jira/browse/TIKA-3123
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.24.1
>            Reporter: 阿里木
>            Priority: Trivial
>
> Try to parse html text containing Chinese:
> {code:java}
> <span>被</span>{code}
> tika-server return Russian:
> {code:java}
> <p>швл</p>
> {code}
>

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
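
A quick way to test the encoding hypothesis from the comment above: the sketch below takes the UTF-8 bytes of 被 and decodes them as IBM866 (Python codec name `cp866`), a DOS Cyrillic code page. The helper name `mojibake` and the choice of `cp866` as the candidate charset are assumptions made for illustration; neither comes from the issue or from Tika. The result only shows that the reported output is consistent with such a mix-up. It does not establish where in the request or in tika-server the wrong charset was picked.

```python
# Hypothetical helper, not part of Tika: reproduce a charset mix-up by
# encoding text as UTF-8 and decoding the same bytes with another charset.
def mojibake(text: str, wrong_charset: str = "cp866") -> str:
    """Return text as it would appear if its UTF-8 bytes were read as wrong_charset."""
    return text.encode("utf-8").decode(wrong_charset)


if __name__ == "__main__":
    raw = "被".encode("utf-8")
    print(raw.hex(" "))    # e8 a2 ab
    print(mojibake("被"))  # швл
```

If the full HTML that was sent declares no charset (or the request carries no charset in its Content-Type header), that would be the first thing to rule out.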