[jira] [Commented] (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-12-28 Thread Uwe Schindler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176548#comment-13176548 ] Uwe Schindler commented on SOLR-2346: - Nice fix, is in-line with the other charset

[jira] [Commented] (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-12-27 Thread Koji Sekiguchi (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176446#comment-13176446 ] Koji Sekiguchi commented on SOLR-2346: -- bq. I can index the file correctly by applying

[jira] [Commented] (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-12-21 Thread Shinichiro Abe (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13174637#comment-13174637 ] Shinichiro Abe commented on SOLR-2346: -- I've faced the same problem. Tika parsed my

[jira] Commented: (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-03-08 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13004339#comment-13004339 ] Koji Sekiguchi commented on SOLR-2346: -- I've faced the same problem. I'm trying to

[jira] Commented: (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-03-08 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13004346#comment-13004346 ] Koji Sekiguchi commented on SOLR-2346: -- By looking at Tika, HtmlParser and TXTParser

[jira] Commented: (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-02-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990060#comment-12990060 ] Robert Muir commented on SOLR-2346: --- {noformat} String id = (String)

[jira] Commented: (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-02-03 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990172#comment-12990172 ] Yonik Seeley commented on SOLR-2346: From the email thread: {quote} One problem is that

[jira] Commented: (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-02-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990185#comment-12990185 ] Robert Muir commented on SOLR-2346: --- Right, I agree solr should work with a non-UTF8

[jira] Commented: (SOLR-2346) Non UTF-8 Text files having other than english texts(Japanese/Hebrew) are no getting indexed correctly.

2011-02-03 Thread Prasad Deshpande (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990444#comment-12990444 ] Prasad Deshpande commented on SOLR-2346: I agree, I was just trying to decode the