[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176548#comment-13176548
]
Uwe Schindler commented on SOLR-2346:
-
Nice fix, is in-line with the other charset
[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13176446#comment-13176446
]
Koji Sekiguchi commented on SOLR-2346:
--
bq. I can index the file correctly by applying
[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13174637#comment-13174637
]
Shinichiro Abe commented on SOLR-2346:
--
I've faced the same problem. Tika parsed my
[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13004339#comment-13004339
]
Koji Sekiguchi commented on SOLR-2346:
--
I've faced the same problem. I'm trying to
[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13004346#comment-13004346
]
Koji Sekiguchi commented on SOLR-2346:
--
By looking at Tika, HtmlParser and TXTParser
[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990060#comment-12990060
]
Robert Muir commented on SOLR-2346:
---
{noformat}
String id = (String)
[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990172#comment-12990172
]
Yonik Seeley commented on SOLR-2346:
From the email thread:
{quote}
One problem is that
[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990185#comment-12990185
]
Robert Muir commented on SOLR-2346:
---
Right, I agree solr should work with a non-UTF8
[
https://issues.apache.org/jira/browse/SOLR-2346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12990444#comment-12990444
]
Prasad Deshpande commented on SOLR-2346:
I agree, I was just trying to decode the