[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-09 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13734469#comment-13734469 ] Christoph Straßer commented on SOLR-4679: - @Uwe: Big thanks for taking care of this

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-09 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13734759#comment-13734759 ] ASF subversion and git services commented on SOLR-4679: --- Commit

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-09 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13734763#comment-13734763 ] ASF subversion and git services commented on SOLR-4679: --- Commit

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-08 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13733328#comment-13733328 ] Uwe Schindler commented on SOLR-4679: - There is another occurence of this bug with PDF

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-08 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1377#comment-1377 ] Uwe Schindler commented on SOLR-4679: - The stuff with ignorableWhitespace was discussed

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-08 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13733656#comment-13733656 ] Hoss Man commented on SOLR-4679: Uwe: I defer to your judgement on this. if you think the

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-08 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13733776#comment-13733776 ] Uwe Schindler commented on SOLR-4679: - Hoss: I just took this issue because it was

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-08 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13733791#comment-13733791 ] Hoss Man commented on SOLR-4679: bq. Because you are still not convinced with my

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-08-08 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13733901#comment-13733901 ] Uwe Schindler commented on SOLR-4679: - bq. I never said that ... You somehow said:

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-04-09 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13626309#comment-13626309 ] Christoph Straßer commented on SOLR-4679: - Thank you for checking Tika. As far as

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-04-09 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13626775#comment-13626775 ] Hoss Man commented on SOLR-4679: Right ... i wonder if somewhere in the flow of SAX events

[jira] [Commented] (SOLR-4679) HTML line breaks (br) are removed during indexing; causes wrong search results

2013-04-08 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13625691#comment-13625691 ] Hoss Man commented on SOLR-4679: FYI, i've confirmed this isn't a general problem with Tika