[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14287791#comment-14287791 ] Hoss Man commented on TIKA-1526: bq. I think we should catch the posix_spawn exception in

[jira] [Created] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-21 Thread Hoss Man (JIRA)
Hoss Man created TIKA-1526: -- Summary: ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers Key: TIKA-1526 URL:

[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-08 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733649#comment-13733649 ] Hoss Man commented on TIKA-1134: bq. keep this open to make javadocs inside all those

[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-08 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733819#comment-13733819 ] Hoss Man commented on TIKA-1134: The crux of my initial confusion and continued concern

[jira] [Created] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
Hoss Man created TIKA-1134: -- Summary: ContentHandler gets ignorable whitespace for br tags when parsing HTML Key: TIKA-1134 URL: https://issues.apache.org/jira/browse/TIKA-1134 Project: Tika Issue

[jira] [Updated] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated TIKA-1134: --- Attachment: TIKA-1134.patch FWIW: Changing the XHTMLContentHandler.newline() function to delegate to

[jira] [Updated] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated TIKA-1134: --- Attachment: (was: SOLR-4679__weird_TIKA-1134.patch) ContentHandler gets ignorable whitespace for br

[jira] [Comment Edited] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-06-11 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13680544#comment-13680544 ] Hoss Man edited comment on TIKA-1134 at 6/11/13 7:00 PM: - -patch