[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-08 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733344#comment-13733344 ] Uwe Schindler commented on TIKA-1134: - Hi Hoss, the rule in TIKA is: - TIKA inserts

[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-08 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733348#comment-13733348 ] Uwe Schindler commented on TIKA-1134: - I think this issue is Won't fix. The issues

[jira] [Commented] (TIKA-1157) Mp3 file won't convert = 100% CPU

2013-08-08 Thread Damien Dykman (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733645#comment-13733645 ] Damien Dykman commented on TIKA-1157: - Issue is resolved with nightly build (revision

[jira] [Closed] (TIKA-1157) Mp3 file won't convert = 100% CPU

2013-08-08 Thread Damien Dykman (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Damien Dykman closed TIKA-1157. --- Resolution: Fixed Fix Version/s: 1.5 Fixed sometime prior to nightly build from 2013/08/08

[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-08 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733649#comment-13733649 ] Hoss Man commented on TIKA-1134: bq. keep this open to make javadocs inside all those

[jira] [Closed] (TIKA-1124) Nested documents not extracted if a PDF file is in the chain

2013-08-08 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison closed TIKA-1124. - Resolution: Fixed Fix Version/s: 1.5 Added tests (thanks to Nick's advice to use model of

[jira] [Commented] (TIKA-792) NoSuchMethodException CTMarkupImpl.init(org.apache.xmlbeans.SchemaType, boolean) processing a OOXML document

2013-08-08 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733804#comment-13733804 ] Tim Allison commented on TIKA-792: -- Committed in POI. Once POI3.9beta2 is released, I'll

[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-08 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733819#comment-13733819 ] Hoss Man commented on TIKA-1134: The crux of my initial confusion and continued concern