[jira] [Commented] (TIKA-2146) Unable to extract contents from protected MS word-doc-java.lang.ArrayIndexOutOfBoundsException

2016-10-28 Thread Frank Refol (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15616341#comment-15616341 ] Frank Refol commented on TIKA-2146: --- Thanks for clarifying and providing that link. That is very helpful

[jira] [Commented] (TIKA-2146) Unable to extract contents from protected MS word-doc-java.lang.ArrayIndexOutOfBoundsException

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15616180#comment-15616180 ] Tim Allison commented on TIKA-2146: --- I wonder if these errors are caused by what I found with old

[jira] [Commented] (TIKA-2144) NullPointerException on a valid Word file

2016-10-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15616001#comment-15616001 ] Hudson commented on TIKA-2144: -- FAILURE: Integrated in Jenkins build Tika-trunk #1128 (See

tika-2.x - Build # 166 - Failure

2016-10-28 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x (build #166) Status: Failure Check console output at https://builds.apache.org/job/tika-2.x/166/ to view the results.

[jira] [Commented] (TIKA-2144) NullPointerException on a valid Word file

2016-10-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615990#comment-15615990 ] Hudson commented on TIKA-2144: -- FAILURE: Integrated in Jenkins build tika-2.x #166 (See

[jira] [Updated] (TIKA-2144) NullPointerException on a valid Word file

2016-10-28 Thread Seva Alekseyev (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seva Alekseyev updated TIKA-2144: - Attachment: (was: Proposal ID 17 Offeror ChromoLogic.docx) > NullPointerException on a valid

[jira] [Commented] (TIKA-2147) ClassCastException on a valid Word template

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615870#comment-15615870 ] Tim Allison commented on TIKA-2147: --- Great. Thank you. My proposed fix works on both docs. Will wait

[jira] [Commented] (TIKA-2147) ClassCastException on a valid Word template

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615862#comment-15615862 ] Tim Allison commented on TIKA-2147: --- https://bz.apache.org/bugzilla/show_bug.cgi?id=60316 >

[jira] [Commented] (TIKA-2150) RTF TextExtractor omits some content

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615786#comment-15615786 ] Tim Allison commented on TIKA-2150: --- Thank you for opening this and submitting a minimal file and even

[jira] [Updated] (TIKA-2147) ClassCastException on a valid Word template

2016-10-28 Thread Sharath Kumar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sharath Kumar updated TIKA-2147: Attachment: basicresume.docx > ClassCastException on a valid Word template >

[jira] [Commented] (TIKA-2147) ClassCastException on a valid Word template

2016-10-28 Thread Sharath Kumar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615588#comment-15615588 ] Sharath Kumar commented on TIKA-2147: - I get the similar issue for docx too . I have attached the

[jira] [Commented] (TIKA-2149) org.apache.poi.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument - MS Word docx

2016-10-28 Thread Sharath Kumar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615584#comment-15615584 ] Sharath Kumar commented on TIKA-2149: - Tika 2147, the input document is a word template. However not in

[jira] [Comment Edited] (TIKA-2149) org.apache.poi.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument - MS Word docx

2016-10-28 Thread Sharath Kumar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615584#comment-15615584 ] Sharath Kumar edited comment on TIKA-2149 at 10/28/16 2:37 PM: --- Bug

[jira] [Commented] (TIKA-2144) NullPointerException on a valid Word file

2016-10-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615542#comment-15615542 ] Hudson commented on TIKA-2144: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #68 (See

tika-2.x-windows - Build # 68 - Still Failing

2016-10-28 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x-windows (build #68) Status: Still Failing Check console output at https://builds.apache.org/job/tika-2.x-windows/68/ to view the results.

[jira] [Commented] (TIKA-2145) InvalidFormatException on a valid Word file

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615530#comment-15615530 ] Tim Allison commented on TIKA-2145: --- Fixed in POI https://bz.apache.org/bugzilla/show_bug.cgi?id=60315 >

[jira] [Resolved] (TIKA-2144) NullPointerException on a valid Word file

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2144. --- Resolution: Fixed > NullPointerException on a valid Word file >

[jira] [Commented] (TIKA-2142) ArrayIndexOutOfBoundsException

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615448#comment-15615448 ] Tim Allison commented on TIKA-2142: --- Fixed in POI r1767023 > ArrayIndexOutOfBoundsException >

[jira] [Created] (TIKA-2150) RTF TextExtractor omits some content

2016-10-28 Thread T. Schmidt (JIRA)
T. Schmidt created TIKA-2150: Summary: RTF TextExtractor omits some content Key: TIKA-2150 URL: https://issues.apache.org/jira/browse/TIKA-2150 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-2142) ArrayIndexOutOfBoundsException

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15615404#comment-15615404 ] Tim Allison commented on TIKA-2142: --- https://bz.apache.org/bugzilla/show_bug.cgi?id=60305 >

[jira] [Resolved] (TIKA-2149) org.apache.poi.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument - MS Word docx

2016-10-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2149. --- Resolution: Duplicate > org.apache.poi.POIXMLDocumentPart cannot be cast to >

[jira] [Issue Comment Deleted] (TIKA-2146) Unable to extract contents from protected MS word-doc-java.lang.ArrayIndexOutOfBoundsException

2016-10-28 Thread Sharath Kumar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sharath Kumar updated TIKA-2146: Comment: was deleted (was: Does tika support extracting the contents of a protected MS-word

[jira] [Commented] (TIKA-2146) Unable to extract contents from protected MS word-doc-java.lang.ArrayIndexOutOfBoundsException

2016-10-28 Thread Sharath Kumar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15614659#comment-15614659 ] Sharath Kumar commented on TIKA-2146: - Does tika support extracting the contents of a protected

[jira] [Commented] (TIKA-2146) Unable to extract contents from protected MS word-doc-java.lang.ArrayIndexOutOfBoundsException

2016-10-28 Thread Sharath Kumar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15614660#comment-15614660 ] Sharath Kumar commented on TIKA-2146: - Does tika support extracting the contents of a protected

[jira] [Created] (TIKA-2149) org.apache.poi.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument - MS Word docx

2016-10-28 Thread Sharath Kumar (JIRA)
Sharath Kumar created TIKA-2149: --- Summary: org.apache.poi.POIXMLDocumentPart cannot be cast to org.apache.poi.xwpf.usermodel.XWPFDocument - MS Word docx Key: TIKA-2149 URL: