[jira] [Updated] (TIKA-2500) Apache Tika do not extract first line of the RTF file, It only extract last three char of first line.

2017-11-13 Thread Rohit Sureshrao Shelhalkar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohit Sureshrao Shelhalkar updated TIKA-2500: - Description: When I am parsing the RTF file, it only prints last three char

[jira] [Commented] (TIKA-2496) TIKA crashes / runs out of memory on simple PDF

2017-11-13 Thread chelambarasan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250840#comment-16250840 ] chelambarasan commented on TIKA-2496: - Hi [~talli...@apache.org], I have tried with ti

[jira] [Commented] (TIKA-2502) Upgrade OpenNLP to 1.8.3

2017-11-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250311#comment-16250311 ] Hudson commented on TIKA-2502: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1390 (See [h

[jira] [Commented] (TIKA-2490) Turn off stderr warnings in Tika-app

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250258#comment-16250258 ] Tim Allison commented on TIKA-2490: --- [~markus17], are you still getting the warning for s

[jira] [Commented] (TIKA-2496) TIKA crashes / runs out of memory on simple PDF

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250251#comment-16250251 ] Tim Allison commented on TIKA-2496: --- I regret that I don't think we can do much unless yo

[jira] [Commented] (TIKA-2497) Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250244#comment-16250244 ] Tim Allison commented on TIKA-2497: --- [~kiwiwings], any ideas what may be causing this in

[jira] [Commented] (TIKA-2497) Unexpected RuntimeException from org.apache.tika.parser.microsoft.ooxml.OOXMLParser

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250243#comment-16250243 ] Tim Allison commented on TIKA-2497: --- It looks like pure Tika master is able to handle thi

[jira] [Commented] (TIKA-2502) Upgrade OpenNLP to 1.8.3

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250129#comment-16250129 ] Tim Allison commented on TIKA-2502: --- Looks like there's a workaround for 3.3.0 here: htt

[jira] [Commented] (TIKA-2486) Upgrade metadata-extractor to 2.10.1

2017-11-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16250091#comment-16250091 ] Hudson commented on TIKA-2486: -- FAILURE: Integrated in Jenkins build Tika-trunk #1389 (See [h

[jira] [Reopened] (TIKA-2502) Upgrade OpenNLP to 1.8.3

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-2502: --- With felix <= 2.3.7, we get: {noformat} [ERROR] Bundle org.apache.tika:tika-bundle:bundle:1.17-SNAPSHOT :

[jira] [Commented] (TIKA-2488) Outlook PST Parser fails from NullPointerException

2017-11-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249978#comment-16249978 ] Hudson commented on TIKA-2488: -- FAILURE: Integrated in Jenkins build Tika-trunk #1388 (See [h

[jira] [Commented] (TIKA-2502) Upgrade OpenNLP to 1.8.3

2017-11-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249980#comment-16249980 ] Hudson commented on TIKA-2502: -- FAILURE: Integrated in Jenkins build Tika-trunk #1388 (See [h

[jira] [Commented] (TIKA-2503) Try to upgrade httpclient to >=4.5.3

2017-11-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249981#comment-16249981 ] Hudson commented on TIKA-2503: -- FAILURE: Integrated in Jenkins build Tika-trunk #1388 (See [h

[jira] [Commented] (TIKA-2501) Upgrade jackson to 2.9.2

2017-11-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249979#comment-16249979 ] Hudson commented on TIKA-2501: -- FAILURE: Integrated in Jenkins build Tika-trunk #1388 (See [h

[jira] [Assigned] (TIKA-2427) Add OWASP check?

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reassigned TIKA-2427: - Assignee: Tim Allison > Add OWASP check? > > > Key: TIKA-2427 >

RE: Tika 1.17?

2017-11-13 Thread Allison, Timothy B.
Y. You're right. Thank you! I think I've been avoiding that because there were some regressions in metadata-extractor last I looked at this. Let's hope those are gone in 2.10.1. -Original Message- From: Tyler Bui-Palsulich [mailto:tpalsul...@apache.org] Sent: Sunday, November 12, 20

[jira] [Updated] (TIKA-2486) Upgrade metadata-extractor to 2.10.1

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2486: -- Priority: Blocker (was: Major) > Upgrade metadata-extractor to 2.10.1 >

[jira] [Resolved] (TIKA-2502) Upgrade OpenNLP to 1.8.3

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2502. --- Resolution: Fixed Fix Version/s: 1.17 > Upgrade OpenNLP to 1.8.3 > > >

[jira] [Resolved] (TIKA-2501) Upgrade jackson to 2.9.2

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2501. --- Resolution: Fixed Fix Version/s: 1.17 > Upgrade jackson to 2.9.2 > > >

[jira] [Commented] (TIKA-2503) Try to upgrade httpclient to >=4.5.3

2017-11-13 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249913#comment-16249913 ] Chris A. Mattmann commented on TIKA-2503: - thanks Tim, no we don't have coverage. I

[jira] [Commented] (TIKA-2503) Try to upgrade httpclient to >=4.5.3

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249901#comment-16249901 ] Tim Allison commented on TIKA-2503: --- Thank you, [~chrismattmann]! If I do something like

[jira] [Commented] (TIKA-2503) Try to upgrade httpclient to >=4.5.3

2017-11-13 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249877#comment-16249877 ] Chris A. Mattmann commented on TIKA-2503: - for OpeNDAP datasets I believe we would

[jira] [Updated] (TIKA-2499) Sonatype Nexus Auditor is reporting that Tika 1.13 is using a number of vulnerable Third party components.

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2499: -- Priority: Blocker (was: Major) > Sonatype Nexus Auditor is reporting that Tika 1.13 is using a number of

[jira] [Comment Edited] (TIKA-2504) Upgrade or remove plexus-utils

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249871#comment-16249871 ] Tim Allison edited comment on TIKA-2504 at 11/13/17 5:46 PM: - [

[jira] [Updated] (TIKA-2504) Upgrade or remove plexus-utils

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2504: -- Description: (was: [~lfcnassif] or [~gagravarr], vfs2 is an optional dependency for the RARParser. Th

[jira] [Commented] (TIKA-2504) Upgrade or remove plexus-utils

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249871#comment-16249871 ] Tim Allison commented on TIKA-2504: --- [~lfcnassif] or [~gagravarr], vfs2 is an optional de

[jira] [Created] (TIKA-2504) Upgrade or remove plexus-utils

2017-11-13 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2504: - Summary: Upgrade or remove plexus-utils Key: TIKA-2504 URL: https://issues.apache.org/jira/browse/TIKA-2504 Project: Tika Issue Type: Sub-task Reporter

[jira] [Comment Edited] (TIKA-2503) Try to upgrade httpclient to >=4.5.3

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249856#comment-16249856 ] Tim Allison edited comment on TIKA-2503 at 11/13/17 5:28 PM: - I

[jira] [Commented] (TIKA-2503) Try to upgrade httpclient to >=4.5.3

2017-11-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16249856#comment-16249856 ] Tim Allison commented on TIKA-2503: --- If I'm reading the maven dependency:tree correctly,

[jira] [Created] (TIKA-2503) Try to upgrade httpclient to >=4.5.3

2017-11-13 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2503: - Summary: Try to upgrade httpclient to >=4.5.3 Key: TIKA-2503 URL: https://issues.apache.org/jira/browse/TIKA-2503 Project: Tika Issue Type: Sub-task Re

[jira] [Created] (TIKA-2502) Upgrade OpenNLP to 1.8.3

2017-11-13 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2502: - Summary: Upgrade OpenNLP to 1.8.3 Key: TIKA-2502 URL: https://issues.apache.org/jira/browse/TIKA-2502 Project: Tika Issue Type: Sub-task Reporter: Tim

[jira] [Created] (TIKA-2501) Upgrade jackson to 2.9.2

2017-11-13 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2501: - Summary: Upgrade jackson to 2.9.2 Key: TIKA-2501 URL: https://issues.apache.org/jira/browse/TIKA-2501 Project: Tika Issue Type: Sub-task Reporter: Tim

[jira] [Updated] (TIKA-2500) Apache Tika do not extract first line of the RTF file, It only extract last three char of first line.

2017-11-13 Thread Rohit Sureshrao Shelhalkar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohit Sureshrao Shelhalkar updated TIKA-2500: - Labels: RTF RTFParser (was: RTF) > Apache Tika do not extract first line o

[jira] [Updated] (TIKA-2500) Apache Tika do not extract first line of the RTF file, It only extract last three char of first line.

2017-11-13 Thread Rohit Sureshrao Shelhalkar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohit Sureshrao Shelhalkar updated TIKA-2500: - Labels: RTF (was: ) > Apache Tika do not extract first line of the RTF fil

[jira] [Updated] (TIKA-2500) Apache Tika do not extract first line of the RTF file, It only extract last three char of first line.

2017-11-13 Thread Rohit Sureshrao Shelhalkar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohit Sureshrao Shelhalkar updated TIKA-2500: - Affects Version/s: 1.16 > Apache Tika do not extract first line of the RTF

[jira] [Updated] (TIKA-2500) Apache Tika do not extract first line of the RTF file, It only extract last three char of first line.

2017-11-13 Thread Rohit Sureshrao Shelhalkar (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rohit Sureshrao Shelhalkar updated TIKA-2500: - Description: When I am parsing the RTF file, it only prints last three char

[jira] [Created] (TIKA-2500) Apache Tika do not extract first line of the RTF file, It only extract last three char of first line.

2017-11-13 Thread Rohit Sureshrao Shelhalkar (JIRA)
Rohit Sureshrao Shelhalkar created TIKA-2500: Summary: Apache Tika do not extract first line of the RTF file, It only extract last three char of first line. Key: TIKA-2500 URL: https://issues.apache.or