Re: My "What's new with Apache Tika 2.0" talk slides

2016-05-11 Thread Sergey Beryozkin
Saw Nick passing by, but by the time I was ready to say hi he was gone; tomorrow then :-) And I guess I've seen Ken too, but did not know it was him :-). And to make it relevant: Tika rocks of course :-) On 12/05/16 04:41, Ken Krugler wrote: One annoying attendee kept asking about the new

Re: My "What's new with Apache Tika 2.0" talk slides

2016-05-11 Thread Ken Krugler
One annoying attendee kept asking about the new language detector support in 2.0 :) — Ken > On May 11, 2016, at 5:04pm, Allison, Timothy B. wrote: > > Great slides. Thank you, Nick. Wish I could be there... > > Any feedback/guidance from the audience? > > -Original

RE: My "What's new with Apache Tika 2.0" talk slides

2016-05-11 Thread Allison, Timothy B.
Great slides. Thank you, Nick. Wish I could be there... Any feedback/guidance from the audience? -Original Message- From: Nick Burch [mailto:n...@apache.org] Sent: Wednesday, May 11, 2016 5:09 PM To: u...@tika.apache.org Cc: dev@tika.apache.org Subject: My "What's new with Apache Tika

[jira] [Assigned] (TIKA-1454) Extracting as HTML loses links in xlsx, ppt, and pptx files

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reassigned TIKA-1454: - Assignee: Tim Allison > Extracting as HTML loses links in xlsx, ppt, and pptx files >

Re: [VOTE] Release Apache Tika 1.13 Candidate #1

2016-05-11 Thread Lewis John Mcgibbney
Hi David, Good job on the RC. The .zip artifact contains 2015 in NOTICE. Everything else looks great. All signatures good. Tests pass on MacOSX, Java 1.7. [X] +1 Release this package as Apache Tika 1.13 On Wed, May 11, 2016 at 6:50 AM, wrote: > > From: David Meikle

My "What's new with Apache Tika 2.0" talk slides

2016-05-11 Thread Nick Burch
Hi All, For those who couldn't make it to Vancouver this week, the slides from my "What's new with Apache Tika 2.0" talk are now available online: http://www.slideshare.net/NickBurch2/apache-tika-whats-new-with-20 The audio was recorded; hopefully that will be available to go with the slides

[jira] [Updated] (TIKA-1454) Extracting as HTML loses links in xlsx, ppt, and pptx files

2016-05-11 Thread Chris Bryant (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Bryant updated TIKA-1454: --- Affects Version/s: 1.7 1.8 1.9

[jira] [Updated] (TIKA-1454) Extracting as HTML loses links in xlsx, ppt, and pptx files

2016-05-11 Thread Chris Bryant (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Bryant updated TIKA-1454: --- Environment: RedHat EL5, EL6, EL7 (was: I tested this only on RedHat EL5.) > Extracting as HTML loses

[jira] [Commented] (TIKA-1968) Veracode static scan reports 3 very high OS Command injections in tika-core-1.9.jar

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280177#comment-15280177 ] Tim Allison commented on TIKA-1968: --- Sounds good. For the record, I'm very much in favor of some of the

[jira] [Commented] (TIKA-1968) Veracode static scan reports 3 very high OS Command injections in tika-core-1.9.jar

2016-05-11 Thread I-Min Mau (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280167#comment-15280167 ] I-Min Mau commented on TIKA-1968: - Thanks let me follow up on my end with our developers and most likely we

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280148#comment-15280148 ] Tim Allison commented on TIKA-1513: --- [~iryndin], now that 1.13 is in the voting process, I'd like to

RE: [VOTE] Release Apache Tika 1.13 Candidate #1

2016-05-11 Thread Allison, Timothy B.
+1 Built on Windows and Linux. I'm relying on earlier pre-release tests for no surprises. :) Thank you, Dave! -Original Message- From: David Meikle [mailto:loo...@gmail.com] On Behalf Of David Meikle Sent: Monday, May 9, 2016 3:35 PM To: dev@tika.apache.org; u...@tika.apache.org

[jira] [Commented] (TIKA-1966) Issue in parsing iWorksDocument with Apache Tika

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280090#comment-15280090 ] Tim Allison commented on TIKA-1966: --- Thank you, [~gagravarr]! > Issue in parsing iWorksDocument with

[jira] [Commented] (TIKA-1966) Issue in parsing iWorksDocument with Apache Tika

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280089#comment-15280089 ] Tim Allison commented on TIKA-1966: --- Y, will do. Also need to update:

[jira] [Commented] (TIKA-1967) Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@10b8c32

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280063#comment-15280063 ] Tim Allison commented on TIKA-1967: --- Given the dependencies that Tika brings with it and Solr's

[jira] [Resolved] (TIKA-1967) Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@10b8c32

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1967. --- Resolution: Not A Problem Fix Version/s: (was: 1.12) (was: 1.7)

[jira] [Commented] (TIKA-1967) Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@10b8c32

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280060#comment-15280060 ] Tim Allison commented on TIKA-1967: --- The stack trace suggests the first problem is a duplicate of

[jira] [Commented] (TIKA-1968) Veracode static scan reports 3 very high OS Command injections in tika-core-1.9.jar

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15280055#comment-15280055 ] Tim Allison commented on TIKA-1968: --- Not sure why this was assigned to me. Back in 1.9, those lines in

[jira] [Updated] (TIKA-1968) Veracode static scan reports 3 very high OS Command injections in tika-core-1.9.jar

2016-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1968: -- Assignee: (was: Tim Allison) > Veracode static scan reports 3 very high OS Command injections in >

[jira] [Updated] (TIKA-1969) The filename or extension is too long

2016-05-11 Thread Alin Turbut (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alin Turbut updated TIKA-1969: -- Description: After I add the Tika dependency to my project, I receive this error: {code} Caused by:

[jira] [Created] (TIKA-1969) The filename or extension is too long

2016-05-11 Thread Alin Turbut (JIRA)
Alin Turbut created TIKA-1969: - Summary: The filename or extension is too long Key: TIKA-1969 URL: https://issues.apache.org/jira/browse/TIKA-1969 Project: Tika Issue Type: Bug Affects