[VOTE] Release Apache Tika 1.16 Candidate #1

2017-07-07 Thread Tim Allison
A candidate for the Tika 1.16 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: https://github.com/apache/tika/tree/1.16-rc1 The SHA1 checksum of the archive is e6884af0209ace42bf0b9b59d72c3c5a0052055e In addition,

[jira] [Commented] (TIKA-2053) Adding TagRatio to Tika Parser

2017-07-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1607#comment-1607 ] ASF GitHub Bot commented on TIKA-2053: -- chrismattmann commented on issue #131: fix for TIKA-2053

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Madhav Sharan (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078886#comment-16078886 ] Madhav Sharan commented on TIKA-1988: - I faced the same issue as Tim earlier. What do you guys think

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078882#comment-16078882 ] Tim Allison commented on TIKA-1988: --- Thank you! bq. Tim why weren't the models available for you? They

[jira] [Resolved] (TIKA-2424) Don't include ml model .bin files in src.zip

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2424. --- Resolution: Fixed Fix Version/s: 1.16 > Don't include ml model .bin files in src.zip >

[jira] [Created] (TIKA-2424) Don't include ml model .bin files in src.zip

2017-07-07 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2424: - Summary: Don't include ml model .bin files in src.zip Key: TIKA-2424 URL: https://issues.apache.org/jira/browse/TIKA-2424 Project: Tika Issue Type: Improvement

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078876#comment-16078876 ] Chris A. Mattmann commented on TIKA-1988: - For now yes [~talli...@mitre.org] until we fix

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078865#comment-16078865 ] Tim Allison commented on TIKA-1988: --- [~chrismattmann], to confirm, you want the "model" directory at the

Re: [tika] branch master updated: TIKA-1988 -- allow for errors downloading models

2017-07-07 Thread Chris Mattmann
Hey Tim, I usually do a search in JIRA, then I go to the upper right of the screen and select “Bulk Change” from there. Then I Edit the fix version and push off those in my search scheduled for but with resolution Hope that helps! Cheers, Chris On 7/7/17, 11:31 AM, "Allison, Timothy B."

RE: [tika] branch master updated: TIKA-1988 -- allow for errors downloading models

2017-07-07 Thread Allison, Timothy B.
Never mind the never mind... I can't commit the src.zip to dist: Transmitting file data svn: E175002: Commit failed (details follow): svn: E175002: PUT request on '/repos/dist/!svn/txr/20365-heq/dev/tika/tika-1.16-src.zip' failed Did we hit a max file limit in our svn?

FW: [tika] branch master updated: TIKA-1988 -- allow for errors downloading models

2017-07-07 Thread Allison, Timothy B.
Never mind, looks like we included the ner models in 1.15's src.zip, which was 133MB. The new src.zip is 240MB. I'll continue with RC1. Please -1 if we need to fix this. -Original Message- From: Allison, Timothy B. [mailto:talli...@mitre.org] Sent: Friday, July 7, 2017 5:37 PM To:

RE: [tika] branch master updated: TIKA-1988 -- allow for errors downloading models

2017-07-07 Thread Allison, Timothy B.
I got most of the way through rc1 but found that the src.zip contains 60MB of models for the age recognizer. We forgot to exclude those. Unless anyone objects, I'll revert, drop the release, try to fix the excludes and try again later tonight. -Original Message- From: Allison, Timothy

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078625#comment-16078625 ] Hudson commented on TIKA-1988: -- ABORTED: Integrated in Jenkins build Tika-trunk #1321 (See

[jira] [Updated] (TIKA-2423) Document Parse error

2017-07-07 Thread Gaurav (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gaurav updated TIKA-2423: - Attachment: S5-130184 Discussion paper on access network information.doc > Document Parse error >

[jira] [Created] (TIKA-2423) Document Parse error

2017-07-07 Thread Gaurav (JIRA)
Gaurav created TIKA-2423: Summary: Document Parse error Key: TIKA-2423 URL: https://issues.apache.org/jira/browse/TIKA-2423 Project: Tika Issue Type: Bug Affects Versions: 1.15

RE: [tika] branch master updated: TIKA-1988 -- allow for errors downloading models

2017-07-07 Thread Allison, Timothy B.
Thank you, Chris! Now, how do I bulk move open 1.16->1.17 on JIRA? -Original Message- From: Chris Mattmann [mailto:mattm...@apache.org] Sent: Friday, July 7, 2017 11:39 AM To: dev@tika.apache.org Subject: Re: [tika] branch master updated: TIKA-1988 -- allow for errors downloading

build failures

2017-07-07 Thread Allison, Timothy B.
Well, that's getting more exciting... java.io.IOException: No space left on device at java.io.FileOutputStream.write(Native Method) at java.io.FileOutputStream.write(FileOutputStream.java:290) at

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078350#comment-16078350 ] Hudson commented on TIKA-1988: -- ABORTED: Integrated in Jenkins build Tika-trunk #1320 (See

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-07 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078352#comment-16078352 ] Hudson commented on TIKA-2399: -- ABORTED: Integrated in Jenkins build Tika-trunk #1320 (See

[jira] [Commented] (TIKA-2389) Warn log level is pretty strong for missing JBIG2ImageReader

2017-07-07 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078348#comment-16078348 ] Hudson commented on TIKA-2389: -- ABORTED: Integrated in Jenkins build Tika-trunk #1320 (See

[jira] [Commented] (TIKA-2338) Change Scope of Jai-ImageIO-Core dependency

2017-07-07 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078311#comment-16078311 ] Luis Filipe Nassif commented on TIKA-2338: -- As a side note, seems like Oracle will integrate jai,

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078283#comment-16078283 ] Chris A. Mattmann commented on TIKA-1988: - Sounds good to me...almost done with tika-nlp will

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078275#comment-16078275 ] Tim Allison commented on TIKA-1988: --- Thought: lower expectations for 2.0 (put off parser compos-ability

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078259#comment-16078259 ] Tim Allison commented on TIKA-2399: --- [~mcaruanagalizia], thank you very much for your contributions on

[jira] [Resolved] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2399. --- Resolution: Fixed Assignee: Tim Allison Fix Version/s: 1.16 We've excluded jj2000 out

Re: [tika] branch master updated: TIKA-1988 -- allow for errors downloading models

2017-07-07 Thread Chris Mattmann
Sure On 7/7/17, 7:57 AM, "Allison, Timothy B." wrote: I'll leave the moving to a new module to you? -Original Message- From: Chris Mattmann [mailto:mattm...@apache.org] Sent: Friday, July 7, 2017 10:32 AM To: dev@tika.apache.org;

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078247#comment-16078247 ] Tim Allison commented on TIKA-2399: --- There is a legal team, but I don't think they do much lobbying.

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078237#comment-16078237 ] Chris A. Mattmann commented on TIKA-1988: - Agree on #3. I'm going to take a first cut at tika-nlp.

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078233#comment-16078233 ] Tim Allison commented on TIKA-1988: --- 3. At some point we should follow [~grossws]'s fantastic TIKA-2245

RE: [tika] branch master updated: TIKA-1988 -- allow for errors downloading models

2017-07-07 Thread Allison, Timothy B.
I'll leave the moving to a new module to you? -Original Message- From: Chris Mattmann [mailto:mattm...@apache.org] Sent: Friday, July 7, 2017 10:32 AM To: dev@tika.apache.org; comm...@tika.apache.org Subject: Re: [tika] branch master updated: TIKA-1988 -- allow for errors downloading

Re: [tika] branch master updated: TIKA-1988 -- allow for errors downloading models

2017-07-07 Thread Chris Mattmann
Great Tim thanks! On 7/7/17, 7:28 AM, "talli...@apache.org" wrote: This is an automated email from the ASF dual-hosted git repository. tallison pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/tika.git The

[jira] [Comment Edited] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078163#comment-16078163 ] Tim Allison edited comment on TIKA-1988 at 7/7/17 2:30 PM: --- 1. No idea. 2. Yes,

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078163#comment-16078163 ] Tim Allison commented on TIKA-1988: --- 1. No idea. 2. Yes, rather. :D > Age Detection Tika Recogniser >

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078155#comment-16078155 ] Chris A. Mattmann commented on TIKA-1988: - #1 - absolutely - i thought putting the model download

[jira] [Reopened] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-1988: --- 1) Would it be possible to allow for failure to get/find models? Failed to execute goal

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-07 Thread Matthew Caruana Galizia (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078012#comment-16078012 ] Matthew Caruana Galizia commented on TIKA-2399: --- OK. I can't think of any other option for

[jira] [Commented] (TIKA-2399) Version conflict with non-ASL jai-imageio-jpeg2000 and edu.ucar jj2000

2017-07-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16077929#comment-16077929 ] Tim Allison commented on TIKA-2399: --- Unless I hear differently, I'll exclude jj2000 and roll 1.16-rc1 in

[jira] [Commented] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16077700#comment-16077700 ] Hudson commented on TIKA-1988: -- FAILURE: Integrated in Jenkins build Tika-trunk #1319 (See

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16077692#comment-16077692 ] Chris A. Mattmann commented on TIKA-2298: - docs added here:

[jira] [Resolved] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved TIKA-1988. - Resolution: Fixed - merged into master thanks [~msha...@usc.edu], [~tgow...@gmail.com] and

[jira] [Updated] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-1988: Labels: age machine_learning memex nlp opennlp (was: age memex nlp opennlp) > Age Detection

[jira] [Updated] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-1988: Fix Version/s: 1.16 > Age Detection Tika Recogniser > - > >

[jira] [Assigned] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned TIKA-1988: --- Assignee: Chris A. Mattmann > Age Detection Tika Recogniser >

[jira] [Updated] (TIKA-1988) Age Detection Tika Recogniser

2017-07-07 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-1988: Labels: age memex nlp opennlp (was: ) > Age Detection Tika Recogniser >

Re: Tika 1.15.1? -> 1.16

2017-07-07 Thread Chris Mattmann
OK Tim / all, TIKA-1988 is done! Age resolution is in. Enjoy and proceed with the release, please +1. Cheers, Chris On 7/5/17, 8:37 PM, "Luís Filipe Nassif" wrote: Hi Tim, Taking a fast look at Nick's fix on TIKA-2419 seems conservative to me,