[jira] [Commented] (TIKA-1334) Add presentation layer for results of each run

2017-05-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15994941#comment-15994941 ] Tyler Palsulich commented on TIKA-1334: --- The format should probably be in the form: {noformat

Re: Squashing GitHub pull requests while merging

2016-05-07 Thread Tyler Palsulich
A contributor should be able to squash the commits in the pull request before we merge into the Tika. So, we don't need to mess up Tika's history. Right? Tyler On May 6, 2016 8:41 PM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > Squashing messes up history and atm

Re: JIRA issue?

2016-04-21 Thread Tyler Palsulich
Hi Ben, Sorry for the inconvenience. The infrastructure team had to disable the create and comment features of JIRA for many projects to mitigate spam. Hopefully everything will be back up and running again soon. Thanks for emailing. Tyler Hi, I'd like to create an issue on the JIRA. When I

Re: [VOTE] Apache Tika 1.12 Release Candidate #1

2016-02-13 Thread Tyler Palsulich
A bit late to the party, but +1 from me. Tyler On Thu, Feb 4, 2016 at 1:44 PM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi Chris, > +1 to release this release candidate > Thanks > Lewis > > On Tue, Feb 2, 2016 at 4:24 PM, Lewis John Mcgibbney < > lewis.mcgibb...@gmail.com>

Re: [VOTE] Moving SCM to Git

2016-01-02 Thread Tyler Palsulich
plaining how to use Git at Apache, and how to > >>> use it with Github, and how to use it even in a traditional SVN > >>> sense. The page is here: > >>> > >>> https://wiki.apache.org/tika/UsingGit > >>> > >>> > >>>

RE: NER Parser tests behind proxy?

2015-11-23 Thread Tyler Palsulich
Apologies if i missed a discussion about this earlier, but should we be downloading a model by default? Tyler On Nov 23, 2015 8:03 AM, "Allison, Timothy B." wrote: > The problem comes down to: ModelGetter.groovy which is trying to grab: >

Re: [DISCUSS] Moving to Git

2015-11-18 Thread Tyler Palsulich
+1 from me. Tyler On Nov 18, 2015 6:46 AM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > Hey Team, > > I propose we move to writeable git repos for Tika for our repository. > I mostly interact with Git & Github nowadays even with Tika using the > mirroring and PR

Re: Named Entity Recognition support in trunk

2015-11-18 Thread Tyler Palsulich
That's awesome! Great work. Have we tried running any benchmarks? Tyler On Nov 18, 2015 6:42 AM, "Mattmann, Chris A (3980)" < chris.a.mattm...@jpl.nasa.gov> wrote: > Hey Folks, > > With the commit of TIKA-1787/GH-61 in trunk we now have full integration > of Named Entity Recognition with

Re: [VOTE] Apache Tika 1.11 Release Candidate #1

2015-10-22 Thread Tyler Palsulich
+1 from me -- builds, tests pass, sanity check files parse, and sums look good. But, I get a warning that the signature is not certified with a trusted signature. Tyler On Wed, Oct 21, 2015 at 6:43 AM Allison, Timothy B. wrote: > +0 (some regressions in ppt content) > > I

Re: Tika Tesseract configuration

2015-10-17 Thread Tyler Palsulich
Hi Aditya, The wiki (https://wiki.apache.org/tika/TikaOCR) also had some good information about setting up and configuring Tesseract. Let me know if you have any questions. Thanks, Tyler On Wed, Oct 14, 2015, 6:59 AM Aditya Dhulipala wrote: > Hi Tika devs, > > Scratch that

Re: svn commit: r1706077 - /tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/gdal/TestGDALParser.java

2015-10-01 Thread Tyler Palsulich
Hi Chris, It looks like these two lines are equivalent (assert not null versus assert true not null). Right? Tyler On Wed, Sep 30, 2015, 9:45 AM wrote: > Author: mattmann > Date: Wed Sep 30 16:45:32 2015 > New Revision: 1706077 > > URL:

Re: svn commit: r1706077 - /tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/gdal/TestGDALParser.java

2015-10-01 Thread Tyler Palsulich
+++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++++++ > > > > > > -Original Message- > From: Tyler Palsulich <tpalsul...@g

[jira] [Commented] (TIKA-1743) NetworkParser can create Unbounded Number of Threads

2015-09-22 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903878#comment-14903878 ] Tyler Palsulich commented on TIKA-1743: --- [Copied from the list] This sounds like a great idea! We

Re: [jira] [Created] (TIKA-1743) NetworkParser can create Unbounded Number of Threads

2015-09-22 Thread Tyler Palsulich
This sounds like a great idea! We should make the size of the pool configurable with TikaConfig. On Tue, Sep 22, 2015, 3:04 PM Bob Paulin (JIRA) wrote: > Bob Paulin created TIKA-1743: > > > Summary: NetworkParser can create

Re: [ANNOUNCE] Welcome Bob Paulin as Tika Committer + PMC Member

2015-09-16 Thread Tyler Palsulich
Welcome! On Wed, Sep 16, 2015, 6:37 PM Allison, Timothy B. wrote: > Welcome! Great to have you on board! > > Cheers, > > Tim > > -Original Message- > From: Bob Paulin [mailto:b...@bobpaulin.com] > Sent: Wednesday, September 16, 2015 9:16 PM > To:

[jira] [Commented] (TIKA-1672) Integrate tika-java7 component

2015-08-30 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14722705#comment-14722705 ] Tyler Palsulich commented on TIKA-1672: --- Hmm. Maybe we should rename the module

Re: [ANNOUNCE] Apache Tika 1.10 release

2015-08-08 Thread Tyler Palsulich
Thanks, Dave! On Sat, Aug 8, 2015, 7:01 AM David Meikle dmei...@apache.org wrote: The Apache Tika project is pleased to announce the release of Apache Tika 1.10. The release contents have been pushed out to the main Apache release site and to the Central sync, so the releases should be

Re: [VOTE] Apache Tika 1.10 Release Candidate #1

2015-08-04 Thread Tyler Palsulich
Everything looks good to me! +1 Thanks, Dave! Tyler On Tue, Aug 4, 2015, 6:48 AM Ken Krugler kkrugler_li...@transpac.com wrote: +1 Built on Mac, tested with Bixo. -- Ken From: David Meikle Sent: August 2, 2015 12:15:24am PDT To: dev@tika.apache.org; u...@tika.apache.org Subject:

[jira] [Commented] (TIKA-1362) Add GoogleTranslate implementation of Translation API

2015-07-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14623246#comment-14623246 ] Tyler Palsulich commented on TIKA-1362: --- If you have a pressing need for better

[jira] [Created] (TIKA-1672) Integrate tika-java7 component

2015-07-02 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1672: - Summary: Integrate tika-java7 component Key: TIKA-1672 URL: https://issues.apache.org/jira/browse/TIKA-1672 Project: Tika Issue Type: Improvement

[jira] [Resolved] (TIKA-1536) Upgrade compiler definition in pom's to Java 7

2015-07-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1536. --- Resolution: Fixed Upgraded in r1688779. Thanks, all. Will open a new issue regarding

[jira] [Commented] (TIKA-1536) Upgrade compiler definition in pom's to Java 7

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605772#comment-14605772 ] Tyler Palsulich commented on TIKA-1536: --- Yep, see http://apache.markmail.org/thread

[jira] [Closed] (TIKA-1481) TikaJAXRS get metadata calls give different results

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1481. - Resolution: Not A Problem Hi [~arbuzovada]. Sorry for the trouble! Did you make sure to respond

[jira] [Resolved] (TIKA-756) XMP output from Tika CLI

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-756. -- Resolution: Fixed Marking this as Fixed, since there are a few more references to tika-parser

[jira] [Closed] (TIKA-1429) Unable to View a 9mb file even after setting a large Heap Size of 3GB while TIKA GUI

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1429. - Resolution: Not A Problem Closing this as not a problem. The file needs to be kept in memory

[jira] [Commented] (TIKA-1493) Update for JAXRS page with details on passing password

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605292#comment-14605292 ] Tyler Palsulich commented on TIKA-1493: --- Can someone familiar with the latest

[jira] [Closed] (TIKA-1552) Pdf document parser

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1552. - Resolution: Not A Problem Marking this as not a problem, since Adobe Reader also adds white space

[jira] [Closed] (TIKA-1452) parser.parse() throws exception after which the procesed file is not getting renamed/moved/deleted

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1452. - Resolution: Not A Problem I'm closing this as not a problem. But, please feel free to reopen

[jira] [Closed] (TIKA-1439) PDF embeded with document can not parse.

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1439. - Resolution: Duplicate PDF embeded with document can not parse

[jira] [Updated] (TIKA-1233) PDFBox can throw StringIndexOutOfBoundsException on some dates

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1233: -- Fix Version/s: (was: 1.6) 1.10 PDFBox can throw

[jira] [Resolved] (TIKA-1585) Create Example Website with Form Submission

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1585. --- Resolution: Fixed Good idea, [~lewismc]. I added it to http://people.apache.org/~tpalsulich

[jira] [Commented] (TIKA-1536) Upgrade compiler definition in pom's to Java 7

2015-06-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605300#comment-14605300 ] Tyler Palsulich commented on TIKA-1536: --- Now that 1.9 is released, are there any

Re: Troubleshooting guide

2015-06-24 Thread Tyler Palsulich
Looks good! Thanks, Nick. Tyler On Wed, Jun 24, 2015 at 2:42 PM Nick Burch apa...@gagravarr.org wrote: Hi All I've had a go at writing up a troubleshooting guide on the wiki, hopefully covering the main problems people face (content detected wrong, parser missing etc). It's linked from the

Re: Configuring parsers and translators

2015-06-13 Thread Tyler Palsulich
++ -Original Message- From: Tyler Palsulich tpalsul...@gmail.com Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Saturday, June 6, 2015 at 3:59 PM To: dev@tika.apache.org dev@tika.apache.org Subject: Re: Configuring parsers and translators

[jira] [Closed] (TIKA-1199) Tika extracts weird signs instead of text

2015-06-09 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1199. - Resolution: Not A Problem Tika extracts weird signs instead of text

[jira] [Resolved] (TIKA-1630) Mention APK support in List of Supported Formats

2015-06-09 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1630. --- Resolution: Fixed Fix Version/s: 1.9 Assignee: Tyler Palsulich Bolded

Re: [VOTE] Release Apache Tika 1.9 Candidate #2

2015-06-09 Thread Tyler Palsulich
+1 from me. Thanks for running this, Chris! Tyler On Mon, Jun 8, 2015 at 1:11 PM Allison, Timothy B. talli...@mitre.org wrote: +1 Built in Windows and Linux. Works on problems (that I caused!) in rc1. Let's make sure to include last Java 1.6 version in the release notes, if that's what

Re: Configuring parsers and translators

2015-06-06 Thread Tyler Palsulich
Hi Nick, I've been mulling this over since you sent the first message. But, I'm afraid I don't have a good solution or developed ideas. I agree, it would be very nice to consolidate all configuration for all parsers in the server and app. Is it feasible to put everything into tika-config? Then

[jira] [Commented] (TIKA-1652) Tika Server should allow config file override from the command line like Tika App

2015-06-06 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14575986#comment-14575986 ] Tyler Palsulich commented on TIKA-1652: --- I think this is a duplicate of TIKA-1426

Re: Configuring parsers and translators

2015-06-06 Thread Tyler Palsulich
++ -Original Message- From: Tyler Palsulich tpalsul...@gmail.com Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Saturday, June 6, 2015 at 3:45 PM To: dev@tika.apache.org dev@tika.apache.org Subject: Re: Configuring parsers and translators Hi Nick, I've been mulling

Re: [DISCUSS] Thinking about completely refactoring the ExternalParser and using commons-exec

2015-05-25 Thread Tyler Palsulich
On Mon, May 25, 2015 at 4:05 PM, Nick Burch apa...@gagravarr.org wrote: On Mon, 25 May 2015, Mattmann, Chris A (3980) wrote: ExternalParser is way broke. I have some patches that somewhat fix it, but in doing so, I realized, why not just use commons-exec? I realize that this is another

Re: Any reason we removed the links to other downstream Tika APIs off the main web site?

2015-05-20 Thread Tyler Palsulich
Hi Chris, I may have botched the version of the index on the site (see the other thread with Nick's comments.) I'll investigate more tonight or tomorrow, if you don't beat me to it. Tyler On May 20, 2015 4:39 PM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: Hey Folks,

[jira] [Commented] (TIKA-1624) Syntax error in DOAP file release section

2015-05-20 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14553281#comment-14553281 ] Tyler Palsulich commented on TIKA-1624: --- Thanks, Ken. I published the file a few

Re: Any reason we removed the links to other downstream Tika APIs off the main web site?

2015-05-20 Thread Tyler Palsulich
believe the versioning issues from the other thread have been resolved. Tyler On Wed, May 20, 2015 at 5:54 PM, Tyler Palsulich tpalsul...@gmail.com wrote: Hi Chris, I may have botched the version of the index on the site (see the other thread with Nick's comments.) I'll investigate more tonight

[jira] [Commented] (TIKA-1630) Mention APK support in List of Supported Formats

2015-05-20 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14553272#comment-14553272 ] Tyler Palsulich commented on TIKA-1630: --- That is a very good point

[jira] [Commented] (TIKA-1630) Mention APK support in List of Supported Formats

2015-05-14 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544104#comment-14544104 ] Tyler Palsulich commented on TIKA-1630: --- Hi. Thanks for reporting this! Can you

Published Site Changes

2015-05-14 Thread Tyler Palsulich
Hi Everyone, I was about to update the site for TIKA-1619 (checksums wrong on the site), but found unpublished changes in the site. This is the status after checking out the repo and running `mvn install`: ➜ site svn status M publish/1.7/examples.html M publish/1.8/examples.html M

[jira] [Commented] (TIKA-1624) Syntax error in DOAP file release section

2015-05-14 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544150#comment-14544150 ] Tyler Palsulich commented on TIKA-1624: --- [~kkrugler], yes. I just updated the release

Re: Translation API question

2015-05-05 Thread Tyler Palsulich
Hi Sergey, Unfortunately, not yet. See TIKA-1328. Tyler On Tue, May 5, 2015 at 4:51 PM, Sergey Beryozkin sberyoz...@gmail.com wrote: Hi All Is it possible to submit a document to the Translation API and get the translated words as a sequence of events ? For example, with a regular Tika

Re: Java 1.6 support for Tika 1.9?

2015-04-27 Thread Tyler Palsulich
I should have included the fact this is the last release planned to support Java 1.6 in the announcement (as we talked about a while back). But, since that has passed, should we just update the announcement on the website, wait another release, or just drop Java 1.6 support when we release 1.9? I

Re: comparing Tika's file detect with other tools?

2015-04-22 Thread Tyler Palsulich
Hi Tim, I do not know about if there would be licensing concerns. But, we do have TIKA-289 to track merging magic bytes from `file` into Tika. Tyler On Wed, Apr 22, 2015 at 10:40 AM, Ken Krugler kkrugler_li...@transpac.com wrote: Hi Tim, I don't believe there's any issue with comparing

[jira] [Commented] (TIKA-1585) Create Example Website with Form Submission

2015-04-22 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14507259#comment-14507259 ] Tyler Palsulich commented on TIKA-1585: --- Is there an Apache hosted location we'd like

Re: NUTCH-1994 and UCAR Dependencies

2015-04-21 Thread Tyler Palsulich
Hi Lewis, I also tried upgrading Tika in Nutch. But, ran into the same issue (but, udunits is found, as expected): [ivy:retrieve] :: [ivy:retrieve] :: UNRESOLVED DEPENDENCIES :: [ivy:retrieve]

[jira] [Commented] (TIKA-1607) Introduce new HashMapString, Object data structure for persitsence of Tika Metadata

2015-04-20 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14503778#comment-14503778 ] Tyler Palsulich commented on TIKA-1607: --- Good idea! What if you created a subclass

[ANNOUNCE] Apache Tika 1.8 Released

2015-04-20 Thread Tyler Palsulich
not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: https://people.apache.org/keys/group/tika.asc For more information on Apache Tika, visit the project home page: http://tika.apache.org/ -- Tyler

[RESULT] [VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-20 Thread Tyler Palsulich
Hi Everyone, The VOTE to release Tika 1.8 RC #2 has passed with the following tally: +1: Chris Mattmann Hong-Thai Nguyen Konstantin Gribov Lewis John Mcgibbney Oleg Tikhonov Tim Allison Tyler Palsulich ±0: None -1: None I'll move forward with the release process now. Thank you all for your

Re: [VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-20 Thread Tyler Palsulich
Thank you, Everyone! I'll move forward now. Lewis, KEYS are here: https://people.apache.org/keys/group/tika.asc. Of course, I'm also +1. Tyler On Mon, Apr 20, 2015 at 3:47 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Folks, On Thu, Apr 16, 2015 at 2:42 PM,

Re: [VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-19 Thread Tyler Palsulich
for https://issues.apache.org/jira/browse/TIKA-1606? It's a simple change (bumping the Guava version), but as seen this can have unexpected consequences. I'm fine either way. -- Ken From: Tyler Palsulich Sent: April 18, 2015 8:29:22pm PDT To: dev@tika.apache.org Subject: RE: [VOTE

RE: [VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-18 Thread Tyler Palsulich
we should upgrade to at least 11.0.2 But I don't know if this is enough of an issue to require another RC. -- Ken PS - I've created https://issues.apache.org/jira/browse/TIKA-1606 to track this. From: Tyler Palsulich Sent: April 13, 2015 10:56:29am PDT To: dev@tika.apache.org, u

[jira] [Closed] (TIKA-1266) Tika OSGI Bundle needs Bundle-ClassPath to work in Equinox

2015-04-16 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1266. - Resolution: Not A Problem Thanks, [~bobpaulin]! Tika OSGI Bundle needs Bundle-ClassPath to work

[VOTE] Apache Tika 1.8 Release Candidate #2

2015-04-13 Thread Tyler Palsulich
Hi Folks, A candidate for the Tika 1.8 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.8-rc2/ The SHA1 checksum of the archive is

[jira] [Commented] (TIKA-1593) Doco: Broken link to Parser Quick Start Guide

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492638#comment-14492638 ] Tyler Palsulich commented on TIKA-1593: --- See https://svn.apache.org/repos/asf/tika

[jira] [Resolved] (TIKA-1593) Doco: Broken link to Parser Quick Start Guide

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1593. --- Resolution: Fixed Assignee: Tyler Palsulich Fixed in r1673240. Thank you [~bhamail

[jira] [Comment Edited] (TIKA-1593) Doco: Broken link to Parser Quick Start Guide

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14492662#comment-14492662 ] Tyler Palsulich edited comment on TIKA-1593 at 4/13/15 5:02 PM

[jira] [Resolved] (TIKA-1600) Unable to parse ODT files because of failed to close temporary resources

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1600. --- Resolution: Fixed Assignee: Hong-Thai Nguyen Thanks, [~thaichat04]! I just updated

[jira] [Updated] (TIKA-1600) Unable to parse ODT files because of failed to close temporary resources

2015-04-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1600: -- Priority: Blocker (was: Major) Unable to parse ODT files because of failed to close temporary

Re: [VOTE] Release Apache Tika 1.8 Candidate #1

2015-04-13 Thread Tyler Palsulich
? -Original Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Sunday, April 12, 2015 12:12 AM To: dev@tika.apache.org Subject: Re: [VOTE] Release Apache Tika 1.8 Candidate #1 Are we ready for another RC? I'd like to make sure the above issues are (believed to be) settled

Re: [VOTE] Release Apache Tika 1.8 Candidate #1

2015-04-11 Thread Tyler Palsulich
Are we ready for another RC? I'd like to make sure the above issues are (believed to be) settled before the next cut. Thanks, Tyler On Apr 10, 2015 4:55 PM, David Meikle loo...@gmail.com wrote: On 10 Apr 2015, at 11:38, Allison, Timothy B. talli...@mitre.org wrote: I agree that the ODT

Re: [VOTE] Release Apache Tika 1.8 Candidate #1

2015-04-07 Thread Tyler Palsulich
CC'ing user@tika for visibility. Tyler On Tue, Apr 7, 2015 at 4:54 PM, Tyler Palsulich tpalsul...@apache.org wrote: Hi Folks, A candidate for the Tika 1.8 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources

[VOTE] Release Apache Tika 1.8 Candidate #1

2015-04-07 Thread Tyler Palsulich
Hi Folks, A candidate for the Tika 1.8 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.8-rc1/ The SHA1 checksum of the archive is

[jira] [Closed] (TIKA-1592) It seems dbus and x11 server are invoked, and fails for some reason too

2015-04-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1592. - Resolution: Invalid Closing as Invalid. Feel free to create additional issues if you run

[jira] [Commented] (TIKA-1592) It seems dbus and x11 server are invoked, and fails for some reason too

2015-04-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14393246#comment-14393246 ] Tyler Palsulich commented on TIKA-1592: --- I tried building ikube on a Mac, but I ran

[jira] [Commented] (TIKA-1592) It seems dbus and x11 server are invoked, and fails for some reason too

2015-04-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14393184#comment-14393184 ] Tyler Palsulich commented on TIKA-1592: --- Thanks for reporting this, [~michaelcouck

[jira] [Comment Edited] (TIKA-1592) It seems dbus and x11 server are invoked, and fails for some reason too

2015-04-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14393184#comment-14393184 ] Tyler Palsulich edited comment on TIKA-1592 at 4/2/15 7:09 PM

Re: Access Control Allow Origin

2015-04-01 Thread Tyler Palsulich
it to support more properties but great for a start Thanks, Sergey On 27/03/15 18:56, Tyler Palsulich wrote: Thank you, Sergey! I didn't know about that feature. I am going to try to work up a patch this weekend which enables CORS. I'll let you know if I run into any issues. Thanks again

Re: Access Control Allow Origin

2015-04-01 Thread Tyler Palsulich
option to -C and use -c for the tika config file? -Original Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Wednesday, April 01, 2015 11:13 AM To: dev@tika.apache.org Subject: Re: Access Control Allow Origin Thank you for the feedback! I think there's an issue

[jira] [Comment Edited] (TIKA-1585) Create Example Website with Form Submission

2015-04-01 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14390841#comment-14390841 ] Tyler Palsulich edited comment on TIKA-1585 at 4/1/15 3:51 PM

Re: [jira] [Commented] (TIKA-1330) Add robust tika-batch code

2015-04-01 Thread Tyler Palsulich
All tests are passing. Only issue I see is excessive logging. The Hudson failure does just look like a hiccup. Tyler On Wed, Apr 1, 2015 at 2:55 PM, Allison, Timothy B. talli...@mitre.org wrote: This looks like a Hudson hiccup. Tyler is seeing excessive logging: Running

[jira] [Updated] (TIKA-1558) Create a Parser Blacklist

2015-03-31 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1558: -- Description: As talked about in TIKA-1555 and TIKA-1557, it would be nice to be able to disable

[jira] [Comment Edited] (TIKA-1558) Create a Parser Blacklist

2015-03-31 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1432#comment-1432 ] Tyler Palsulich edited comment on TIKA-1558 at 3/31/15 9:41 PM

Re: including refactored docs from govdocs1 in test suite

2015-03-30 Thread Tyler Palsulich
. Best, Tim -Original Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Sunday, March 29, 2015 9:13 AM To: dev@tika.apache.org Subject: Re: [DISCUSS] Tika 1.8 or 1.7.1 Once TIKA-1584 and TIKA-1575 are resolved, I'll work up an RC (unless

[jira] [Commented] (TIKA-1587) ForkParser::setJavaCommand should take ListString

2015-03-30 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386685#comment-14386685 ] Tyler Palsulich commented on TIKA-1587: --- Thank you for reporting this! It seems like

RE: including refactored docs from govdocs1 in test suite

2015-03-30 Thread Tyler Palsulich
- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Monday, March 30, 2015 9:24 AM To: dev@tika.apache.org Subject: Re: including refactored docs from govdocs1 in test suite Can you copy the hyperlink into a new doc and change the URL? I have no idea about including the modified version

Re: [DISCUSS] Tika 1.8 or 1.7.1

2015-03-30 Thread Tyler Palsulich
including ~4MB of pdfs (README.NLDAS1.pdf and README.NLDAS2.pdf) from the GRIB(?) parser in the tika-app and tika-server jars. Best, Tim -Original Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Sunday, March 29, 2015 9:13 AM To: dev@tika.apache.org

[jira] [Comment Edited] (TIKA-1584) Tika 1.7 possible regression (nested attachment files not getting parsed)

2015-03-30 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386906#comment-14386906 ] Tyler Palsulich edited comment on TIKA-1584 at 3/30/15 4:05 PM

[jira] [Updated] (TIKA-1575) Upgrade to PDFBox 1.8.9 when available

2015-03-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1575: -- Fix Version/s: 1.8 Upgrade to PDFBox 1.8.9 when available

Re: [DISCUSS] Tika 1.8 or 1.7.1

2015-03-29 Thread Tyler Palsulich
Once TIKA-1584 and TIKA-1575 are resolved, I'll work up an RC (unless something else pops up). Thank you everyone. Tyler On Mar 29, 2015 4:43 AM, Hong-Thai Nguyen thaicha...@gmail.com wrote: +1 for 1.8 Hong-Thai On 28 Mar 2015, at 16:01, Tyler Palsulich tpalsul...@apache.org wrote: Hi

[jira] [Resolved] (TIKA-1579) Add file type to NetCDFParser

2015-03-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1579. --- Resolution: Fixed Add file type to NetCDFParser

Re: [DISCUSS] Tika 1.8 or 1.7.1

2015-03-28 Thread Tyler Palsulich
of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Tyler Palsulich tpalsul...@apache.org Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Saturday, March 28, 2015 at 8:01 AM To: dev

[jira] [Commented] (TIKA-1584) Tika 1.7 possible regression (nested attachment files not getting parsed)

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385483#comment-14385483 ] Tyler Palsulich commented on TIKA-1584: --- We now have two major issues which need

[jira] [Created] (TIKA-1585) Create Example Website with Form Submission

2015-03-28 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1585: - Summary: Create Example Website with Form Submission Key: TIKA-1585 URL: https://issues.apache.org/jira/browse/TIKA-1585 Project: Tika Issue Type: New

[jira] [Resolved] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1526. --- Resolution: Fixed Marking this as Fixed, per the above comments. [~thetaphi] or [~hossman

[jira] [Commented] (TIKA-1581) jhighlight license concerns

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385337#comment-14385337 ] Tyler Palsulich commented on TIKA-1581: --- Hi [~kkrugler]. Thanks. The comment is now

[jira] [Created] (TIKA-1586) Enable CORS on Tika Server

2015-03-28 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1586: - Summary: Enable CORS on Tika Server Key: TIKA-1586 URL: https://issues.apache.org/jira/browse/TIKA-1586 Project: Tika Issue Type: New Feature

[jira] [Resolved] (TIKA-1586) Enable CORS on Tika Server

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1586. --- Resolution: Fixed Fixed in r1669799. Enable CORS on Tika Server

[jira] [Commented] (TIKA-1585) Create Example Website with Form Submission

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385411#comment-14385411 ] Tyler Palsulich commented on TIKA-1585: --- CORS work is now integrated. [~talli

[DISCUSS] Tika 1.8 or 1.7.1

2015-03-28 Thread Tyler Palsulich
Hi Folks, Now that TIKA-1581 (JHighlight licensing issues) is resolved, we need to release a new version of Tika. I'll volunteer to be the release manager again. Should we release this as 1.8 or 1.7.1? Does anyone have any last minute issues they'd like to finish and see in Tika 1.X? I'd like

[jira] [Commented] (TIKA-1586) Enable CORS on Tika Server

2015-03-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385372#comment-14385372 ] Tyler Palsulich commented on TIKA-1586: --- Can someone take a look at the above PR

[jira] [Closed] (TIKA-1354) ForkParser doesn't work in OSGI container

2015-03-27 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1354. - Resolution: Fixed Fix Version/s: 1.7 Marking as Fixed. ForkParser doesn't work in OSGI

Enabling CORS

2015-03-27 Thread Tyler Palsulich
Hi Folks, I'm trying to enable CORS on a few of Tika's Server resources. But, after adding the pom.xml dependency and a @CrossOriginResourceSharing( allowOrigins = {url} ) annotation to the resources, the Access-Control-Allow-Origin header is still not given. Is there another

  1   2   3   4   5   6   7   8   >