Re: Build failure of OSGi bundle, Gsoc2015.

2015-02-14 Thread Tyler Palsulich
Java 1.7 on Win8.1, so I set up Ubuntu 14.10 on a virtual machine and then built the code using Java 1.7 (First with 1.8, which again failed on Ubuntu14.10). Finally it was successful :) Thank you very much for help :) Regards, Abhinav. On Wed, Feb 11, 2015 at 11:45 PM, Tyler Palsulich tpalsul

Re: svn commit: r1658847 - /tika/trunk/tika-server/pom.xml

2015-02-11 Thread Tyler Palsulich
Hi All, Responses inline. On Wed, Feb 11, 2015 at 7:35 AM, Allison, Timothy B. talli...@mitre.org wrote: I'm working behind a proxy and getting a new proxy error (proxy unacknowledged) with r1658847 on tika-server package. That seems odd... Would adding another pluginRepository cause that?

Re: Build failure of OSGi bundle, Gsoc2015.

2015-02-11 Thread Tyler Palsulich
to work on Tika-1456. Could you please guide me on how to participate, fix patches and write a proposal ? [0] : http://pastebin.com/MZPq2dji On Tue, Feb 10, 2015 at 11:36 PM, Tyler Palsulich tpalsul...@gmail.com wrote: HI Abhinav, Whoops, yes, I forgot to add the link to the previous

[jira] [Commented] (TIKA-1548) System property added while catching exception on parsing PDF encrypted doc

2015-02-11 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14316787#comment-14316787 ] Tyler Palsulich commented on TIKA-1548: --- I'm not seeing any mentions in Tika, either

Re: Error installation (Build Failure at OSGi Bundle)

2015-02-11 Thread Tyler Palsulich
Wrong thread/list? Cheers, Tyler On Wed, Feb 11, 2015 at 1:14 PM, Abhinav Gupta abhinavgupta2...@gmail.com wrote: Hi Myrna, Thanks for the help :) As Bryan had suggested I'm able to execute ant junit-system-mini and ant junit-all. I am new to the open source and somehow I had managed to

[jira] [Commented] (TIKA-1269) Self-hosted documentation for the JAX-RS Server

2015-02-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14315538#comment-14315538 ] Tyler Palsulich commented on TIKA-1269: --- Thanks. But, I think that requires login

[jira] [Created] (TIKA-1545) Create tika-server Frontend

2015-02-10 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1545: - Summary: Create tika-server Frontend Key: TIKA-1545 URL: https://issues.apache.org/jira/browse/TIKA-1545 Project: Tika Issue Type: Improvement

[jira] [Commented] (TIKA-1545) Create tika-server Frontend

2015-02-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14314652#comment-14314652 ] Tyler Palsulich commented on TIKA-1545: --- It looks like TIKA-1269 is semi-done

[jira] [Comment Edited] (TIKA-1545) Create tika-server Frontend

2015-02-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14314652#comment-14314652 ] Tyler Palsulich edited comment on TIKA-1545 at 2/10/15 6:54 PM

[jira] [Updated] (TIKA-1545) Create tika-server Frontend

2015-02-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1545: -- Attachment: TIKA-1545.palsulich.patch Patch which adds a form to the top of the the root server

[jira] [Commented] (TIKA-1269) Self-hosted documentation for the JAX-RS Server

2015-02-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14315305#comment-14315305 ] Tyler Palsulich commented on TIKA-1269: --- Committed patch from [~lewismc] in r1658847

[jira] [Comment Edited] (TIKA-1331) Find/configure a vm and gather initial corpus

2015-02-04 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306355#comment-14306355 ] Tyler Palsulich edited comment on TIKA-1331 at 2/5/15 12:59 AM

[jira] [Commented] (TIKA-1331) Find/configure a vm and gather initial corpus

2015-02-04 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14306355#comment-14306355 ] Tyler Palsulich commented on TIKA-1331: --- That hierarchy looks good to me. So, 2TB

[jira] [Commented] (TIKA-1540) New Tika plugin for image based feature extraction using computer vision techniques

2015-02-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303701#comment-14303701 ] Tyler Palsulich commented on TIKA-1540: --- Will this feature extraction happen

[jira] [Commented] (TIKA-1331) Find/configure a vm and gather initial corpus

2015-02-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14302063#comment-14302063 ] Tyler Palsulich commented on TIKA-1331: --- I formatted and uploaded the notes from

[jira] [Commented] (TIKA-1537) Installation on OSX 10.10.2 generates OutOfMemory Error during parser tests

2015-02-01 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14300954#comment-14300954 ] Tyler Palsulich commented on TIKA-1537: --- Have you tried setting the MAVEN_OPTS

[jira] [Commented] (TIKA-1536) Upgrade compiler definition in pom's to Java 7

2015-01-31 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1422#comment-1422 ] Tyler Palsulich commented on TIKA-1536: --- Should we hold off on this until 2.0

Re: multiple detect call - different results (tika 1.7)

2015-01-29 Thread Tyler Palsulich
Thanks Konstantin and Gabriele! Please feel free to email any other questions or open an issue on the Tika JIRA. Have a good day! Tyler On Jan 29, 2015 11:43 AM, Gabriele Guidi gabriele.gu...@eng.it wrote: Ok, thank you for your support Best regards 2015-01-29 15:14 GMT+01:00 Konstantin

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-29 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14297614#comment-14297614 ] Tyler Palsulich commented on TIKA-1518: --- 2. Sent a message. Andrew Bayer responded

Re: TIKA-1423 Build a parser to extract data from GRIB formats not good with Java 6

2015-01-29 Thread Tyler Palsulich
+1 Tyler On Jan 29, 2015 9:52 PM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: +1 move to 1.7 Sent from my iPhone On Jan 29, 2015, at 5:04 PM, Allison, Timothy B. talli...@mitre.org wrote: +1 to dropping 1.6...let's move to 1.8 and beyond! :) -Original

[jira] [Commented] (TIKA-1517) MIME type selection with probability

2015-01-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296103#comment-14296103 ] Tyler Palsulich commented on TIKA-1517: --- Hi [~Lukeliush]. Thanks for raising

[jira] [Comment Edited] (TIKA-1517) MIME type selection with probability

2015-01-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296103#comment-14296103 ] Tyler Palsulich edited comment on TIKA-1517 at 1/29/15 12:04 AM

[jira] [Comment Edited] (TIKA-1521) Handle password protected 7zip files

2015-01-27 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14294102#comment-14294102 ] Tyler Palsulich edited comment on TIKA-1521 at 1/27/15 8:09 PM

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-27 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14294100#comment-14294100 ] Tyler Palsulich commented on TIKA-1526: --- Do we know if the update fixed the issue? We

[jira] [Commented] (TIKA-1521) Handle password protected 7zip files

2015-01-27 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14294102#comment-14294102 ] Tyler Palsulich commented on TIKA-1521: --- Does anyone else have the test passing

[jira] [Commented] (TIKA-1529) Turn forbidden-apis back on

2015-01-23 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289405#comment-14289405 ] Tyler Palsulich commented on TIKA-1529: --- +1 to {{RuntimeException}}. Turn forbidden

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-23 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289419#comment-14289419 ] Tyler Palsulich commented on TIKA-1526: --- This is exactly how I saw the bug. I

[jira] [Commented] (TIKA-1529) Turn forbidden-apis back on

2015-01-23 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289402#comment-14289402 ] Tyler Palsulich commented on TIKA-1529: --- Yes, Locale.ROOT is OK. Turn forbidden

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287929#comment-14287929 ] Tyler Palsulich commented on TIKA-1526: --- Yup! I can test this by EOD today, if no one

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14288365#comment-14288365 ] Tyler Palsulich commented on TIKA-1526: --- I saw the error once, but haven't been able

[jira] [Reopened] (TIKA-1521) Handle password protected 7zip files

2015-01-22 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich reopened TIKA-1521: --- I'm getting a test failure with this -- same as https://builds.apache.org/job/tika-trunk-jdk1.6

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287766#comment-14287766 ] Tyler Palsulich commented on TIKA-1526: --- I think we should catch the posix_spawn

[jira] [Commented] (TIKA-1519) Don't allow whatever is in http-equiv Content-Type to overwrite actual Content-Type in HtmlParser

2015-01-20 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284945#comment-14284945 ] Tyler Palsulich commented on TIKA-1519: --- Those seem reasonable to me. And, I agree

Re: [ANNOUNCE] Apache Tika 1.7 Released

2015-01-16 Thread Tyler Palsulich
.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Tyler Palsulich [mailto:tpalsul...@apache.org] Sent: Thursday, January 15, 2015 7:43 PM To: dev@tika.apache.org; u...@tika.apache.org Subject: [ANNOUNCE] Apache Tika 1.7

[RESULT] [VOTE] Apache Tika 1.7 Release Candidate #3

2015-01-15 Thread Tyler Palsulich
Hi All, The VOTE for releasing Apache Tika 1.7 RC#3 finished with the following tally: +1: Chris Mattmann David Meikle Hong-Thai Nguyen Nick Burch Tim Allison Tyler Palsulich +0: [None] -1: [None] Thank you everyone for voting! I will move forward with the release. Have a good day, Tyler

Re: [VOTE] Apache Tika 1.7 Release

2015-01-15 Thread Tyler Palsulich
Found it: https://github.com/chrismattmann/apachestuff/blob/master/extract-tika-contribs :) Thanks! Tyler On Thu, Jan 15, 2015 at 8:57 AM, Tyler Palsulich tpalsul...@gmail.com wrote: Thanks, Chris! That sounds useful. Let me know when you get a chance to upload it somewhere. Tyler On Wed

[ANNOUNCE] Apache Tika 1.7 Released

2015-01-15 Thread Tyler Palsulich
not be available on all mirrors. When downloading from a mirror site, please remember to verify the downloads using signatures found on the Apache site: https://people.apache.org/keys/group/tika.asc For more information on Apache Tika, visit the project home page: http://tika.apache.org/ -- Tyler

Re: [VOTE] Apache Tika 1.7 Release

2015-01-15 Thread Tyler Palsulich
@tika.apache.org Subject: Re: [VOTE] Apache Tika 1.7 Release On Wed, 14 Jan 2015, Tyler Palsulich wrote: Nick, thanks for building the site! We still need to rebuild the index, right? You'll need to build the 1.7 index page (based on the changelog), then update the download page + homepage + menu

Re: [VOTE] Apache Tika 1.7 Release

2015-01-14 Thread Tyler Palsulich
Beryozkin (Release Management) sberyoz...@gmail.com not changed gpg: key 97EDDE66: tallison (apache_distro_keys) talli...@apache.org not changed gpg: key D4F10117: public key Tyler Palsulich tpalsul...@apache.org imported gpg: key 48BAEBF6: Lewis John McGibbney (CODE SIGNING KEY) lewi

Re: [VOTE] Apache Tika 1.7 Release

2015-01-13 Thread Tyler Palsulich
Hi Folks, Let's mark this RC#2 as failed and shift the vote to the updated RC#3 ( http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract metadata fixes and David's test fix. Thanks, Tyler On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer pe...@mapledesign.co.uk wrote: +1. Worked

Re: TestMultiPart tests failing

2015-01-12 Thread Tyler Palsulich
Hi Chris, I'm not getting any test failures from trunk on my Mac. So, I'm also curious what revision you're on. uname -a: Darwin Tylers-MacBook-Pro.local 14.0.0 Darwin Kernel Version 14.0.0: Fri Sep 19 00:26:44 PDT 2014; root:xnu-2782.1.97~2/RELEASE_X86_64 x86_64 tesseract --version: tesseract

[VOTE] Apache Tika 1.7 Release

2015-01-09 Thread Tyler Palsulich
Hi All, A candidate for the Tika 1.7 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.7-rc3/ The SHA1 checksum of the archive is

Re: [IMPORTANT] Please close off Tika Staging repos on repository.apache.org

2015-01-09 Thread Tyler Palsulich
Done. On Fri, Jan 9, 2015 at 6:53 AM, Tyler Palsulich tpalsul...@gmail.com wrote: Hi Lewis, I created them. There should be two right now. I'll drop them this morning, since we're going to cut a new RC with image metadata fixes + Dave's Windows test issue fix. Thanks for keeping an eye

Re: [IMPORTANT] Please close off Tika Staging repos on repository.apache.org

2015-01-09 Thread Tyler Palsulich
Hi Lewis, I created them. There should be two right now. I'll drop them this morning, since we're going to cut a new RC with image metadata fixes + Dave's Windows test issue fix. Thanks for keeping an eye out! Tyler On Jan 9, 2015 12:41 AM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote:

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2015-01-08 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14269800#comment-14269800 ] Tyler Palsulich commented on TIKA-1445: --- Thanks guys! [~tallison], let me know once

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2015-01-07 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267879#comment-14267879 ] Tyler Palsulich commented on TIKA-1445: --- All tests pass with and without Tesseract

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2015-01-07 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14268006#comment-14268006 ] Tyler Palsulich commented on TIKA-1445: --- Done. I made some small changes and split

[VOTE] Apache Tika 1.7 Release

2015-01-05 Thread Tyler Palsulich
Hi All, A candidate for the Tika 1.7 release is available at: https://dist.apache.org/repos/dist/dev/tika/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/tika/tags/1.7-rc2/ The SHA1 checksum of the archive is

Re: 1.7 release? | potential blocker?

2015-01-05 Thread Tyler Palsulich
Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Monday, December 22, 2014 1:58 PM To: dev@tika.apache.org Subject: Re: 1.7 release? Hi All, Nick added the temporary fix for TIKA-1445 and made the POI updates for TIKA-1469 (thanks!). And, I'll volunteer to be the Release

Re: 1.7 release? | potential blocker?

2015-01-05 Thread Tyler Palsulich
trunk tonight (with null check, of course :)). Should I also patch the rc1 branch or will you re-branch from trunk? -Original Message- From: Tyler Palsulich [mailto:tpalsul...@gmail.com] Sent: Monday, January 05, 2015 11:38 AM To: dev@tika.apache.org Subject: Re: 1.7 release

[jira] [Updated] (TIKA-1505) chmparser breaks down when extracting from file of CHM format v3

2015-01-05 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1505: -- Fix Version/s: (was: 1.7) 1.8 chmparser breaks down when extracting from

Re: Multiple parsers for the same MIME type

2015-01-02 Thread Tyler Palsulich
Hi, Both of Jukka's options look good to me. Another option is to modify the existing Parser -- extract the extra information when possible, stick with current behavior if not. We've run into this problem with images by trying to run OCR and extract metadata at the same time. Please see

Re: svn commit: r1648939 - /tika/trunk/KEYS

2015-01-01 Thread Tyler Palsulich
/ ++ Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Tyler Palsulich

[jira] [Closed] (TIKA-1492) tika-app-1.6.jar does not extract any text from CHM file

2014-12-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich closed TIKA-1492. - Resolution: Fixed Assignee: Tyler Palsulich tika-app-1.6.jar does not extract any text from

[jira] [Commented] (TIKA-1465) Implement extraction of non-global variables from netCDF3 and netCDF4

2014-12-24 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14258124#comment-14258124 ] Tyler Palsulich commented on TIKA-1465: --- I made some updates to the parser

[jira] [Resolved] (TIKA-1500) FeedParser extracts XML markup with BodyContentHandler

2014-12-23 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1500. --- Resolution: Fixed Fix Version/s: (was: 1.8) 1.7 Assignee

Re: 1.7 release?

2014-12-22 Thread Tyler Palsulich
++ -Original Message- From: Tyler Palsulich tpalsul...@gmail.com Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Thursday, December 18, 2014 at 9:15 PM To: dev@tika.apache.org dev@tika.apache.org Subject: Re: 1.7 release? I'm OK with trying

[jira] [Commented] (TIKA-1494) JAXRS server: allow passing PDF password in the request

2014-12-19 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14253388#comment-14253388 ] Tyler Palsulich commented on TIKA-1494: --- I expanded the testing in r1646707 to make

[jira] [Comment Edited] (TIKA-1494) JAXRS server: allow passing PDF password in the request

2014-12-19 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14253388#comment-14253388 ] Tyler Palsulich edited comment on TIKA-1494 at 12/19/14 1:46 PM

[jira] [Commented] (TIKA-1489) PDF Text extraction without permission

2014-12-19 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14253400#comment-14253400 ] Tyler Palsulich commented on TIKA-1489: --- bq. People ... will miss content they get

[jira] [Commented] (TIKA-1489) PDF Text extraction without permission

2014-12-19 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14253423#comment-14253423 ] Tyler Palsulich commented on TIKA-1489: --- Ah! I misunderstood -- I was thinking

[jira] [Updated] (TIKA-1495) Parser for BPG (Better Portable Graphics) format

2014-12-18 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1495: -- Affects Version/s: (was: 1..7) 1.7 Parser for BPG (Better Portable

Re: 1.7 release?

2014-12-18 Thread Tyler Palsulich
, Tim From: Oleg Tikhonov [olegtikho...@gmail.com] Sent: Friday, October 24, 2014 2:24 PM To: dev@tika.apache.org Subject: Re: 1.7 release? Hi Tyler, don't mention. Cheers, Oleg On Oct 24, 2014 8:02 PM, Tyler Palsulich tpalsul

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-12-18 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252956#comment-14252956 ] Tyler Palsulich commented on TIKA-1445: --- +1, Nick. That sounds good to me. I'll

Re: 1.7 release?

2014-12-18 Thread Tyler Palsulich
of Southern California, Los Angeles, CA 90089 USA ++ -Original Message- From: Tyler Palsulich tpalsul...@gmail.com Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Thursday, December 18, 2014 at 12:54 PM

[jira] [Created] (TIKA-1496) Upgrade slf4j-log4j12 to version 1.7.7

2014-12-13 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1496: - Summary: Upgrade slf4j-log4j12 to version 1.7.7 Key: TIKA-1496 URL: https://issues.apache.org/jira/browse/TIKA-1496 Project: Tika Issue Type: Improvement

[jira] [Resolved] (TIKA-1496) Upgrade slf4j-log4j12 to version 1.7.7

2014-12-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1496. --- Resolution: Done Done in r1645376. Upgrade slf4j-log4j12 to version 1.7.7

[jira] [Commented] (TIKA-1384) Use tika-parent dependency management for common dependencies

2014-12-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14245778#comment-14245778 ] Tyler Palsulich commented on TIKA-1384: --- Moved slf4j-log4j12 in r1645376 (related

[jira] [Resolved] (TIKA-1384) Use tika-parent dependency management for common dependencies

2014-12-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1384. --- Resolution: Done Fix Version/s: (was: 1.8) 1.7 Final update

Re: FW: FYI Broken link / wrong file extension to download your jar.

2014-12-11 Thread Tyler Palsulich
Hi Billson, Thanks for pointing this out. I just updated the site ( http://tika.apache.org/download.html) to be more clear that the links go to lists of available mirrors. Let us know if you have any more issues or questions! Tyler On Thu, Dec 11, 2014 at 10:25 AM, Mattmann, Chris A (3980)

[jira] [Commented] (TIKA-1423) Build a parser to extract data from GRIB formats

2014-12-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14241568#comment-14241568 ] Tyler Palsulich commented on TIKA-1423: --- I also tried to get this working again

[jira] [Commented] (TIKA-1423) Build a parser to extract data from GRIB formats

2014-12-10 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14241620#comment-14241620 ] Tyler Palsulich commented on TIKA-1423: --- Try adding {{thredds.catalog;resolution

[jira] [Resolved] (TIKA-1218) Unable to parse a mp3 file on 1.5 getting a exception

2014-12-05 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1218. --- Resolution: Fixed Fix Version/s: 1.7 Fixed in r1643411. Unable to parse a mp3 file

[jira] [Commented] (TIKA-1423) Build a parser to extract data from GRIB formats

2014-12-05 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236079#comment-14236079 ] Tyler Palsulich commented on TIKA-1423: --- Thanks [~lewismc]! Please see https

[jira] [Commented] (TIKA-1423) Build a parser to extract data from GRIB formats

2014-12-05 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236112#comment-14236112 ] Tyler Palsulich commented on TIKA-1423: --- I may be wrong, but I think we need the GRIB

[jira] [Commented] (TIKA-1423) Build a parser to extract data from GRIB formats

2014-12-05 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236174#comment-14236174 ] Tyler Palsulich commented on TIKA-1423: --- Thanks [~lewismc]. If you look at http

[jira] [Comment Edited] (TIKA-1423) Build a parser to extract data from GRIB formats

2014-12-05 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236174#comment-14236174 ] Tyler Palsulich edited comment on TIKA-1423 at 12/5/14 9:50 PM

[jira] [Commented] (TIKA-1423) Build a parser to extract data from GRIB formats

2014-12-05 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14236233#comment-14236233 ] Tyler Palsulich commented on TIKA-1423: --- It's a different error -- different module

Re: Confusion

2014-12-04 Thread Tyler Palsulich
subscribe to the Tika list by sending a blank email to dev-subscr...@tika.apache.org and following the instructions from there. Some replies below: We should clear up the instructions on the website to say this explicitly, rather than give a link the the general Apache page. No reason to not

Re: Kill Buildbot Builds

2014-12-04 Thread Tyler Palsulich
I'm also +1 to stop Buildbot builds. Are there any notable configuration differences between Buildbot and Jenkins? Tyler On Wed, Dec 3, 2014 at 2:24 AM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: Thanks Lewis - I think the Jenkins builds are most used, so I would be +1 to

[jira] [Commented] (TIKA-1436) improvement to PDFParser

2014-12-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233799#comment-14233799 ] Tyler Palsulich commented on TIKA-1436: --- Thank you for the patch! I'm sorry

[jira] [Commented] (TIKA-1218) Unable to parse a mp3 file on 1.5 getting a exception

2014-12-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233807#comment-14233807 ] Tyler Palsulich commented on TIKA-1218: --- A simple fix is to not let the size

[jira] [Assigned] (TIKA-1218) Unable to parse a mp3 file on 1.5 getting a exception

2014-12-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich reassigned TIKA-1218: - Assignee: Tyler Palsulich Unable to parse a mp3 file on 1.5 getting a exception

[jira] [Commented] (TIKA-1423) Build a parser to extract data from GRIB formats

2014-12-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14233892#comment-14233892 ] Tyler Palsulich commented on TIKA-1423: --- What is the latest review board

[jira] [Resolved] (TIKA-1167) Embedded object not extracted

2014-12-03 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1167. --- Resolution: Fixed Fix Version/s: (was: 1.8) 1.7 Assignee

Re: Move definitively from SVN to Git ?

2014-11-20 Thread Tyler Palsulich
++ -Original Message- From: Ken Krugler kkrugler_li...@transpac.com Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Thursday, November 20, 2014 at 5:25 AM To: dev@tika.apache.org dev@tika.apache.org Subject: RE: Move definitively from SVN to Git ? From: Tyler

Re: svn commit: r1640535 - /tika/trunk/tika-server/src/main/java/org/apache/tika/server/TikaResource.java

2014-11-19 Thread Tyler Palsulich
On Wed, Nov 19, 2014 at 7:44 AM, dmei...@apache.org wrote: Author: dmeikle Date: Wed Nov 19 12:44:41 2014 New Revision: 1640535 URL: http://svn.apache.org/r1640535 Log: TIKA-1477: Added new custom header to Tika resource override Tesseract OCR language Modified:

Re: svn commit: r1640535 - /tika/trunk/tika-server/src/main/java/org/apache/tika/server/TikaResource. java

2014-11-19 Thread Tyler Palsulich
Found it! http://markmail.org/message/42nc64tdyhvzaril Looks like javax, java, then other. I'll update the site today. Tyler On Wed, Nov 19, 2014 at 10:54 AM, Nick Burch apa...@gagravarr.org wrote: On Wed, 19 Nov 2014, Tyler Palsulich wrote: It looks like imports are being reordered here. I

[jira] [Commented] (TIKA-1483) Create a general raw string parser

2014-11-19 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14218068#comment-14218068 ] Tyler Palsulich commented on TIKA-1483: --- Definitely agree. This would be really nice

[jira] [Commented] (TIKA-1473) Apache Tika is not working for .docx documents

2014-11-19 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14218120#comment-14218120 ] Tyler Palsulich commented on TIKA-1473: --- Hi, Have you tried increasing the memory

[jira] [Commented] (TIKA-1302) Let's run Tika against a large batch of docs nightly

2014-11-19 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14218825#comment-14218825 ] Tyler Palsulich commented on TIKA-1302: --- I just got access to an HPC cluster at NYU

Re: Move definitively from SVN to Git ?

2014-11-19 Thread Tyler Palsulich
On Mon, Nov 17, 2014 at 6:23 AM, Nick Burch apa...@gagravarr.org wrote: Given that non-committers can already work with Git, could you explain what committers would gain from the move to Git which would outweigh the effort that SVN-using committers would have to expend with the move?

Re: error Unsupported Media Type : while implementing ContentStreamUpdateRequestExample from the link http://wiki.apache.org/solr/ContentStreamUpdateRequestExample

2014-11-13 Thread Tyler Palsulich
Hi, Thanks for raising this issue! Can you give a complete stacktrace with the error? Are you able to run Tika by itself on the file? What is the returned MediaType? Thanks, Tyler On Thu, Nov 13, 2014 at 5:38 AM, raju lovaraju4j...@gmail.com wrote: Hi Team, I am getting the error

Re: Tika Api consumes given stream

2014-11-13 Thread Tyler Palsulich
Shot in the dark here, as I haven't tried this. But, have you tried using mark/reset on the TikaInputStream? That should forward the requests on to the underlying InputStream and hopefully work. Tyler On Wed, Nov 12, 2014 at 1:22 PM, Runomu celikonur@gmail.com wrote: I use Apache Tika

[jira] [Resolved] (TIKA-1470) Error installing Tika

2014-11-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1470. --- Resolution: Fixed Fix Version/s: 1.7 Assignee: Tyler Palsulich Looks like

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14211277#comment-14211277 ] Tyler Palsulich commented on TIKA-1445: --- [~talli...@apache.org], what was the system

[jira] [Created] (TIKA-1475) Reformat pom.xml files

2014-11-13 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1475: - Summary: Reformat pom.xml files Key: TIKA-1475 URL: https://issues.apache.org/jira/browse/TIKA-1475 Project: Tika Issue Type: Task Reporter

[jira] [Resolved] (TIKA-1475) Reformat pom.xml files

2014-11-13 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich resolved TIKA-1475. --- Resolution: Done Fix Version/s: 1.7 Done in r1639521. Reformat pom.xml files

Re: Review Request 27414: GRIB Parser for TIKA

2014-11-02 Thread Tyler Palsulich
: --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/27414/ --- (Updated Nov. 2, 2014, 3:17 p.m.) Review request for tika, Lewis McGibbney, Chris Mattmann, and Tyler Palsulich

Re: Review Request 27414: GRIB Parser for TIKA

2014-11-02 Thread Tyler Palsulich
://reviews.apache.org//r/27414/#fcomment47 Optional style comment: Can do a foreach loop. - Tyler Palsulich On Nov. 2, 2014, 3:17 p.m., Vineet Ghatge Hemantkumar wrote: --- This is an automatically generated e-mail. To reply, visit: https

<    1   2   3   4   5   6   7   8   >