[jira] [Commented] (TIKA-1533) PDF parse failing to capture right order of text (2 columns)

2015-01-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295159#comment-14295159 ] Tim Allison commented on TIKA-1533: --- In the first document, printed page 303/pdf page 152

[jira] [Created] (TIKA-1533) PDF parse failing to capture right order of text (2 columns)

2015-01-28 Thread Tamara (JIRA)
Tamara created TIKA-1533: Summary: PDF parse failing to capture right order of text (2 columns) Key: TIKA-1533 URL: https://issues.apache.org/jira/browse/TIKA-1533 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-1517) MIME type selection with probability

2015-01-28 Thread Luke sh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295928#comment-14295928 ] Luke sh commented on TIKA-1517: --- the probability selection will inherit the class MIMETypes,

[jira] [Commented] (TIKA-1535) Inheritance modification for the class MIMETypes

2015-01-28 Thread Luke sh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295922#comment-14295922 ] Luke sh commented on TIKA-1535: --- TIKA-1517, the mime type selection mechanism with

[jira] [Commented] (TIKA-1521) Handle password protected 7zip files

2015-01-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295791#comment-14295791 ] Hudson commented on TIKA-1521: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #442 (See

[jira] [Commented] (TIKA-1534) Upgrade to Commons Compress 1.9

2015-01-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295789#comment-14295789 ] Hudson commented on TIKA-1534: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #442 (See

[jira] [Commented] (TIKA-1517) MIME type selection with probability

2015-01-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296103#comment-14296103 ] Tyler Palsulich commented on TIKA-1517: --- Hi [~Lukeliush]. Thanks for raising this

[jira] [Comment Edited] (TIKA-1517) MIME type selection with probability

2015-01-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296103#comment-14296103 ] Tyler Palsulich edited comment on TIKA-1517 at 1/29/15 12:04 AM:

[jira] [Commented] (TIKA-1423) Build a parser to extract data from GRIB formats

2015-01-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296129#comment-14296129 ] Lewis John McGibbney commented on TIKA-1423: I am working on this and think I

[jira] [Comment Edited] (TIKA-1517) MIME type selection with probability

2015-01-28 Thread Luke sh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295928#comment-14295928 ] Luke sh edited comment on TIKA-1517 at 1/28/15 11:06 PM: - the

RE: [jira] [Commented] (TIKA-1535) Inheritance modification for the class MIMETypes

2015-01-28 Thread Luke
Hi Professor and all, Bayesian or machine learning Detector is different from Bayesian Selection mechanism reported in TIKA-1517. It would make sense if we implemented a machine learning algorithm in separate Detector class, I have not gone too far with this design thought, as I am still on

Re: [jira] [Commented] (TIKA-1535) Inheritance modification for the class MIMETypes

2015-01-28 Thread Mattmann, Chris A (3980)
Hi Luke, -Original Message- From: Luke hanson311...@gmail.com Date: Wednesday, January 28, 2015 at 7:15 PM To: Chris Mattmann mattm...@usc.edu, Chris Mattmann chris.a.mattm...@jpl.nasa.gov, dev@tika.apache.org dev@tika.apache.org Cc: NSF Polar CyberInfrastructure DR Students

[jira] [Comment Edited] (TIKA-1518) Docker with Tika Server

2015-01-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296439#comment-14296439 ] Chris A. Mattmann edited comment on TIKA-1518 at 1/29/15 6:15 AM:

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296439#comment-14296439 ] Chris A. Mattmann commented on TIKA-1518: - Thanks Tyler. Can you raise #2 on

[jira] [Comment Edited] (TIKA-1518) Docker with Tika Server

2015-01-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296439#comment-14296439 ] Chris A. Mattmann edited comment on TIKA-1518 at 1/29/15 6:15 AM:

[jira] [Comment Edited] (TIKA-1423) Build a parser to extract data from GRIB formats

2015-01-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14296541#comment-14296541 ] Lewis John McGibbney edited comment on TIKA-1423 at 1/29/15 7:54 AM:

Re: multiple detect call - different results (tika 1.7)

2015-01-28 Thread Mattmann, Chris A (3980)
Dear Gabriele, Thanks for your question. It should be sent to dev@tika.apache.org (moving dev-ow...@tika.apache.org to BCC). I’ll take a look tomorrow if someone else hasn’t answered yet. Cheers, Chris ++ Chris Mattmann, Ph.D.

RE: [jira] [Commented] (TIKA-1535) Inheritance modification for the class MIMETypes

2015-01-28 Thread Luke
Thanks professor for the prompt and kind response, will keep you updated on the progress and findings. -Original Message- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Wednesday, January 28, 2015 8:17 PM To: Luke; 'Christian Alan Mattmann';

[jira] [Updated] (TIKA-1423) Build a parser to extract data from GRIB formats

2015-01-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-1423: --- Attachment: TIKA-1423v2.patch Patch for trunk which passes all tests including issues

[jira] [Created] (TIKA-1534) Upgrade to Commons Compress 1.9

2015-01-28 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1534: - Summary: Upgrade to Commons Compress 1.9 Key: TIKA-1534 URL: https://issues.apache.org/jira/browse/TIKA-1534 Project: Tika Issue Type: Improvement

[jira] [Commented] (TIKA-1534) Upgrade to Commons Compress 1.9

2015-01-28 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295749#comment-14295749 ] Hudson commented on TIKA-1534: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #457 (See

[jira] [Reopened] (TIKA-1329) Add RecursiveParserWrapper aka Jukka's (and Nick's) RecursiveMetadataParser

2015-01-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-1329: --- Wait, do I need to update the webpage, too? Or is that done automatically from tika-examples? Add