[jira] [Updated] (TIKA-241) Rar archive support

2015-01-13 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luis Filipe Nassif updated TIKA-241: Attachment: TIKA-241.patch Sorry for the long delay. Patch with 4-space indents and a unit

[jira] [Comment Edited] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275534#comment-14275534 ] Tim Allison edited comment on TIKA-1511 at 1/13/15 5:03 PM:

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275553#comment-14275553 ] Tim Allison commented on TIKA-1513: --- Any interest in encouraging iryndin to push to

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275636#comment-14275636 ] Luis Filipe Nassif commented on TIKA-1513: -- I can if the community thinks that

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275542#comment-14275542 ] Luis Filipe Nassif commented on TIKA-1513: -- I have found

[jira] [Comment Edited] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275534#comment-14275534 ] Tim Allison edited comment on TIKA-1511 at 1/13/15 5:04 PM:

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275605#comment-14275605 ] Luis Filipe Nassif commented on TIKA-1511: -- I think the jdbc based AbstractClass

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275679#comment-14275679 ] Konstantin Gribov commented on TIKA-1513: - [~talli...@mitre.org], I think it's good

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275703#comment-14275703 ] Konstantin Gribov commented on TIKA-1511: - [~lfcnassif], +1. IMHO, ManifoldCF

Re: [VOTE] Apache Tika 1.7 Release

2015-01-13 Thread Tyler Palsulich
Hi Folks, Let's mark this RC#2 as failed and shift the vote to the updated RC#3 ( http://markmail.org/message/m5gpgmr7hedgpjdj), which has Tesseract metadata fixes and David's test fix. Thanks, Tyler On Thu, Jan 8, 2015 at 6:25 AM, Peter Bowyer pe...@mapledesign.co.uk wrote: +1. Worked

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275217#comment-14275217 ] Tim Allison commented on TIKA-1511: --- Thank you, [~grossws]! Two questions: 1) On how

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275286#comment-14275286 ] Konstantin Gribov commented on TIKA-1511: - The usual way is to exclude the maven dependency

[jira] [Comment Edited] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275217#comment-14275217 ] Tim Allison edited comment on TIKA-1511 at 1/13/15 1:59 PM:

RE: ExternalParser isn't called

2015-01-13 Thread Allison, Timothy B.
Chris, Should we interpret this as -1 on rc3 from you? Or should we go forth with testing and voting on rc3? Thank you! Best, Tim -Original Message- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Monday, January

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275182#comment-14275182 ] Konstantin Gribov commented on TIKA-1511: - JNI can potentially give some issues in

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275112#comment-14275112 ] Tim Allison commented on TIKA-1511: --- Thank you for looking into that. I like the

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275361#comment-14275361 ] Tim Allison commented on TIKA-1511: --- Completely agree with this...that was the plan, esp

[jira] [Created] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1513: - Summary: Add mime detection and parsing for dbf files Key: TIKA-1513 URL: https://issues.apache.org/jira/browse/TIKA-1513 Project: Tika Issue Type: Improvement

[jira] [Created] (TIKA-1514) http-equiv content-type extraction should pick first parseable content value

2015-01-13 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1514: - Summary: http-equiv content-type extraction should pick first parseable content value Key: TIKA-1514 URL: https://issues.apache.org/jira/browse/TIKA-1514 Project: Tika

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275467#comment-14275467 ] Konstantin Gribov commented on TIKA-1513: - Is this lib alive? Last commits were in

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275472#comment-14275472 ] Tim Allison commented on TIKA-1513: --- I share your concern. There are ~2600 .dbase3 files

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275485#comment-14275485 ] Luis Filipe Nassif commented on TIKA-1511: -- Another library option is

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14275508#comment-14275508 ] Nick Burch commented on TIKA-1511: -- If we're going to do a general jdbc option, maybe we'd

[jira] [Created] (TIKA-1515) Old XLS 3 parsing is not working

2015-01-13 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1515: - Summary: Old XLS 3 parsing is not working Key: TIKA-1515 URL: https://issues.apache.org/jira/browse/TIKA-1515 Project: Tika Issue Type: Bug Reporter:

[jira] [Updated] (TIKA-1515) Old XLS 3 parsing is not working on some documents

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1515: -- Summary: Old XLS 3 parsing is not working on some documents (was: Old XLS 3 parsing is not working)

[jira] [Updated] (TIKA-1515) Old XLS 3 parsing is not working on some documents

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1515: -- Attachment: 081247.unk.xls This file comes from govdocs1 and demonstrates the code page issue. Old XLS

[jira] [Updated] (TIKA-1515) Old XLS 3 parsing is not working on some documents

2015-01-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1515: -- Description: Thanks to [~gagravarr], we now have mime type id for excel.sheet.4 and excel.sheet.3, and

[jira] [Commented] (TIKA-1509) Create configurable strategies for composite parsers

2015-01-13 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14276105#comment-14276105 ] Nick Burch commented on TIKA-1509: -- First up is probably some sort of composite /

[jira] [Commented] (TIKA-1515) Old XLS 3 parsing is not working on some documents

2015-01-13 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14276053#comment-14276053 ] Nick Burch commented on TIKA-1515: -- Hopefully fixed in Apache POI in r1651517 - it seems