[jira] [Commented] (TIKA-906) Headers, footers, and footnotes not extracted from Pages documents

2012-07-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13409095#comment-13409095 ] Dave Meikle commented on TIKA-906: -- Support for AutoPageNumbers added in r1358856.

[jira] [Updated] (TIKA-906) Headers, footers, and footnotes not extracted from Pages documents

2012-07-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-906: - Fix Version/s: (was: 1.3) 1.2 Headers, footers, and footnotes not extracted

[jira] [Commented] (TIKA-960) Duplicate letters in text extracted from PDF files

2012-07-23 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13420806#comment-13420806 ] Dave Meikle commented on TIKA-960: -- On the move so may be wrong but this sounds like

[jira] [Commented] (TIKA-906) Headers, footers, and footnotes not extracted from Pages documents

2012-07-30 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13425243#comment-13425243 ] Dave Meikle commented on TIKA-906: -- Sorry - I missed the header the first time. Added it

[jira] [Commented] (TIKA-918) iWork Charts not being parsed in all products (Pages, Numbers, Keynote)

2012-10-20 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13480684#comment-13480684 ] Dave Meikle commented on TIKA-918: -- Erik - feel free to fire it to me and I can slim the

[jira] [Commented] (TIKA-1016) KEYS file not linked from download page

2012-11-05 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13490953#comment-13490953 ] Dave Meikle commented on TIKA-1016: --- Good spot Sebb. Have updated the website with the

[jira] [Closed] (TIKA-1016) KEYS file not linked from download page

2012-11-05 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle closed TIKA-1016. - KEYS file not linked from download page --- Key:

[jira] [Resolved] (TIKA-1049) Upgrade to PDFBox 1.7.1

2012-12-27 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-1049. --- Resolution: Fixed Assignee: Dave Meikle Update applied in r1426251 Upgrade to

[jira] [Updated] (TIKA-963) Backwards Compatibility for Metadata.DATE is Incorrect

2013-01-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-963: - Fix Version/s: 1.3 Backwards Compatibility for Metadata.DATE is Incorrect

[jira] [Updated] (TIKA-962) Backwards Compatibility for Metadata.LAST_AUTHOR is Broken

2013-01-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-962: - Fix Version/s: 1.3 Backwards Compatibility for Metadata.LAST_AUTHOR is Broken

[jira] [Comment Edited] (TIKA-1013) Add ability to check if a mime-type is already registered

2013-01-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13547371#comment-13547371 ] Dave Meikle edited comment on TIKA-1013 at 1/8/13 10:58 PM:

[jira] [Resolved] (TIKA-1013) Add ability to check if a mime-type is already registered

2013-01-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-1013. --- Resolution: Fixed Add ability to check if a mime-type is already registered

[jira] [Assigned] (TIKA-1098) not able to parse pdfs/docs/ppts using 1.1 tika parser‏‏

2013-04-07 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle reassigned TIKA-1098: - Assignee: Dave Meikle not able to parse pdfs/docs/ppts using 1.1 tika parser‏‏

[jira] [Updated] (TIKA-1098) not able to parse pdfs/docs/ppts using 1.1 tika parser‏‏

2013-04-07 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1098: -- Assignee: (was: Dave Meikle) not able to parse pdfs/docs/ppts using 1.1 tika parser‏‏

[jira] [Created] (TIKA-1104) Upgrade to PDFBox 1.8.1

2013-04-11 Thread Dave Meikle (JIRA)
Dave Meikle created TIKA-1104: - Summary: Upgrade to PDFBox 1.8.1 Key: TIKA-1104 URL: https://issues.apache.org/jira/browse/TIKA-1104 Project: Tika Issue Type: Bug Components: parser

[jira] [Updated] (TIKA-1104) Upgrade to PDFBox 1.8.1

2013-04-11 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1104: -- Priority: Trivial (was: Major) Issue Type: Task (was: Bug) Upgrade to PDFBox 1.8.1

[jira] [Resolved] (TIKA-1104) Upgrade to PDFBox 1.8.1

2013-04-11 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-1104. --- Resolution: Fixed Updated in r1466775 Upgrade to PDFBox 1.8.1

[jira] [Commented] (TIKA-992) OpenGraph meta tags to allow multiple values

2013-05-13 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13656182#comment-13656182 ] Dave Meikle commented on TIKA-992: -- Hi Markus - looks like this one slipped through the

[jira] [Created] (TIKA-1121) Socket server text parsing error on large text files

2013-05-19 Thread Dave Meikle (JIRA)
Dave Meikle created TIKA-1121: - Summary: Socket server text parsing error on large text files Key: TIKA-1121 URL: https://issues.apache.org/jira/browse/TIKA-1121 Project: Tika Issue Type: Bug

[jira] [Commented] (TIKA-1123) Add more mimetypes for famous programming languages

2013-05-25 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13667026#comment-13667026 ] Dave Meikle commented on TIKA-1123: --- Added in r1486301. Thanks Bernhard.

[jira] [Resolved] (TIKA-1123) Add more mimetypes for famous programming languages

2013-05-25 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-1123. --- Resolution: Fixed Fix Version/s: 1.4 Add more mimetypes for famous programming languages

[jira] [Commented] (TIKA-1126) text/html procuder for tika-server

2013-05-26 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13667290#comment-13667290 ] Dave Meikle commented on TIKA-1126: --- Thanks Ali - patch committed in r1486409.

[jira] [Resolved] (TIKA-1126) text/html procuder for tika-server

2013-05-26 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-1126. --- Resolution: Fixed Fix Version/s: 1.4 Improvement included in r1486409

[jira] [Commented] (TIKA-1215) Regression: Unable parse a mp3 file on 1.5 which parsed successfully on 1.4

2013-12-27 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857539#comment-13857539 ] Dave Meikle commented on TIKA-1215: --- This is working for me with your file on the latest

[jira] [Commented] (TIKA-1215) Regression: Unable parse a mp3 file on 1.5 which parsed successfully on 1.4

2013-12-27 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857544#comment-13857544 ] Dave Meikle commented on TIKA-1215: --- Can you send a sample of your code please as Git is

[jira] [Commented] (TIKA-1086) Tika-bundle 1.3 does not import org.w3c.dom package

2013-12-28 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858042#comment-13858042 ] Dave Meikle commented on TIKA-1086: --- Added update to POM in r1553845, thanks Niels! Is

[jira] [Commented] (TIKA-820) Locator is unset for HTML parser

2013-12-28 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858154#comment-13858154 ] Dave Meikle commented on TIKA-820: -- Committed slightly tidied up patch in r1553957, thanks

[jira] [Resolved] (TIKA-820) Locator is unset for HTML parser

2013-12-28 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-820. -- Resolution: Fixed Patch committed in r1553957. Locator is unset for HTML parser

[jira] [Commented] (TIKA-1198) Consider optionally utilizing CXF JAX-RS Attachment support

2013-12-29 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13858301#comment-13858301 ] Dave Meikle commented on TIKA-1198: --- Sergey - this change appears to be breaking the

[jira] [Commented] (TIKA-1198) Consider optionally utilizing CXF JAX-RS Attachment support

2013-12-31 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13859434#comment-13859434 ] Dave Meikle commented on TIKA-1198: --- One example would be running the following command

[jira] [Commented] (TIKA-1198) Consider optionally utilizing CXF JAX-RS Attachment support

2014-01-20 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13876920#comment-13876920 ] Dave Meikle commented on TIKA-1198: --- Hi Sergey - Thanks for taking a look at this. I

[jira] [Updated] (TIKA-605) Tika GDAL parser

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-605: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Tika GDAL

[jira] [Updated] (TIKA-539) Encoding detection is too biased by encoding in meta tag

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-539: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Encoding

[jira] [Updated] (TIKA-715) Some parsers produce non-well-formed XHTML SAX events

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-715: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Some

[jira] [Updated] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-819: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Make Option

[jira] [Updated] (TIKA-985) Support for HTML5 elements

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-985: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Support for

[jira] [Updated] (TIKA-995) XHTMLContentHandler doesn't pass attributes of body element

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-995: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC

[jira] [Updated] (TIKA-774) ExifTool Parser

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-774: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC ExifTool

[jira] [Updated] (TIKA-1059) Better Handling of InterruptedException in ExternalParser and ExternalEmbedder

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1059: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Better

[jira] [Updated] (TIKA-1079) Word document hits AIOOBE in SummaryExtractor.parseSummaries

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1079: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Word

[jira] [Updated] (TIKA-1072) AIOOBE when handling embedded document in .doc file

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1072: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC AIOOBE

[jira] [Updated] (TIKA-1108) Represent individual slides in pptx

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1108: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC

[jira] [Updated] (TIKA-987) Embedded drawing (SHAPE MERGEFORMAT) sometimes not extracted

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-987: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Embedded

[jira] [Updated] (TIKA-1106) CLAVIN Integration

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1106: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC CLAVIN

[jira] [Updated] (TIKA-1208) Migrate Any23 mime contributions to Tika

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1208: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Migrate

[jira] [Updated] (TIKA-1220) Parser implementration for IFC files

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1220: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Parser

[jira] [Updated] (TIKA-891) Use POST in addition to PUT on method calls in tika-server

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-891: - Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Use POST in

[jira] [Updated] (TIKA-1231) Safely handle null embedded files in PDFs

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1231: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Safely

[jira] [Resolved] (TIKA-973) PDF form data isn't included in extracted content.

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-973. -- Resolution: Fixed PDF form data isn't included in extracted content.

[jira] [Updated] (TIKA-1205) Allow PDFParser to fallback to other parser if there is an exception

2014-02-04 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1205: -- Fix Version/s: (was: 1.5) 1.6 Pushed out to 1.6, preparing for 1.5 RC Allow

[jira] [Commented] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder

2014-06-19 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14037219#comment-14037219 ] Dave Meikle commented on TIKA-1343: --- Hey Chris - I am up for building out on this one.

[jira] [Created] (TIKA-1381) Add Lingo24Translate implementation of Translate API

2014-07-31 Thread Dave Meikle (JIRA)
Dave Meikle created TIKA-1381: - Summary: Add Lingo24Translate implementation of Translate API Key: TIKA-1381 URL: https://issues.apache.org/jira/browse/TIKA-1381 Project: Tika Issue Type: New

[jira] [Commented] (TIKA-1381) Add Lingo24Translate implementation of Translate API

2014-07-31 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14081196#comment-14081196 ] Dave Meikle commented on TIKA-1381: --- Committed implementation in r1614945. Add

[jira] [Comment Edited] (TIKA-1381) Add Lingo24Translate implementation of Translate API

2014-07-31 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14081208#comment-14081208 ] Dave Meikle edited comment on TIKA-1381 at 7/31/14 6:10 PM:

[jira] [Commented] (TIKA-1381) Add Lingo24Translate implementation of Translate API

2014-07-31 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14081208#comment-14081208 ] Dave Meikle commented on TIKA-1381: --- [~chrismattmann] Before I edit the CHANGES file, is

[jira] [Commented] (TIKA-1381) Add Lingo24Translate implementation of Translate API

2014-07-31 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14081231#comment-14081231 ] Dave Meikle commented on TIKA-1381: --- [~chrismattmann] On it, thanks. Add

[jira] [Commented] (TIKA-1381) Add Lingo24Translate implementation of Translate API

2014-07-31 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14081241#comment-14081241 ] Dave Meikle commented on TIKA-1381: --- Added in r1614950 on the Tika 1.6 tag. Add

[jira] [Updated] (TIKA-1220) Parser implementration for IFC files

2014-09-28 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1220: -- Assignee: Lewis John McGibbney Parser implementration for IFC files

[jira] [Commented] (TIKA-1220) Parser implementration for IFC files

2014-09-28 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151204#comment-14151204 ] Dave Meikle commented on TIKA-1220: --- Hi [~lewismc] - this is now assigned to you. I have

[jira] [Assigned] (TIKA-1476) Allow TesseractOCRParser to be configured using an external configuration file

2014-11-16 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle reassigned TIKA-1476: - Assignee: Dave Meikle Allow TesseractOCRParser to be configured using an external configuration

[jira] [Created] (TIKA-1476) Allow TesseractOCRParser to be configured using an external configuration file

2014-11-16 Thread Dave Meikle (JIRA)
Dave Meikle created TIKA-1476: - Summary: Allow TesseractOCRParser to be configured using an external configuration file Key: TIKA-1476 URL: https://issues.apache.org/jira/browse/TIKA-1476 Project: Tika

[jira] [Updated] (TIKA-1476) Allow TesseractOCRParser to be configured using an external configuration file

2014-11-16 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1476: -- Description: The TesseractOCRParser is great but configuration at the moment requires configuring up a

[jira] [Resolved] (TIKA-1476) Allow TesseractOCRParser to be configured using an external configuration file

2014-11-16 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-1476. --- Resolution: Implemented Added in r1640017. Allow TesseractOCRParser to be configured using an

[jira] [Created] (TIKA-1477) Add customer header to allow overriding of OCR language to be used in Tika Server

2014-11-17 Thread Dave Meikle (JIRA)
Dave Meikle created TIKA-1477: - Summary: Add customer header to allow overriding of OCR language to be used in Tika Server Key: TIKA-1477 URL: https://issues.apache.org/jira/browse/TIKA-1477 Project:

[jira] [Updated] (TIKA-1477) Add custom header to allow overriding of OCR language to be used in Tika Server

2014-11-17 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1477: -- Summary: Add custom header to allow overriding of OCR language to be used in Tika Server (was: Add

[jira] [Commented] (TIKA-1480) TikaJAXRS get all resourses call fail

2014-11-18 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14216130#comment-14216130 ] Dave Meikle commented on TIKA-1480: --- I have updated the Wiki page. TikaJAXRS get all

[jira] [Assigned] (TIKA-595) HtmlHandler does not support multivalue metadata

2014-11-18 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle reassigned TIKA-595: Assignee: Dave Meikle HtmlHandler does not support multivalue metadata

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-19 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14217685#comment-14217685 ] Dave Meikle commented on TIKA-1445: --- bq. Hey Guys, to be honest, the way I see that we

[jira] [Resolved] (TIKA-595) HtmlHandler does not support multivalue metadata

2014-11-19 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-595. -- Resolution: Fixed Committed Julien Nioche's patch in r1640521. Thanks! HtmlHandler does not support

[jira] [Updated] (TIKA-1477) Add custom header processing to allow overriding of OCR and PDF configuration to be used in Tika Server

2014-11-20 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1477: -- Summary: Add custom header processing to allow overriding of OCR and PDF configuration to be used in

[jira] [Updated] (TIKA-1477) Add custom header processing to allow overriding of OCR and PDF configuration to be used in Tika Server

2014-11-20 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1477: -- Description: The _TesseractOCRParser_ and _PDFParser_ provide different configuration options via their

[jira] [Resolved] (TIKA-1477) Add custom header processing to allow overriding of OCR and PDF configuration to be used in Tika Server

2014-11-20 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-1477. --- Resolution: Fixed Added in r1640714. Add custom header processing to allow overriding of OCR and PDF

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-31 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14299805#comment-14299805 ] Dave Meikle commented on TIKA-1518: --- Right folks I have added the Dockerfile to

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-30 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14299697#comment-14299697 ] Dave Meikle commented on TIKA-1518: --- Sorry gang been travelling a lot. #1, Totally up

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-24 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14290507#comment-14290507 ] Dave Meikle commented on TIKA-1518: --- Hi [~grossws] - I have added the automated build

[jira] [Created] (TIKA-1637) Oracle internal API jdeps request for information

2015-05-25 Thread Dave Meikle (JIRA)
Dave Meikle created TIKA-1637: - Summary: Oracle internal API jdeps request for information Key: TIKA-1637 URL: https://issues.apache.org/jira/browse/TIKA-1637 Project: Tika Issue Type: Task

[jira] [Updated] (TIKA-1276) Missing embedded dependencies in tika-bundle

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1276: -- Fix Version/s: (was: 1.10) 1.11 Missing embedded dependencies in tika-bundle

[jira] [Commented] (TIKA-1276) Missing embedded dependencies in tika-bundle

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14650320#comment-14650320 ] Dave Meikle commented on TIKA-1276: --- Moved to 1.11 to allow for 1.10 release, but I have

[jira] [Commented] (TIKA-539) Encoding detection is too biased by encoding in meta tag

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14650315#comment-14650315 ] Dave Meikle commented on TIKA-539: -- Pushed to 1.11 Encoding detection is too biased by

[jira] [Updated] (TIKA-1518) Docker with Tika Server

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1518: -- Fix Version/s: (was: 1.10) 1.11 Docker with Tika Server ---

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14650316#comment-14650316 ] Dave Meikle commented on TIKA-1518: --- Moved to 1.11 to keep work to get DockerHub is

[jira] [Updated] (TIKA-539) Encoding detection is too biased by encoding in meta tag

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-539: - Fix Version/s: (was: 1.10) 1.11 Encoding detection is too biased by encoding in

[jira] [Commented] (TIKA-1329) Add RecursiveParserWrapper aka Jukka's (and Nick's) RecursiveMetadataParser

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14650322#comment-14650322 ] Dave Meikle commented on TIKA-1329: --- Moved to 1.11 but will work on update for site as

[jira] [Updated] (TIKA-1329) Add RecursiveParserWrapper aka Jukka's (and Nick's) RecursiveMetadataParser

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1329: -- Fix Version/s: (was: 1.10) 1.11 Add RecursiveParserWrapper aka Jukka's (and

[jira] [Resolved] (TIKA-1238) Update OutlookExtractor to handle codepage identification more rigorously

2015-08-01 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle resolved TIKA-1238. --- Resolution: Fixed Fixed committed in r1691962. Update OutlookExtractor to handle codepage

[jira] [Commented] (TIKA-1705) Update ASM dependency to 5.0.4

2015-08-11 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14681530#comment-14681530 ] Dave Meikle commented on TIKA-1705: --- Thanks [~thetaphi]. Have made the change and

[jira] [Updated] (TIKA-776) ExifTool Embedder

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-776: - Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release ExifTool

[jira] [Updated] (TIKA-1435) Update rome dependency to 1.5

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1435: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release Update

[jira] [Updated] (TIKA-1106) CLAVIN Integration

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1106: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release CLAVIN

[jira] [Updated] (TIKA-987) Embedded drawing (SHAPE MERGEFORMAT) sometimes not extracted

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-987: - Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release Embedded

[jira] [Updated] (TIKA-1672) Integrate tika-java7 component

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1672: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release

[jira] [Updated] (TIKA-1379) error in Tika().detect for xml files with xades signature

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1379: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release error

[jira] [Updated] (TIKA-1308) Support in memory parse mode(don't create temp file): to support run Tika in GAE

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1308: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release Support

[jira] [Updated] (TIKA-894) Add webapp mode for Tika Server, simplifies deployment

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-894: - Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release Add webapp

[jira] [Updated] (TIKA-1108) Represent individual slides in pptx

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1108: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release

[jira] [Updated] (TIKA-1688) Tika Version in Metadata

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1688: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release Tika

[jira] [Updated] (TIKA-1696) Language Identification with Text Processing Toolkit from MITLL

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1696: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release

[jira] [Updated] (TIKA-1616) Tika Parser for GIBS Metadata

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1616: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release Tika

[jira] [Updated] (TIKA-1366) Update some of Tika Server services to support JAX-RS 2.0 AsyncResponse

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1366: -- Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release Update

[jira] [Updated] (TIKA-891) Use POST in addition to PUT on method calls in tika-server

2015-08-08 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-891: - Fix Version/s: (was: 1.10) 1.11 * Pushed to 1.11 following 1.10 release Use POST

  1   2   3   >