Re: Tika 1.15.1?

2017-06-29 Thread Luís Filipe Nassif
Agreed. Luis 2017-06-29 15:45 GMT-03:00 Bob Paulin : > If we're adding features does it make sense just to bump to 1.16 rather > than 1.15.1? Traditionally point releases would be bug fixes only [1]. > > > - Bob > > [1] http://semver.org/ > On 6/29/2017 1:18 PM, Allison,

[jira] [Updated] (TIKA-2406) IllegalArgumentException in text extraction from PDF file

2017-06-29 Thread Jorge Spinsanti (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Spinsanti updated TIKA-2406: -- Description: I got an IllegalArgumentException in text extraction from PDF file (attached):

[jira] [Updated] (TIKA-2406) IllegalArgumentException in text extraction from PDF file

2017-06-29 Thread Jorge Spinsanti (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Spinsanti updated TIKA-2406: -- Attachment: IllegalArgumentException.pdf > IllegalArgumentException in text extraction from PDF

[jira] [Created] (TIKA-2406) IllegalArgumentException in text extraction from PDF file

2017-06-29 Thread Jorge Spinsanti (JIRA)
Jorge Spinsanti created TIKA-2406: - Summary: IllegalArgumentException in text extraction from PDF file Key: TIKA-2406 URL: https://issues.apache.org/jira/browse/TIKA-2406 Project: Tika Issue

[jira] [Updated] (TIKA-2405) SAXParseException in text extraction from DOCX file

2017-06-29 Thread Jorge Spinsanti (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Spinsanti updated TIKA-2405: -- Attachment: SAXParseException.docx > SAXParseException in text extraction from DOCX file >

[jira] [Updated] (TIKA-2405) SAXParseException in text extraction from DOCX file

2017-06-29 Thread Jorge Spinsanti (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Spinsanti updated TIKA-2405: -- Description: I got SAXParseException in text extraction from DOCX file (see attachment): {code}

[jira] [Created] (TIKA-2405) SAXParseException in text extraction from DOCX file

2017-06-29 Thread Jorge Spinsanti (JIRA)
Jorge Spinsanti created TIKA-2405: - Summary: SAXParseException in text extraction from DOCX file Key: TIKA-2405 URL: https://issues.apache.org/jira/browse/TIKA-2405 Project: Tika Issue Type:

[jira] [Updated] (TIKA-2404) XMLException in DOCX->TXT conversion

2017-06-29 Thread Jorge Spinsanti (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Spinsanti updated TIKA-2404: -- Description: I got an XMLException when try to extract text from DOCX file (see attached file):

[jira] [Updated] (TIKA-2404) XMLException in DOCX->TXT conversion

2017-06-29 Thread Jorge Spinsanti (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Spinsanti updated TIKA-2404: -- Description: I got an XMException when try to extract text from DOCX file: {code} Caused by:

[jira] [Updated] (TIKA-2404) XMLException in DOCX->TXT conversion

2017-06-29 Thread Jorge Spinsanti (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Spinsanti updated TIKA-2404: -- Attachment: XmlException.docx > XMLException in DOCX->TXT conversion >

Re: Tika 1.15.1?

2017-06-29 Thread Bob Paulin
If we're adding features does it make sense just to bump to 1.16 rather than 1.15.1? Traditionally point releases would be bug fixes only [1]. - Bob [1] http://semver.org/ On 6/29/2017 1:18 PM, Allison, Timothy B. wrote: > K. > > -Original Message- > From: Mattmann, Chris A (3010)

RE: Tika 1.15.1?

2017-06-29 Thread Allison, Timothy B.
K. -Original Message- From: Mattmann, Chris A (3010) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Thursday, June 29, 2017 1:59 PM To: dev@tika.apache.org Subject: Re: Tika 1.15.1? Hey Tim, I’d like to try and get in: https://issues.apache.org/jira/browse/TIKA-1988 today for 15.1. I am

[jira] [Commented] (TIKA-2402) Support all image formats in Object Recognition REST Parser

2017-06-29 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16068731#comment-16068731 ] Luis Filipe Nassif commented on TIKA-2402: -- Hi [~ThejanWijesinghe], looks like DataVec

Re: Tika 1.15.1?

2017-06-29 Thread Mattmann, Chris A (3010)
Hey Tim, I’d like to try and get in: https://issues.apache.org/jira/browse/TIKA-1988 today for 15.1. I am working on integrating it now and adding some docs to the wiki. I’ll keep you posted. Cheers, Chris ++ Chris

[jira] [Updated] (TIKA-1988) Age Detection Tika Recogniser

2017-06-29 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-1988: Summary: Age Detection Tika Recogniser (was: Tika parser for extracting text based

[jira] [Commented] (TIKA-2262) Supporting Image-to-Text (Image Captioning) in Tika for Image MIME Types

2017-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16068698#comment-16068698 ] ASF GitHub Bot commented on TIKA-2262: -- chrismattmann commented on issue #180: Fix for TIKA-2262:

[jira] [Commented] (TIKA-2262) Supporting Image-to-Text (Image Captioning) in Tika for Image MIME Types

2017-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16068656#comment-16068656 ] ASF GitHub Bot commented on TIKA-2262: -- thammegowda commented on issue #180: Fix for TIKA-2262:

[jira] [Commented] (TIKA-2262) Supporting Image-to-Text (Image Captioning) in Tika for Image MIME Types

2017-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16068635#comment-16068635 ] ASF GitHub Bot commented on TIKA-2262: -- chrismattmann commented on issue #180: Fix for TIKA-2262:

[jira] [Commented] (TIKA-2262) Supporting Image-to-Text (Image Captioning) in Tika for Image MIME Types

2017-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16068634#comment-16068634 ] ASF GitHub Bot commented on TIKA-2262: -- thammegowda commented on issue #180: Fix for TIKA-2262:

[jira] [Commented] (TIKA-2262) Supporting Image-to-Text (Image Captioning) in Tika for Image MIME Types

2017-06-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16068620#comment-16068620 ] ASF GitHub Bot commented on TIKA-2262: -- thammegowda commented on issue #180: Fix for TIKA-2262:

[jira] [Commented] (TIKA-2403) Elasticsearch 5.2.2 - Ingest Node - PDF - Parsing Issue

2017-06-29 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16068557#comment-16068557 ] Tim Allison commented on TIKA-2403: --- Thank you for the ping. Are you able to share the triggering

[jira] [Commented] (TIKA-2398) Unifying Object Recognition REST services

2017-06-29 Thread Thamme Gowda (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16068484#comment-16068484 ] Thamme Gowda commented on TIKA-2398: [~ThejanWijesinghe] Looks good. This will be the next milestone

[jira] [Updated] (TIKA-2403) Elasticsearch 5.2.2 - Ingest Node - PDF - Parsing Issue

2017-06-29 Thread Boopathi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boopathi updated TIKA-2403: --- Description: We are using Elasticsearch 5.2.2 for Full text search. With the help of ingest node we are able

[jira] [Updated] (TIKA-2403) Elasticsearch 5.2.2 - Ingest Node - PDF - Parsing Issue

2017-06-29 Thread Boopathi (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Boopathi updated TIKA-2403: --- Description: We are using Elasticsearch 5.2.2 for Full text search. With the help of ingest node we are able