[jira] [Updated] (TIKA-1997) Problem in Tika().detect for xml file signed in CADES

2016-06-09 Thread Michele Andreano (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michele Andreano updated TIKA-1997: --- Fix Version/s: (was: 1.14) > Problem in Tika().detect for xml file signed in CADES >

[jira] [Updated] (TIKA-1997) Problem in Tika().detect for xml file signed in CADES

2016-06-09 Thread Michele Andreano (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michele Andreano updated TIKA-1997: --- Fix Version/s: (was: 1.13) 1.14 > Problem in Tika().detect for xml file

[jira] [Commented] (TIKA-1358) Add support for newer iWork file formats

2016-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322947#comment-15322947 ] Tim Allison commented on TIKA-1358: --- Think they'd be willing to push a build to maven? > Add support for

[jira] [Commented] (TIKA-1358) Add support for newer iWork file formats

2016-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322946#comment-15322946 ] Tim Allison commented on TIKA-1358: --- [~davemeikle], did you mention that you had parsers for this that

[jira] [Resolved] (TIKA-1966) Issue in parsing iWorksDocument with Apache Tika

2016-06-09 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1966. --- Resolution: Duplicate Ha. I had clearly memorized "fiddly" from earlier ticket w/out remembering

[jira] [Comment Edited] (TIKA-2001) Parsing XML outputs empty string

2016-06-09 Thread George L. Yermulnik (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322715#comment-15322715 ] George L. Yermulnik edited comment on TIKA-2001 at 6/9/16 3:39 PM: ---

[jira] [Commented] (TIKA-2001) Parsing XML outputs empty string

2016-06-09 Thread George L. Yermulnik (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322715#comment-15322715 ] George L. Yermulnik commented on TIKA-2001: --- > By default Tika only extracts the text between XML

[jira] [Commented] (TIKA-2001) Parsing XML outputs empty string

2016-06-09 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322710#comment-15322710 ] Jukka Zitting commented on TIKA-2001: - By default Tika only extracts the text between XML tags, not

[jira] [Commented] (TIKA-2001) Parsing XML outputs empty string

2016-06-09 Thread George L. Yermulnik (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322409#comment-15322409 ] George L. Yermulnik commented on TIKA-2001: --- {code} root@spring:/tmp# java -jar tika-app-1.13.jar

[jira] [Commented] (TIKA-2001) Parsing XML outputs empty string

2016-06-09 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322404#comment-15322404 ] Nick Burch commented on TIKA-2001: -- What's the output of `--detect` on the problematic file? > Parsing

[jira] [Created] (TIKA-2001) Parsing XML outputs empty string

2016-06-09 Thread George L. Yermulnik (JIRA)
George L. Yermulnik created TIKA-2001: - Summary: Parsing XML outputs empty string Key: TIKA-2001 URL: https://issues.apache.org/jira/browse/TIKA-2001 Project: Tika Issue Type: Bug

[jira] [Updated] (TIKA-2000) Author profile parser

2016-06-09 Thread Anthony Beylerian (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Beylerian updated TIKA-2000: Description: The profile parser aims to parse documents and return information about age and

[jira] [Updated] (TIKA-2000) Author profile parser

2016-06-09 Thread Anthony Beylerian (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Beylerian updated TIKA-2000: Description: The profile parser aims to parse documents and return information about age and

Re: Profiler for OpenNLP

2016-06-09 Thread Anthony Beylerian
Hello, Thank you very much for your interest. We are planning to implement some of features listed here [1]. However due to the breadth of approaches, any suggestions or hints based on your experience are of course welcome. [1] : http://www.ripublication.com/ijaer16/ijaerv11n5_24.pdf On Wed,