[jira] [Commented] (TIKA-456) Support timeouts for parsers

2014-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13900340#comment-13900340 ] Konstantin Gribov commented on TIKA-456: As I remeber there's no correct way to

[jira] [Commented] (TIKA-456) Support timeouts for parsers

2014-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13900371#comment-13900371 ] Konstantin Gribov commented on TIKA-456: In my case (10-20M files, one server) I

[jira] [Updated] (TIKA-1258) Update NetCDF dependency

2014-03-07 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1258: Attachment: update-netcdf-to-4.2.20.patch Update NetCDF dependency

[jira] [Commented] (TIKA-1253) SLF4J: The requested version 1.5.6 by your slf4j binding is not compatible with [1.6, 1.7]

2014-06-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14027664#comment-14027664 ] Konstantin Gribov commented on TIKA-1253: - I've started a thread about logging

[jira] [Updated] (TIKA-1472) Warning on Tika Server startup - Failed to load class org.slf4j.impl.StaticLoggerBinder

2014-11-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1472: Attachment: 0001-Added-slf4j-jcl-impl-to-tika-server-deps.patch Warning on Tika Server

[jira] [Commented] (TIKA-1472) Warning on Tika Server startup - Failed to load class org.slf4j.impl.StaticLoggerBinder

2014-11-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14206479#comment-14206479 ] Konstantin Gribov commented on TIKA-1472: - Patch to fix issue above, added slf4j

[jira] [Commented] (TIKA-1472) Warning on Tika Server startup - Failed to load class org.slf4j.impl.StaticLoggerBinder

2014-11-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14206489#comment-14206489 ] Konstantin Gribov commented on TIKA-1472: - May be merget via PR:

[jira] [Commented] (TIKA-1472) Warning on Tika Server startup - Failed to load class org.slf4j.impl.StaticLoggerBinder

2014-11-18 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14215947#comment-14215947 ] Konstantin Gribov commented on TIKA-1472: - If you want to _upload_ jar, you have to

[jira] [Commented] (TIKA-1480) TikaJAXRS get all resourses call fail

2014-11-18 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14215961#comment-14215961 ] Konstantin Gribov commented on TIKA-1480: - You can browse [http://localhost:9998/]

[jira] [Commented] (TIKA-1489) PDF Text extraction without permission

2014-12-01 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229850#comment-14229850 ] Konstantin Gribov commented on TIKA-1489: - I think, some field in meta should be

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-26 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14291999#comment-14291999 ] Konstantin Gribov commented on TIKA-1518: - Thank you, [~davemeikle]. It works

[jira] [Commented] (TIKA-1538) Wrong mimetype detection

2015-02-03 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14303146#comment-14303146 ] Konstantin Gribov commented on TIKA-1538: - {code:java} Tika tika = new Tika();

[jira] [Commented] (TIKA-1543) TesseractOCRParser.setTesseractPath() doesn't work on Linux

2015-02-06 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14309325#comment-14309325 ] Konstantin Gribov commented on TIKA-1543: - Is tesseract binary executable? Is it

[jira] [Commented] (TIKA-1516) Downgrade Rome dependency to 0.9 to avoid nasty NPE

2015-01-15 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278821#comment-14278821 ] Konstantin Gribov commented on TIKA-1516: - Also, it seems to be an classloader

[jira] [Commented] (TIKA-1516) Downgrade Rome dependency to 0.9 to avoid nasty NPE

2015-01-15 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278800#comment-14278800 ] Konstantin Gribov commented on TIKA-1516: - Upstream bug isn't fixed yet. See

[jira] [Commented] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283102#comment-14283102 ] Konstantin Gribov commented on TIKA-1523: - And libreoffice shows 10 pages, as I see

[jira] [Commented] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283134#comment-14283134 ] Konstantin Gribov commented on TIKA-1523: - [~thetaphi], thank you for digging into

[jira] [Resolved] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1523. - Resolution: Won't Fix metadata extractor gets the wrong number of pages of some documents

[jira] [Closed] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-20 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov closed TIKA-1523. --- metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

[jira] [Assigned] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov reassigned TIKA-1523: --- Assignee: Konstantin Gribov metadata extractor gets the wrong number of pages of

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-20 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284926#comment-14284926 ] Konstantin Gribov commented on TIKA-1518: - I've dropped my version to avoid

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-16 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14280241#comment-14280241 ] Konstantin Gribov commented on TIKA-1518: - Ok, I'll create it soon Docker with

[jira] [Commented] (TIKA-1516) Downgrade Rome dependency to 0.9 to avoid nasty NPE

2015-01-16 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14281129#comment-14281129 ] Konstantin Gribov commented on TIKA-1516: - [~lewismc], if I understood you correct

[jira] [Commented] (TIKA-1516) Downgrade Rome dependency to 0.9 to avoid nasty NPE

2015-01-16 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14281136#comment-14281136 ] Konstantin Gribov commented on TIKA-1516: - Also, patch from their bugtracker

[jira] [Commented] (TIKA-241) Rar archive support

2015-01-14 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14276966#comment-14276966 ] Konstantin Gribov commented on TIKA-241: Don't we need to add licensing info about

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-14 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14277080#comment-14277080 ] Konstantin Gribov commented on TIKA-1511: - [~talli...@mitre.org], working with

[jira] [Commented] (TIKA-1518) Docker with Tika Server

2015-01-15 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14278446#comment-14278446 ] Konstantin Gribov commented on TIKA-1518: - To pull latest Tika you can use snippet

[jira] [Commented] (TIKA-1519) Don't allow whatever is in http-equiv Content-Type to overwrite actual Content-Type in HtmlParser

2015-01-20 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14284955#comment-14284955 ] Konstantin Gribov commented on TIKA-1519: - +1 to override and hint, library user

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289172#comment-14289172 ] Konstantin Gribov commented on TIKA-1526: - [~thetaphi], I understand that this is

[jira] [Comment Edited] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289172#comment-14289172 ] Konstantin Gribov edited comment on TIKA-1526 at 1/23/15 12:24 PM:

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14288977#comment-14288977 ] Konstantin Gribov commented on TIKA-1526: - [~thetaphi], they fixed this in 2.0 RC

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14275679#comment-14275679 ] Konstantin Gribov commented on TIKA-1513: - [~talli...@mitre.org], I think it's good

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14275703#comment-14275703 ] Konstantin Gribov commented on TIKA-1511: - [~lfcnassif], +1. IMHO, ManifoldCF

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14275286#comment-14275286 ] Konstantin Gribov commented on TIKA-1511: - Usual way is to exclude maven dependency

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14275182#comment-14275182 ] Konstantin Gribov commented on TIKA-1511: - JNI can potentially give some issues in

[jira] [Commented] (TIKA-1513) Add mime detection and parsing for dbf files

2015-01-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14275467#comment-14275467 ] Konstantin Gribov commented on TIKA-1513: - Is this lib alive? Last commits were in

[jira] [Updated] (TIKA-1511) Create a parser for SQLite3

2015-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1511: Attachment: TIKA-1511v3bis.patch Create a parser for SQLite3 ---

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320059#comment-14320059 ] Konstantin Gribov commented on TIKA-1511: - With v3 patch forbiddenapis found that

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320103#comment-14320103 ] Konstantin Gribov commented on TIKA-1511: - [~talli...@mitre.org], r1659547 work

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320104#comment-14320104 ] Konstantin Gribov commented on TIKA-1511: - [~talli...@mitre.org], r1659547 work

[jira] [Issue Comment Deleted] (TIKA-1511) Create a parser for SQLite3

2015-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1511: Comment: was deleted (was: [~talli...@mitre.org], r1659547 work fine. Tests for sqlite3

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320183#comment-14320183 ] Konstantin Gribov commented on TIKA-1511: - I don't see a lot of

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320022#comment-14320022 ] Konstantin Gribov commented on TIKA-1511: - [~talli...@mitre.org], you can also make

[jira] [Comment Edited] (TIKA-1511) Create a parser for SQLite3

2015-02-13 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320022#comment-14320022 ] Konstantin Gribov edited comment on TIKA-1511 at 2/13/15 12:40 PM:

[jira] [Created] (TIKA-1574) Frames in header/footer in doc files aren't extracted

2015-03-11 Thread Konstantin Gribov (JIRA)
Konstantin Gribov created TIKA-1574: --- Summary: Frames in header/footer in doc files aren't extracted Key: TIKA-1574 URL: https://issues.apache.org/jira/browse/TIKA-1574 Project: Tika Issue

[jira] [Commented] (TIKA-1575) Upgrade to PDFBox 1.8.9 when available

2015-03-28 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385561#comment-14385561 ] Konstantin Gribov commented on TIKA-1575: - What about updating to released pdfbox

[jira] [Resolved] (TIKA-1575) Upgrade to PDFBox 1.8.9 when available

2015-03-29 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1575. - Resolution: Fixed Upgrade to PDFBox 1.8.9 when available

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-03-29 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385802#comment-14385802 ] Konstantin Gribov commented on TIKA-1511: - +1 for including xerial in tika-app and

[jira] [Comment Edited] (TIKA-1511) Create a parser for SQLite3

2015-03-29 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385836#comment-14385836 ] Konstantin Gribov edited comment on TIKA-1511 at 3/29/15 4:05 PM:

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-03-29 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385836#comment-14385836 ] Konstantin Gribov commented on TIKA-1511: - Idea of better tika-parsers module

[jira] [Commented] (TIKA-1587) ForkParser::setJavaCommand should take ListString

2015-03-30 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14386746#comment-14386746 ] Konstantin Gribov commented on TIKA-1587: - LGTM. Commited with integration test

[jira] [Resolved] (TIKA-1587) ForkParser::setJavaCommand should take ListString

2015-03-30 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1587. - Resolution: Fixed ForkParser::setJavaCommand should take ListString

[jira] [Updated] (TIKA-1590) A particular PDF seems to trigger an infinite loop when being converted to HTML

2015-04-01 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1590: Fix Version/s: 1.8 A particular PDF seems to trigger an infinite loop when being converted

[jira] [Comment Edited] (TIKA-1590) A particular PDF seems to trigger an infinite loop when being converted to HTML

2015-04-01 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14390170#comment-14390170 ] Konstantin Gribov edited comment on TIKA-1590 at 4/1/15 7:53 AM:

[jira] [Commented] (TIKA-1590) A particular PDF seems to trigger an infinite loop when being converted to HTML

2015-04-01 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14390177#comment-14390177 ] Konstantin Gribov commented on TIKA-1590: - Thank you for the feedback, Matt. I

[jira] [Resolved] (TIKA-1590) A particular PDF seems to trigger an infinite loop when being converted to HTML

2015-04-01 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1590. - Resolution: Duplicate Fixed in trunk by update of pdfbox to 1.8.9. See alse TIKA-1575 and

[jira] [Commented] (TIKA-1004) Support ansi as an alias for windows-1252 charset

2015-03-03 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14345687#comment-14345687 ] Konstantin Gribov commented on TIKA-1004: - -1 for this ticket. Windows references

[jira] [Commented] (TIKA-1529) Turn forbidden-apis back on

2015-01-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289333#comment-14289333 ] Konstantin Gribov commented on TIKA-1529: - I vote for throwing {{RuntimeException}}

[jira] [Commented] (TIKA-1529) Turn forbidden-apis back on

2015-01-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289460#comment-14289460 ] Konstantin Gribov commented on TIKA-1529: - {{new String(bytes, Charset)}} will

[jira] [Commented] (TIKA-1529) Turn forbidden-apis back on

2015-01-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289429#comment-14289429 ] Konstantin Gribov commented on TIKA-1529: - [~talli...@mitre.org], it works with

[jira] [Resolved] (TIKA-1591) Tika Parsers uses wrong version of bouncycastle

2015-04-01 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1591. - Resolution: Fixed Fix Version/s: 1.8 Updated in r1670802 Tika Parsers uses wrong

[jira] [Commented] (TIKA-1593) Doco: Broken link to Parser Quick Start Guide

2015-04-03 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14394555#comment-14394555 ] Konstantin Gribov commented on TIKA-1593: - Thank you, Dan. Seems, it should be

[jira] [Commented] (TIKA-1532) DIF Parser

2015-04-21 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504904#comment-14504904 ] Konstantin Gribov commented on TIKA-1532: - {{text/\*+xml}} is quite unusual type.

[jira] [Resolved] (TIKA-1626) Upgrade to POI 3.12-final

2015-05-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1626. - Resolution: Fixed Fix Version/s: 1.9 Upgrade to POI 3.12-final

[jira] [Commented] (TIKA-1330) Add robust tika-batch code

2015-04-06 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481555#comment-14481555 ] Konstantin Gribov commented on TIKA-1330: - [~talli...@mitre.org], you have mixed

[jira] [Commented] (TIKA-1330) Add robust tika-batch code

2015-04-06 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481917#comment-14481917 ] Konstantin Gribov commented on TIKA-1330: - That was just a test to check that

[jira] [Updated] (TIKA-1595) tika-batch cleanup

2015-04-07 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1595: Description: New module tika-batch cleanup. Related to TIKA-1330 (was: New module

[jira] [Created] (TIKA-1595) tika-batch cleanup

2015-04-07 Thread Konstantin Gribov (JIRA)
Konstantin Gribov created TIKA-1595: --- Summary: tika-batch cleanup Key: TIKA-1595 URL: https://issues.apache.org/jira/browse/TIKA-1595 Project: Tika Issue Type: Improvement

[jira] [Resolved] (TIKA-1596) tika-cli logging cleanup

2015-04-07 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1596. - Resolution: Fixed tika-cli logging cleanup

[jira] [Closed] (TIKA-1596) tika-cli logging cleanup

2015-04-07 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov closed TIKA-1596. --- tika-cli logging cleanup Key: TIKA-1596

[jira] [Created] (TIKA-1596) tika-cli logging cleanup

2015-04-07 Thread Konstantin Gribov (JIRA)
Konstantin Gribov created TIKA-1596: --- Summary: tika-cli logging cleanup Key: TIKA-1596 URL: https://issues.apache.org/jira/browse/TIKA-1596 Project: Tika Issue Type: Improvement

[jira] [Created] (TIKA-1597) RTF with embedded image parsing produces div before html

2015-04-08 Thread Konstantin Gribov (JIRA)
Konstantin Gribov created TIKA-1597: --- Summary: RTF with embedded image parsing produces div before html Key: TIKA-1597 URL: https://issues.apache.org/jira/browse/TIKA-1597 Project: Tika

[jira] [Updated] (TIKA-1597) RTF with embedded image parsing produces div before html

2015-04-08 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1597: Attachment: 3.rtf 2.rtf RTF with embedded image parsing produces div before

[jira] [Closed] (TIKA-1590) A particular PDF seems to trigger an infinite loop when being converted to HTML

2015-04-02 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov closed TIKA-1590. --- A particular PDF seems to trigger an infinite loop when being converted to HTML

[jira] [Commented] (TIKA-1519) Don't allow whatever is in http-equiv Content-Type to overwrite actual Content-Type in HtmlParser

2015-04-09 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14487708#comment-14487708 ] Konstantin Gribov commented on TIKA-1519: - It can be either refinement or not. E.g.

[jira] [Closed] (TIKA-1626) Upgrade to POI 3.12-final

2015-06-29 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov closed TIKA-1626. --- Upgrade to POI 3.12-final - Key: TIKA-1626

[jira] [Commented] (TIKA-1536) Upgrade compiler definition in pom's to Java 7

2015-06-29 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605344#comment-14605344 ] Konstantin Gribov commented on TIKA-1536: - Did we announced that current release

[jira] [Commented] (TIKA-1655) Inconsistent formatting in parsers pom.xml file

2015-06-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582097#comment-14582097 ] Konstantin Gribov commented on TIKA-1655: - Thank you, Paul. Inconsistent

[jira] [Resolved] (TIKA-1655) Inconsistent formatting in parsers pom.xml file

2015-06-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1655. - Resolution: Fixed Inconsistent formatting in parsers pom.xml file

[jira] [Closed] (TIKA-1655) Inconsistent formatting in parsers pom.xml file

2015-06-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov closed TIKA-1655. --- Inconsistent formatting in parsers pom.xml file ---

[jira] [Commented] (TIKA-1655) Inconsistent formatting in parsers pom.xml file

2015-06-11 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582084#comment-14582084 ] Konstantin Gribov commented on TIKA-1655: - Merged in r1684917 Inconsistent

[jira] [Closed] (TIKA-1258) Update NetCDF dependency

2015-06-29 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov closed TIKA-1258. --- Update NetCDF dependency Key: TIKA-1258

[jira] [Updated] (TIKA-1524) Can install Tika-Bundle, missing JUnit dependency

2015-07-28 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1524: Attachment: TIKA-1524.patch Combined with reformatted TikaBundleNoJUnit.patch.

[jira] [Commented] (TIKA-1524) Can install Tika-Bundle, missing JUnit dependency

2015-07-28 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14644345#comment-14644345 ] Konstantin Gribov commented on TIKA-1524: - [~talli...@mitre.org], [~bobpaulin],

[jira] [Resolved] (TIKA-1524) Can install Tika-Bundle, missing JUnit dependency

2015-07-28 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1524. - Resolution: Fixed Fix Version/s: 1.10 Can install Tika-Bundle, missing JUnit

[jira] [Commented] (TIKA-1524) Can install Tika-Bundle, missing JUnit dependency

2015-07-27 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642887#comment-14642887 ] Konstantin Gribov commented on TIKA-1524: - JFYI, removing explicit {{junit.textui}}

[jira] [Commented] (TIKA-1524) Can install Tika-Bundle, missing JUnit dependency

2015-07-27 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642882#comment-14642882 ] Konstantin Gribov commented on TIKA-1524: - [~talli...@mitre.org], it doesn't

[jira] [Updated] (TIKA-1524) Can install Tika-Bundle, missing JUnit dependency

2015-07-27 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1524: Attachment: TIKA-1524.patch Can install Tika-Bundle, missing JUnit dependency

[jira] [Commented] (TIKA-1694) Missing scopetest/scope for junit dependency

2015-07-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14638942#comment-14638942 ] Konstantin Gribov commented on TIKA-1694: - {{junit}} dependency is declared with

[jira] [Assigned] (TIKA-1694) Missing scopetest/scope for junit dependency

2015-07-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov reassigned TIKA-1694: --- Assignee: Konstantin Gribov Missing scopetest/scope for junit dependency

[jira] [Created] (TIKA-1695) Update json-simple when available

2015-07-23 Thread Konstantin Gribov (JIRA)
Konstantin Gribov created TIKA-1695: --- Summary: Update json-simple when available Key: TIKA-1695 URL: https://issues.apache.org/jira/browse/TIKA-1695 Project: Tika Issue Type: Bug

[jira] [Updated] (TIKA-1695) Update json-simple when available

2015-07-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1695: Description: Currently json-simple 1.1.1 contains non-test dependency on junit, which is

[jira] [Assigned] (TIKA-1752) Use java.nio.file.Path in org.apache.tika.detect

2015-09-30 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov reassigned TIKA-1752: --- Assignee: Konstantin Gribov > Use java.nio.file.Path in org.apache.tika.detect >

[jira] [Resolved] (TIKA-1752) Use java.nio.file.Path in org.apache.tika.detect

2015-09-30 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov resolved TIKA-1752. - Resolution: Fixed > Use java.nio.file.Path in org.apache.tika.detect >

[jira] [Commented] (TIKA-1743) NetworkParser can create Unbounded Number of Threads

2015-09-23 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14904602#comment-14904602 ] Konstantin Gribov commented on TIKA-1743: - [~bobpaulin], I have two ideas on the issue: - by

[jira] [Updated] (TIKA-1744) Use java.nio.file.Path in TikaInputStream

2015-09-24 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1744: Labels: java7 (was: ) > Use java.nio.file.Path in TikaInputStream >

[jira] [Updated] (TIKA-1726) Augment public methods that use a java.io.File with methods that use a java.nio.file.Path

2015-09-24 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1726: Labels: java7 (was: ) > Augment public methods that use a java.io.File with methods that

[jira] [Updated] (TIKA-1751) Use java.nio.file.Path in TikaConfig

2015-09-24 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Konstantin Gribov updated TIKA-1751: Labels: java7 (was: ) > Use java.nio.file.Path in TikaConfig >

[jira] [Comment Edited] (TIKA-1824) Tika 2.0 - Create Initial Parser Modules

2016-02-05 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133821#comment-15133821 ] Konstantin Gribov edited comment on TIKA-1824 at 2/5/16 8:22 AM: - I'm on

[jira] [Commented] (TIKA-1824) Tika 2.0 - Create Initial Parser Modules

2016-02-05 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133821#comment-15133821 ] Konstantin Gribov commented on TIKA-1824: - I'm on vacation now, so reveiwed this topic only

  1   2   3   >