[jira] [Commented] (TIKA-1876) Integrate Natural Language Toolkit (NLTK) into Tika to perform Named Entity Recognition

2016-02-28 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15171368#comment-15171368 ] ASF GitHub Bot commented on TIKA-1876: -- GitHub user manalishah opened a pull request:

[jira] [Commented] (TIKA-1870) Relocating RichTextContentHandler into tika-core from tika-server

2016-02-25 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15167570#comment-15167570 ] ASF GitHub Bot commented on TIKA-1870: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1877) On updating the tika-mimetypes.xml to detect .fts file format, tika detector does not return anything

2016-02-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15172890#comment-15172890 ] ASF GitHub Bot commented on TIKA-1877: -- GitHub user prasadns14 opened a pull request:

[jira] [Commented] (TIKA-1840) No way to link slide notes to slide in PPT output.

2016-01-22 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15112270#comment-15112270 ] ASF GitHub Bot commented on TIKA-1840: -- GitHub user zetisam opened a pull request:

[jira] [Commented] (TIKA-1840) No way to link slide notes to slide in PPT output.

2016-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114576#comment-15114576 ] ASF GitHub Bot commented on TIKA-1840: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1857) Enhance PDFParser to extract text from XFA forms

2016-03-01 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15174859#comment-15174859 ] ASF GitHub Bot commented on TIKA-1857: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1882) Updating the tika-mimetypes.xml for new mime magic patterns

2016-03-01 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15173509#comment-15173509 ] ASF GitHub Bot commented on TIKA-1882: -- GitHub user mkampasi opened a pull request:

[jira] [Commented] (TIKA-1881) On updating mime magic for existing mime types

2016-03-01 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15173524#comment-15173524 ] ASF GitHub Bot commented on TIKA-1881: -- GitHub user NamithaGS opened a pull request:

[jira] [Commented] (TIKA-1508) Add uniformity to parser parameter configuration

2016-03-08 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186434#comment-15186434 ] ASF GitHub Bot commented on TIKA-1508: -- GitHub user thammegowda opened a pull request:

[jira] [Commented] (TIKA-1916) NPE in OpenDocumentParser

2016-03-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219246#comment-15219246 ] ASF GitHub Bot commented on TIKA-1916: -- GitHub user fxfixer opened a pull request:

[jira] [Commented] (TIKA-1943) Include support for Yandex Translate API

2016-04-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15237656#comment-15237656 ] ASF GitHub Bot commented on TIKA-1943: -- Github user reevapp closed the pull request at:

[jira] [Commented] (TIKA-1943) Include support for Yandex Translate API

2016-04-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15237654#comment-15237654 ] ASF GitHub Bot commented on TIKA-1943: -- Github user reevapp closed the pull request at:

[jira] [Commented] (TIKA-1943) Include support for Yandex Translate API

2016-04-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15237655#comment-15237655 ] ASF GitHub Bot commented on TIKA-1943: -- Github user reevapp closed the pull request at:

[jira] [Commented] (TIKA-1943) Include support for Yandex Translate API

2016-04-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15237662#comment-15237662 ] ASF GitHub Bot commented on TIKA-1943: -- GitHub user reevapp opened a pull request:

[jira] [Commented] (TIKA-1941) Can only respond correctly to its first request and cannot assign a User-Key dynamically

2016-04-08 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233286#comment-15233286 ] ASF GitHub Bot commented on TIKA-1941: -- GitHub user reevapp opened a pull request:

[jira] [Commented] (TIKA-1943) Include support for Yandex Translate API

2016-04-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233652#comment-15233652 ] ASF GitHub Bot commented on TIKA-1943: -- GitHub user reevapp opened a pull request:

[jira] [Commented] (TIKA-1943) Include support for Yandex Translate API

2016-04-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233651#comment-15233651 ] ASF GitHub Bot commented on TIKA-1943: -- GitHub user reevapp opened a pull request:

[jira] [Commented] (TIKA-1943) Include support for Yandex Translate API

2016-04-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233653#comment-15233653 ] ASF GitHub Bot commented on TIKA-1943: -- GitHub user reevapp opened a pull request:

[jira] [Commented] (TIKA-774) ExifTool Parser

2016-03-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15205245#comment-15205245 ] ASF GitHub Bot commented on TIKA-774: - GitHub user rgauss opened a pull request:

[jira] [Commented] (TIKA-1877) On updating the tika-mimetypes.xml to detect .fts file format, tika detector does not return anything

2016-03-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15182129#comment-15182129 ] ASF GitHub Bot commented on TIKA-1877: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1816) Lenient testing for NamedEntityParser

2016-03-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175785#comment-15175785 ] ASF GitHub Bot commented on TIKA-1816: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1876) Integrate Natural Language Toolkit (NLTK) into Tika to perform Named Entity Recognition

2016-03-01 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175077#comment-15175077 ] ASF GitHub Bot commented on TIKA-1876: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1816) Lenient testing for NamedEntityParser

2016-03-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175318#comment-15175318 ] ASF GitHub Bot commented on TIKA-1816: -- GitHub user thammegowda opened a pull request:

[jira] [Commented] (TIKA-1841) Different XML output structure for PPT and PPTX

2016-03-03 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15177637#comment-15177637 ] ASF GitHub Bot commented on TIKA-1841: -- GitHub user zetisam opened a pull request:

[jira] [Commented] (TIKA-1883) Identification of Mime Type for Empty Files

2016-03-03 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15178469#comment-15178469 ] ASF GitHub Bot commented on TIKA-1883: -- GitHub user adityardesai opened a pull request:

[jira] [Commented] (TIKA-1926) JSON TEI Exception

2016-04-03 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15223560#comment-15223560 ] ASF GitHub Bot commented on TIKA-1926: -- GitHub user hasanayesha opened a pull request:

[jira] [Commented] (TIKA-1916) NPE in OpenDocumentParser

2016-04-04 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15223990#comment-15223990 ] ASF GitHub Bot commented on TIKA-1916: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1927) NPE in JDBCTableReader

2016-04-04 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15224207#comment-15224207 ] ASF GitHub Bot commented on TIKA-1927: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1925) Composite External Parser like Exiftool fails to run on Windows.

2016-04-01 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15222655#comment-15222655 ] ASF GitHub Bot commented on TIKA-1925: -- GitHub user mit2nil opened a pull request:

[jira] [Commented] (TIKA-1914) ExecutableParser doesn't call start document

2016-04-04 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15224842#comment-15224842 ] ASF GitHub Bot commented on TIKA-1914: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1893) Add new mimetype for *.icns (Apple Icon Image Format) files

2016-04-25 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15257333#comment-15257333 ] ASF GitHub Bot commented on TIKA-1893: -- GitHub user mkampasi opened a pull request:

[jira] [Commented] (TIKA-1885) Tika MIME updates for *.cdf and *.xar and custom zero length file detector based on TREC-DD-Polar

2016-04-27 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15260457#comment-15260457 ] ASF GitHub Bot commented on TIKA-1885: -- Github user adeshgupta closed the pull request at:

[jira] [Commented] (TIKA-1938) HtmlParser drops

2016-04-27 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15260771#comment-15260771 ] ASF GitHub Bot commented on TIKA-1938: -- GitHub user naegelejd opened a pull request:

[jira] [Commented] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder

2016-04-27 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15261039#comment-15261039 ] ASF GitHub Bot commented on TIKA-1343: -- GitHub user lewismc opened a pull request:

[jira] [Commented] (TIKA-1926) JSON TEI Exception

2016-04-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255329#comment-15255329 ] ASF GitHub Bot commented on TIKA-1926: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1943) Include support for Yandex Translate API

2016-04-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255335#comment-15255335 ] ASF GitHub Bot commented on TIKA-1943: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1980) HTML head tags found after first script not parsed by HtmlParser (regression)

2016-05-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15294197#comment-15294197 ] ASF GitHub Bot commented on TIKA-1980: -- GitHub user naegelejd opened a pull request:

[jira] [Commented] (TIKA-1941) Can only respond correctly to its first request and cannot assign a User-Key dynamically

2016-05-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15287330#comment-15287330 ] ASF GitHub Bot commented on TIKA-1941: -- Github user reevapp closed the pull request at:

[jira] [Commented] (TIKA-1941) Can only respond correctly to its first request and cannot assign a User-Key dynamically

2016-05-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15287333#comment-15287333 ] ASF GitHub Bot commented on TIKA-1941: -- GitHub user reevapp opened a pull request:

[jira] [Commented] (TIKA-1885) Tika MIME updates for *.cdf and *.xar and custom zero length file detector based on TREC-DD-Polar

2016-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15268057#comment-15268057 ] ASF GitHub Bot commented on TIKA-1885: -- GitHub user adeshgupta opened a pull request:

[jira] [Commented] (TIKA-1965) Added types to Grobid quantities parser

2016-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15267039#comment-15267039 ] ASF GitHub Bot commented on TIKA-1965: -- GitHub user cmenekse opened a pull request:

[jira] [Commented] (TIKA-1893) Add new mimetype for *.icns (Apple Icon Image Format) files

2016-05-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15267409#comment-15267409 ] ASF GitHub Bot commented on TIKA-1893: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1938) HtmlParser drops

2016-05-10 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15278290#comment-15278290 ] ASF GitHub Bot commented on TIKA-1938: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1881) Updates to MIME types for Postscript, WordPerfect, image and RSS based on Polar analysis

2016-04-18 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15246981#comment-15246981 ] ASF GitHub Bot commented on TIKA-1881: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1844) PooledTimeSeriesParser takes precedence over MP4Parser

2016-04-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15251034#comment-15251034 ] ASF GitHub Bot commented on TIKA-1844: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1882) Scientific MIME updates to .cab files, .xar and .mobi and .mov files based on TREC-DD-Polar analysis

2016-04-18 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247213#comment-15247213 ] ASF GitHub Bot commented on TIKA-1882: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1883) Identification of Mime Type for Empty Files

2016-04-18 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15247190#comment-15247190 ] ASF GitHub Bot commented on TIKA-1883: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1965) Added types to Grobid quantities parser

2016-05-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15275441#comment-15275441 ] ASF GitHub Bot commented on TIKA-1965: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-05-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15299047#comment-15299047 ] ASF GitHub Bot commented on TIKA-1978: -- GitHub user lewismc opened a pull request:

[jira] [Commented] (TIKA-2053) Adding TagRatio to Tika Parser

2016-08-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15418450#comment-15418450 ] ASF GitHub Bot commented on TIKA-2053: -- GitHub user AravindRam opened a pull request:

[jira] [Commented] (TIKA-1508) Add uniformity to parser parameter configuration

2016-08-14 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15420445#comment-15420445 ] ASF GitHub Bot commented on TIKA-1508: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1986) support parser parameters with type (int, double, etc) in configuration XML file

2016-08-14 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15420446#comment-15420446 ] ASF GitHub Bot commented on TIKA-1986: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1993) Image Recognition with Tika

2016-08-14 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15420444#comment-15420444 ] ASF GitHub Bot commented on TIKA-1993: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1941) Can only respond correctly to its first request and cannot assign a User-Key dynamically

2016-08-14 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15420618#comment-15420618 ] ASF GitHub Bot commented on TIKA-1941: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1925) Composite External Parser like Exiftool fails to run on Windows.

2016-08-14 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15420588#comment-15420588 ] ASF GitHub Bot commented on TIKA-1925: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2031) Update Tesseract OCR Parser

2016-08-14 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15420619#comment-15420619 ] ASF GitHub Bot commented on TIKA-2031: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1885) Tika MIME updates for *.cdf and *.xar and custom zero length file detector based on TREC-DD-Polar

2016-08-14 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15420593#comment-15420593 ] ASF GitHub Bot commented on TIKA-1885: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1980) HTML head tags found after first script not parsed by HtmlParser (regression)

2016-08-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15419059#comment-15419059 ] ASF GitHub Bot commented on TIKA-1980: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2021) Improving accuracy of Tesseract parser

2016-07-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15365686#comment-15365686 ] ASF GitHub Bot commented on TIKA-2021: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2031) Update Tesseract OCR Parser

2016-07-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15399823#comment-15399823 ] ASF GitHub Bot commented on TIKA-2031: -- GitHub user Zarana-Parekh opened a pull request:

[jira] [Commented] (TIKA-1978) Invocation of java.net.URL.equals(Object), which blocks to do domain name resolution, in org.apache.tika.parser.geo.topic.GeoParser.initialize(URL)

2016-06-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15357710#comment-15357710 ] ASF GitHub Bot commented on TIKA-1978: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results

2017-02-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15850067#comment-15850067 ] ASF GitHub Bot commented on TIKA-2025: -- GitHub user vulpes8 opened a pull request:

[jira] [Commented] (TIKA-2025) Extraction of long sequences of digits from Excel spreadsheets using Tika 1.13 doesn’t yield the expected results

2017-02-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15850098#comment-15850098 ] ASF GitHub Bot commented on TIKA-2025: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2253) Obtain new Miredot license key and upgrade plugin version in tika-server

2017-01-27 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15843673#comment-15843673 ] ASF GitHub Bot commented on TIKA-2253: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2253) Obtain new Miredot license key and upgrade plugin version in tika-server

2017-01-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15840560#comment-15840560 ] ASF GitHub Bot commented on TIKA-2253: -- GitHub user lewismc opened a pull request:

[jira] [Commented] (TIKA-2269) NPE with FeedParser

2017-02-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15874562#comment-15874562 ] ASF GitHub Bot commented on TIKA-2269: -- GitHub user jnioche opened a pull request:

[jira] [Commented] (TIKA-2231) Invalid language code exception

2017-01-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15826902#comment-15826902 ] ASF GitHub Bot commented on TIKA-2231: -- GitHub user ham1 opened a pull request:

[jira] [Commented] (TIKA-2231) Invalid language code exception

2017-01-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15827335#comment-15827335 ] ASF GitHub Bot commented on TIKA-2231: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2053) Adding TagRatio to Tika Parser

2016-09-01 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15455824#comment-15455824 ] ASF GitHub Bot commented on TIKA-2053: -- Github user AravindRam closed the pull request at:

[jira] [Commented] (TIKA-2053) Adding TagRatio to Tika Parser

2016-08-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15447763#comment-15447763 ] ASF GitHub Bot commented on TIKA-2053: -- GitHub user AravindRam opened a pull request:

[jira] [Commented] (TIKA-2098) Tika.parseToString() with maxLength doesn't work correctly for PDF files

2016-09-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15523870#comment-15523870 ] ASF GitHub Bot commented on TIKA-2098: -- GitHub user alexshadow007 opened a pull request:

[jira] [Commented] (TIKA-2098) Tika.parseToString() with maxLength doesn't work correctly for PDF files

2016-09-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15523900#comment-15523900 ] ASF GitHub Bot commented on TIKA-2098: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2093) Add hOCR output type to the TesseractOCRParser

2016-09-22 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15515007#comment-15515007 ] ASF GitHub Bot commented on TIKA-2093: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2093) Add hOCR output type to the TesseractOCRParser

2016-09-22 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15514215#comment-15514215 ] ASF GitHub Bot commented on TIKA-2093: -- GitHub user epugh opened a pull request:

[jira] [Commented] (TIKA-2099) Tar files without magic bytes are sporadically detected as text

2016-09-27 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15528526#comment-15528526 ] ASF GitHub Bot commented on TIKA-2099: -- GitHub user theobisproject opened a pull request:

[jira] [Commented] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15536891#comment-15536891 ] ASF GitHub Bot commented on TIKA-2106: -- GitHub user epugh opened a pull request:

[jira] [Commented] (TIKA-2106) "hocr" case on Linux fails, but works on OSX. Related to TIKA-2093

2016-09-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15537423#comment-15537423 ] ASF GitHub Bot commented on TIKA-2106: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder

2016-10-25 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15607167#comment-15607167 ] ASF GitHub Bot commented on TIKA-1343: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder

2016-10-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15604274#comment-15604274 ] ASF GitHub Bot commented on TIKA-1343: -- GitHub user lewismc opened a pull request:

[jira] [Commented] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder

2016-10-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15604260#comment-15604260 ] ASF GitHub Bot commented on TIKA-1343: -- Github user lewismc closed the pull request at:

[jira] [Commented] (TIKA-2232) Add JBIG2 image parsing support

2017-01-10 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15815637#comment-15815637 ] ASF GitHub Bot commented on TIKA-2232: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2189) Default value mismatch for "enableImageProcessing" in TesseractOCRConfig.properties and TesseractOCRConfig.java

2016-12-02 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15715126#comment-15715126 ] ASF GitHub Bot commented on TIKA-2189: -- GitHub user dasbipulkumar opened a pull request:

[jira] [Commented] (TIKA-2232) Add JBIG2 image parsing support

2017-01-05 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15803275#comment-15803275 ] ASF GitHub Bot commented on TIKA-2232: -- GitHub user essiembre opened a pull request:

[jira] [Commented] (TIKA-2189) Default value mismatch for "enableImageProcessing" in TesseractOCRConfig.properties and TesseractOCRConfig.java

2016-12-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765300#comment-15765300 ] ASF GitHub Bot commented on TIKA-2189: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1946) Add mime detection and parser for WordPerfect

2016-12-20 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15765330#comment-15765330 ] ASF GitHub Bot commented on TIKA-1946: -- GitHub user essiembre opened a pull request:

[jira] [Commented] (TIKA-1946) Add mime detection and parser for WordPerfect

2016-12-21 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15767583#comment-15767583 ] ASF GitHub Bot commented on TIKA-1946: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-2222) Contributing a XFDL Parser

2016-12-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15774207#comment-15774207 ] ASF GitHub Bot commented on TIKA-2222: -- GitHub user essiembre opened a pull request:

[jira] [Commented] (TIKA-2228) WordPerfect parser update to support 5.x

2016-12-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15773586#comment-15773586 ] ASF GitHub Bot commented on TIKA-2228: -- GitHub user essiembre opened a pull request:

[jira] [Commented] (TIKA-2309) New Detector and Parser classes for Time Stamped Data Envelope file format

2017-03-28 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15945266#comment-15945266 ] ASF GitHub Bot commented on TIKA-2309: -- GitHub user Shinobi75 opened a pull request:

[jira] [Commented] (TIKA-2312) [Mp3Parser] expose fields from ID3TagsAndAudio

2017-03-27 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15943423#comment-15943423 ] ASF GitHub Bot commented on TIKA-2312: -- GitHub user lukaszozimek opened a pull request:

[jira] [Commented] (TIKA-2262) Supporting Image-to-Text (Image Captioning) in Tika for Image MIME Types

2017-03-28 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15946534#comment-15946534 ] ASF GitHub Bot commented on TIKA-2262: -- GitHub user KranthiGV opened a pull request:

[jira] [Commented] (TIKA-2309) New Detector and Parser classes for Time Stamped Data Envelope file format

2017-03-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15946994#comment-15946994 ] ASF GitHub Bot commented on TIKA-2309: -- GitHub user Shinobi75 reopened a pull request:

[jira] [Commented] (TIKA-2309) New Detector and Parser classes for Time Stamped Data Envelope file format

2017-03-29 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15946983#comment-15946983 ] ASF GitHub Bot commented on TIKA-2309: -- GitHub user Shinobi75 closed the pull request at:

[jira] [Commented] (TIKA-2298) To improve object recognition parser so that it may work without external RESTful service setup

2017-03-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15940472#comment-15940472 ] ASF GitHub Bot commented on TIKA-2298: -- GitHub user asmehra95 opened a pull request:

[jira] [Commented] (TIKA-2303) PDFParser with optional bookmarks text extraction

2017-03-16 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15928470#comment-15928470 ] ASF GitHub Bot commented on TIKA-2303: -- ppalazon opened a new pull request #157: Fix for TIKA-2303

[jira] [Commented] (TIKA-2303) PDFParser with optional bookmarks text extraction

2017-03-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15930501#comment-15930501 ] ASF GitHub Bot commented on TIKA-2303: -- GitHub user tballison closed the pull request at:

[jira] [Commented] (TIKA-2303) PDFParser with optional bookmarks text extraction

2017-03-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15930500#comment-15930500 ] ASF GitHub Bot commented on TIKA-2303: -- ppalazon closed pull request #157: Fix for TIKA-2303

[jira] [Commented] (TIKA-2293) Tess4jOCRParser - A simpler Java version of TesseractOCRParser

2017-03-18 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15931133#comment-15931133 ] ASF GitHub Bot commented on TIKA-2293: -- ThejanW opened a new pull request #158: TIKA-2293 -

[jira] [Commented] (TIKA-2293) Tess4jOCRParser - A simpler Java version of TesseractOCRParser

2017-03-18 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15931134#comment-15931134 ] ASF GitHub Bot commented on TIKA-2293: -- GitHub user ThejanW opened a pull request:

[jira] [Commented] (TIKA-2309) New Detector and Parser classes for Time Stamped Data Envelope file format

2017-04-04 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15955099#comment-15955099 ] ASF GitHub Bot commented on TIKA-2309: -- grossws commented on a change in pull request #161: fix for
