[jira] [Commented] (TIKA-2750) Update regression corpus

2018-10-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665663#comment-16665663 ] Tim Allison commented on TIKA-2750: --- I kicked off an initial pull of MSOffice and PDF files... There are

[jira] [Commented] (TIKA-2750) Update regression corpus

2018-10-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665307#comment-16665307 ] Tim Allison commented on TIKA-2750: --- I attached two tables in

[jira] [Updated] (TIKA-2750) Update regression corpus

2018-10-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2750: -- Attachment: CC-MAIN-2018-39-mimes-charsets-by-tld.zip > Update regression corpus >

[jira] [Commented] (TIKA-2750) Update regression corpus

2018-10-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665298#comment-16665298 ] Tim Allison commented on TIKA-2750: --- I've rm'd the initial common crawl zips and the zips from govdocs1.

[jira] [Commented] (TIKA-2758) Possible error charset detection

2018-10-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665295#comment-16665295 ] Tim Allison commented on TIKA-2758: --- I just attached grep_charsets.csv which shows the results of

[jira] [Updated] (TIKA-2758) Possible error charset detection

2018-10-26 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2758: -- Attachment: grep_charsets.csv > Possible error charset detection

[jira] [Updated] (TIKA-2767) Problem with import xlsx with null cells

2018-10-26 Thread ionut hodor (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ionut hodor updated TIKA-2767: -- Affects Version/s: 1.18 > Problem with import xlsx with null cells >

[jira] [Updated] (TIKA-2767) Problem with import xlsx with null cells

2018-10-26 Thread ionut hodor (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ionut hodor updated TIKA-2767: -- Summary: Problem with import xlsx with null cells (was: Problem with import xlsx) > Problem with

[jira] [Updated] (TIKA-2767) Problem with import xlsx

2018-10-26 Thread ionut hodor (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ionut hodor updated TIKA-2767: -- Description: I have a problem with xlsx when there are cells without a value. The cells are not

[jira] [Updated] (TIKA-2767) Problem with import xlsx

2018-10-26 Thread ionut hodor (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ionut hodor updated TIKA-2767: -- Description: I have a problem with xlsx when there are cells without a value. The cells are not

[jira] [Updated] (TIKA-2767) Problem with import xlsx

2018-10-26 Thread ionut hodor (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ionut hodor updated TIKA-2767: -- Description: I have a problem with xlsx when there are cells without a value. The cells are not

[jira] [Created] (TIKA-2767) Problem with import xlsx

2018-10-26 Thread ionut hodor (JIRA)
ionut hodor created TIKA-2767: - Summary: Problem with import xlsx Key: TIKA-2767 URL: https://issues.apache.org/jira/browse/TIKA-2767 Project: Tika Issue Type: Bug Reporter: ionut