[jira] [Commented] (TIKA-2703) Error indexing a xlsx file

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568090#comment-16568090 ] Tim Allison commented on TIKA-2703: --- Google docs/ Dropbox? GitHub? Anywhere else to share file? > Error

[jira] [Commented] (TIKA-2703) Error indexing a xlsx file

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568239#comment-16568239 ] Tim Allison commented on TIKA-2703: --- :P Thank you for sharing the file with me. Bottom line: there's a

[jira] [Comment Edited] (TIKA-2703) Error indexing a xlsx file

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568239#comment-16568239 ] Tim Allison edited comment on TIKA-2703 at 8/3/18 2:19 PM: --- :P Thank you for

[jira] [Commented] (TIKA-2703) Error indexing a xlsx file

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568251#comment-16568251 ] Tim Allison commented on TIKA-2703: --- That said, 1) you still shouldn't use Solr's integration with Tika

[jira] [Commented] (TIKA-2701) Text is not extracted properly from WMF files

2018-08-03 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568292#comment-16568292 ] ASF GitHub Bot commented on TIKA-2701: -- tballison closed pull request #245: fix for TIKA-2701

[jira] [Commented] (TIKA-2703) Error indexing a xlsx file

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568375#comment-16568375 ] Hudson commented on TIKA-2703: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #291 (See

[jira] [Commented] (TIKA-2701) Text is not extracted properly from WMF files

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568374#comment-16568374 ] Hudson commented on TIKA-2701: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #291 (See

[jira] [Commented] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568465#comment-16568465 ] Hudson commented on TIKA-2673: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #292 (See

[jira] [Commented] (TIKA-2648) mime detection based on resource name detects resources as "text/x-php" instead of "text/html"

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568464#comment-16568464 ] Hudson commented on TIKA-2648: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #292 (See

[jira] [Commented] (TIKA-2702) Different behavior between TIKA and pdfbox

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568483#comment-16568483 ] Tim Allison commented on TIKA-2702: --- Right, there is no guarantee or desire that Tika extracts the same

[jira] [Resolved] (TIKA-2702) Different behavior between TIKA and pdfbox

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2702. --- Resolution: Not A Problem Let's continue the discussion on our user list u...@tika.apache.org if you

[jira] [Resolved] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2673. --- Resolution: Fixed Assignee: Tim Allison Fix Version/s: 2.0.0 1.19

[jira] [Commented] (TIKA-2648) mime detection based on resource name detects resources as "text/x-php" instead of "text/html"

2018-08-03 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568369#comment-16568369 ] ASF GitHub Bot commented on TIKA-2648: -- tballison closed pull request #236: TIKA-2648 : detect

[jira] [Commented] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568398#comment-16568398 ] Tim Allison commented on TIKA-2673: --- [~gbouchar], I'm sorry if I missed it in the above, but would you

[jira] [Created] (TIKA-2704) MPEGStream should throw an EOF if appropriate in skipFrame

2018-08-03 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2704: - Summary: MPEGStream should throw an EOF if appropriate in skipFrame Key: TIKA-2704 URL: https://issues.apache.org/jira/browse/TIKA-2704 Project: Tika Issue Type:

[jira] [Resolved] (TIKA-2704) MPEGStream should throw an EOF if appropriate in skipFrame

2018-08-03 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2704. --- Resolution: Fixed Fix Version/s: 2.0.0 1.19 > MPEGStream should throw an

[jira] [Commented] (TIKA-2648) mime detection based on resource name detects resources as "text/x-php" instead of "text/html"

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568548#comment-16568548 ] Hudson commented on TIKA-2648: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1536 (See

[jira] [Commented] (TIKA-2703) Error indexing a xlsx file

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568551#comment-16568551 ] Hudson commented on TIKA-2703: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1536 (See

[jira] [Commented] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568549#comment-16568549 ] Hudson commented on TIKA-2673: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1536 (See

[jira] [Commented] (TIKA-2701) Text is not extracted properly from WMF files

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568550#comment-16568550 ] Hudson commented on TIKA-2701: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1536 (See

[jira] [Commented] (TIKA-2703) Error indexing a xlsx file

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568565#comment-16568565 ] Hudson commented on TIKA-2703: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #67 (See

[jira] [Commented] (TIKA-2673) HtmlEncodingDetector doesn't follow the specification

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568567#comment-16568567 ] Hudson commented on TIKA-2673: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #67 (See

[jira] [Commented] (TIKA-2701) Text is not extracted properly from WMF files

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568566#comment-16568566 ] Hudson commented on TIKA-2701: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #67 (See

[jira] [Commented] (TIKA-2704) MPEGStream should throw an EOF if appropriate in skipFrame

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568610#comment-16568610 ] Hudson commented on TIKA-2704: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #293 (See

[jira] [Commented] (TIKA-2704) MPEGStream should throw an EOF if appropriate in skipFrame

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568897#comment-16568897 ] Hudson commented on TIKA-2704: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1541 (See

[jira] [Commented] (TIKA-2704) MPEGStream should throw an EOF if appropriate in skipFrame

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568899#comment-16568899 ] Hudson commented on TIKA-2704: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #72 (See

[jira] [Commented] (TIKA-2648) mime detection based on resource name detects resources as "text/x-php" instead of "text/html"

2018-08-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16568898#comment-16568898 ] Hudson commented on TIKA-2648: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #72 (See