[jira] [Commented] (TIKA-3710) HTML document detected incorrect as message/rfc822

2022-04-01 Thread Sam Stephens (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17516142#comment-17516142 ] Sam Stephens commented on TIKA-3710: The HTML document is exactly what you see there;

[jira] [Commented] (TIKA-3711) Image file names included in parsed Word Document text

2022-04-01 Thread Sam Stephens (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17516141#comment-17516141 ] Sam Stephens commented on TIKA-3711: I guess the question is what are the semantics of

[jira] [Commented] (TIKA-3711) Image file names included in parsed Word Document text

2022-04-01 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17516029#comment-17516029 ] Tim Allison commented on TIKA-3711: --- Y, I totally hear you [~lfcnassif]. I think we hav

[jira] [Commented] (TIKA-3711) Image file names included in parsed Word Document text

2022-04-01 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515974#comment-17515974 ] Luís Filipe Nassif commented on TIKA-3711: -- Well, IMHO I think the user may be in

[jira] [Commented] (TIKA-3711) Image file names included in parsed Word Document text

2022-04-01 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515956#comment-17515956 ] Tim Allison commented on TIKA-3711: --- [~lfcnassif], it feels inelegant to write embedded

[jira] [Updated] (TIKA-3711) Image file names included in parsed Word Document text

2022-04-01 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luís Filipe Nassif updated TIKA-3711: - Issue Type: Improvement (was: Bug) > Image file names included in parsed Word Document te

[jira] [Updated] (TIKA-3711) Image file names included in parsed Word Document text

2022-04-01 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luís Filipe Nassif updated TIKA-3711: - Priority: Minor (was: Major) > Image file names included in parsed Word Document text > -

[jira] [Commented] (TIKA-3711) Image file names included in parsed Word Document text

2022-04-01 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515941#comment-17515941 ] Luís Filipe Nassif commented on TIKA-3711: -- I strongly prefer current behavior, t

[jira] [Comment Edited] (TIKA-3710) HTML document detected incorrect as message/rfc822

2022-04-01 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515921#comment-17515921 ] Tim Allison edited comment on TIKA-3710 at 4/1/22 1:35 PM: --- Y, I

[jira] [Commented] (TIKA-3710) HTML document detected incorrect as message/rfc822

2022-04-01 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515921#comment-17515921 ] Tim Allison commented on TIKA-3710: --- Did the original html file actually have an html he

[jira] [Commented] (TIKA-3711) Image file names included in parsed Word Document text

2022-04-01 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17515919#comment-17515919 ] Tim Allison commented on TIKA-3711: --- I introduced that change because some parsers were