[jira] [Comment Edited] (TIKA-2471) Tab-prefixed message body lines in Mbox interpreted as headers

2017-10-17 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208769#comment-16208769 ] Luis Filipe Nassif edited comment on TIKA-2471 at 10/18/17 3:47 AM: Hi

[jira] [Commented] (TIKA-2471) Tab-prefixed message body lines in Mbox interpreted as headers

2017-10-17 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208775#comment-16208775 ] Luis Filipe Nassif commented on TIKA-2471: -- Also, the tracking metadata feature was added before

[jira] [Commented] (TIKA-2471) Tab-prefixed message body lines in Mbox interpreted as headers

2017-10-17 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208769#comment-16208769 ] Luis Filipe Nassif commented on TIKA-2471: -- Hi Matthew, If I remember correctly, some headers

[jira] [Commented] (TIKA-2478) MBOX import includes redundant copies of the text

2017-10-17 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208562#comment-16208562 ] Luis Filipe Nassif commented on TIKA-2478: -- Robert, related to your last suggestion, I think

[jira] [Commented] (TIKA-2478) MBOX import includes redundant copies of the text

2017-10-17 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208554#comment-16208554 ] Luis Filipe Nassif commented on TIKA-2478: -- Although I have seen in the past emls with very

[jira] [Commented] (TIKA-2478) MBOX import includes redundant copies of the text

2017-10-17 Thread Robert Letzler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208439#comment-16208439 ] Robert Letzler commented on TIKA-2478: -- Also, the current MBOX parser often puts the subject line in

[jira] [Commented] (TIKA-2478) MBOX import includes redundant copies of the text

2017-10-17 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16207598#comment-16207598 ] Nick Burch commented on TIKA-2478: -- Following the outlook parser model seems likely to deliver "least

[jira] [Comment Edited] (TIKA-2478) MBOX import includes redundant copies of the text

2017-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16207543#comment-16207543 ] Tim Allison edited comment on TIKA-2478 at 10/17/17 12:07 PM: -- Thank you

[jira] [Comment Edited] (TIKA-2478) MBOX import includes redundant copies of the text

2017-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16207543#comment-16207543 ] Tim Allison edited comment on TIKA-2478 at 10/17/17 12:07 PM: -- Thank you

[jira] [Commented] (TIKA-2478) MBOX import includes redundant copies of the text

2017-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16207548#comment-16207548 ] Tim Allison commented on TIKA-2478: --- There seems to be a nexus of areas for improvements in mbox/rfc822

[jira] [Commented] (TIKA-2478) MBOX import includes redundant copies of the text

2017-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16207543#comment-16207543 ] Tim Allison commented on TIKA-2478: --- Thank you [~letzlerr] for opening this and pointing to a triggering