[ 
https://issues.apache.org/jira/browse/TIKA-3962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17683133#comment-17683133
 ] 

Tim Allison commented on TIKA-3962:
-----------------------------------

I'm attaching the output from 2.6.0 and 2.7.0 to show the difference on one of 
our unit test files from TIKA-2680.

> Set RFC822 parser to noRecurse
> ------------------------------
>
>                 Key: TIKA-3962
>                 URL: https://issues.apache.org/jira/browse/TIKA-3962
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 2.7.0
>
>         Attachments: TIKA-2680-1.eml-2.7.0-prerc1.json, 
> TIKA-2680-1.eml.-2.6.0.json
>
>
> On our test file {{testGroupWiseEml.eml}}, there's an embedded rfc822 
> attachment that is currently not treated as an attachment but is inlined. 
> The relevant section of the test file is:
> {noformat}
> Content-Type: message/rfc822
> Content-Transfer-Encoding: base64
> Content-Disposition: attachment; filename="test.eml"
> {noformat}
> When I open the email in several email clients, it shows this {{test.eml}} 
> correctly as an attachment.  
> It turns out there's a setting on mime4j's parser "setNoRecurse" that yields 
> the correct behavior on this test file.  Given that Tika handles files 
> recursively already by default, I _think_ we should be safe to set no recurse 
> in the mime4j parser and rely on Tika's own recursive parsing.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to