[ https://issues.apache.org/jira/browse/MIME4J-250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17417062#comment-17417062 ]
Benoit Tellier commented on MIME4J-250: --------------------------------------- Got the issue. I bet the Q encoding can be normalized after unfolding. I will have a look today, it looks fun! {code:java} @Test public void test() throws Exception { DefaultMessageBuilder messageBuilder = new DefaultMessageBuilder(); messageBuilder.setMimeEntityConfig(MimeConfig.PERMISSIVE); messageBuilder.setDecodeMonitor(DecodeMonitor.SILENT); final Message message = messageBuilder.parseMessage(new ByteArrayInputStream(("Subject: Re: =?UTF-8?Q?=D8=AA=D8=B2_=D8=A2=D9=82=D8=A7=DB=8C_=DA=A9=D8=B1=D8=A7=D9=85=D8=AA=DB=8C?=\r\n").getBytes())); System.out.println(message.getSubject()); } {code} Returns {code:java} Re: تز آقای کرامتی {code} But {code:java} @Test public void test() throws Exception { DefaultMessageBuilder messageBuilder = new DefaultMessageBuilder(); messageBuilder.setMimeEntityConfig(MimeConfig.PERMISSIVE); messageBuilder.setDecodeMonitor(DecodeMonitor.SILENT); final Message message = messageBuilder.parseMessage(new ByteArrayInputStream(("Subject: Re: =?UTF-8?Q?=D8=AA=D8=B2_=D8=A2=D9=82=D8=A7=DB=8C_=DA=A9=D8=B1=D8=A7=D9?=\r\n" + " =?UTF-8?Q?=85=D8=AA=DB=8C?=\r\n").getBytes())); System.out.println(message.getSubject()); } {code} Returns {code:java} Re: تز آقای کرا��تی {code} I also hereby acknowledge Thunderbird handles the splited encoding correctly... (I manually changed the subject of an email to check.) {code:java} Re:تز آقای کرامتی {code} > subject decode problem > ---------------------- > > Key: MIME4J-250 > URL: https://issues.apache.org/jira/browse/MIME4J-250 > Project: James Mime4j > Issue Type: Bug > Components: parser (core) > Affects Versions: 0.7.2 > Reporter: mahdi > Priority: Critical > > when decoding a multi line subject a problem occurring, assume that we have: > `Subject: Re: > =?UTF-8?Q?=D8=AA=D8=B2_=D8=A2=D9=82=D8=A7=DB=8C_=DA=A9=D8=B1=D8=A7=D9?= > =?UTF-8?Q?=85=D8=AA=DB=8C?=` > after decoding the result is: `Re: تز آقای کرا��تی` > half of unicode \uD985 is in the end of line 2 and half in beginning of line > 3. > Thunderbird, gmail,... have no problem with this email and show the subject > correctly like this: `Re: تز آقای کرامتی` -- This message was sent by Atlassian Jira (v8.3.4#803005)