Re: [mime4j] newlines and parsing of nested (encoded) rfc822 messages

Stefano Bagnara Fri, 18 Jul 2008 01:59:52 -0700

Oleg Kalnichevski ha scritto:

On Thu, 2008-07-17 at 20:21 +0200, Stefano Bagnara wrote:
Oleg Kalnichevski ha scritto:
Stefano Bagnara wrote:
...
Not only does this change completely reverts the performance gains andmakes the whole refactroring exercise completely pointless due to anutterly inefficient implementation of EOLConvertingInputStream, it isalso conceptually wrong (in my humble opinion), as it causes mime4j tocorrupt 8bit encoded 'application/octet-stream' content. This basicallyrenders mime4j incompatible with commons browsers and HttpClient
The performance of the EOLConvertingInputStream is not important at allif removing it we have an unusable library.
And the last thing. This kind of argument works both ways. The strict
RFC compliance is not important if we have an unusable library as a
result.


Oleg, I agree with you! I'm well aware of this.

I think that slowly this discussion is givin a bit more knowledge tojudge what is the right compromise between strict behaviour, permissiveinteroperabily and compliance.

Most time there is no need to be non-compliant to support permissiveinteroperability but we just need to be less strict.

I hope you understand I'm not fighting your patch/changes and I'm evenmuch more far from fighting you (in fact I like you because you providecode and not complaints!). I want to make sure we do the right thingbecause we understand it or if we do the wrong thing I want to be surewe understand what we are doing and agree that even if it is wrong isacceptable to us.


E.g: I'm slowly coming to a possible proposal about parsing.

- strict mode: no conversion is done, a CR or LF in headers (or othernon 7bit content) make mime4j fail parsing.

- permissive modes:

- default binary: no conversion happen, isolated CR and LF areaccepted everywhere but not considered newlines (as like as other 8bitbytes), the default content-transfer-encoding is "binary" when notspecified (7bit, 8bit and binary are read as binary).- default text: we convert isolated CR and LF to CRLF almosteverywhere but in "binary" content-transfer-encoding parts.I'm not proposing this yet (not sure this is enough and we don't needmore granular tweakings), but this is something I'm evaluating rightnow... The strict mode is desiderable to have, but less important thanthe permissive parsing (we want to be strict in output, not in input).OTOH someone may want to use mime4j for validating if a content iswellformed or not (wrt RFC) and in this case a strict mode would benecessary.


Stefano

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [mime4j] newlines and parsing of nested (encoded) rfc822 messages

Reply via email to