Guido van Rossum wrote:

> The encoding/decoding behavior should be no different from that of the
> encode() and decode() methods on unicode strings and byte arrays.

Except that it must work in incremental mode. The new (in 2.5) 
incremental codecs should be usable for that.

> Certainly no normalization of diacritics will be done; surrogate
> handling depends on the encoding and whether the unicode string
> implementation uses 16 or 32 bits per character.
> 
> I agree that we need to be able to specify the error handling as well.

Should it be possible to change the error handling during the lifetime 
of a stream? Then this change would have to be passed through to the 
underlying codec.

> UnicodeErrors may be raised.

Servus,
    Walter

> On 2/27/07, Jim Jewett <[EMAIL PROTECTED]> wrote:
>> On 2/27/07, Adam Olsen <[EMAIL PROTECTED]> wrote:
>>> On 2/26/07, Mike Verdone <[EMAIL PROTECTED]> wrote:
>>>> Text I/O
>>>> ... operate on a per-character basis instead of a per-byte basis.
>>> "per-character" needs some clarification.  I'm guessing this will only
>>> return entire code points, but the unicode type will expose them as
>>> code units, so it could be seen as both per-code-point and
>>> per-code-unit.
>> Does this just mean that you assume
>> (1) UTF32
>> (2) surrogate pairs will show up as two characters
>> (3) diacritics may (or may not) show up separately from their base 
>> characters?
>>
>> This does suggest that error-correction should be specified (or at
>> least explicitly not specified).  If the underlying input byte-stream
>> contains an invalid sequence, will the TextIO raise a
>> UnicodeDecodeError?  Or will its error/replace/delete behavior be
>> settable?
>>
>> Does the Text class promise to catch things like an invalid
>> combination of surrogates?
>>
>> -jJ
>> _______________________________________________
>> Python-3000 mailing list
>> Python-3000@python.org
>> http://mail.python.org/mailman/listinfo/python-3000
>> Unsubscribe: 
>> http://mail.python.org/mailman/options/python-3000/guido%40python.org
>>
> 
> 

_______________________________________________
Python-3000 mailing list
Python-3000@python.org
http://mail.python.org/mailman/listinfo/python-3000
Unsubscribe: 
http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com

Reply via email to