Re: RFC: UTF-8 perl (not Perl) source code

2011-08-08 Thread Craig A. Berry

On Aug 8, 2011, at 7:56 AM, John E. Malmberg wrote:

> On 8/8/2011 5:25 AM, Nicholas Clark wrote:
>> On Sun, Aug 07, 2011 at 01:44:23PM -0700, Father Chrysostomos wrote:
>>> If I change the quotes at the top of the *.c files in the perl source code 
>>> to use UTF-8 instead of Latin-1, would anyone object?

Not me.  I believe Tom C. is responsible for putting them in their current form 
so he might have an opinion.

>>> I have my text editor set to use UTF-8 by default, so it complains whenever 
>>> I open those files.
>> 
>> I have a vague memory (possibly wrong) that there was a VMS editor that
>> choked (in some fashion) on UTF-8. [And if so, I guess because it assumes
>> that source code is ISO-8859-1, and no C1 control characters]

I don't remember anything about an editor in this regard.

> The VMS DECTerm terminal emulator (X11 based) locks up from the output of the 
> unicode tests while running Perl.

You mean running the test suite, right?  A problem with sending a (possibly 
intentionally illegal) sequence to the terminal as part of a test shouldn't 
preclude putting valid UTF-8 in the source code.

As far as what git does, I think it simply averts its eyes from all questions 
of encoding and stores bytes.


Craig A. Berry
mailto:craigbe...@mac.com

"... getting out of a sonnet is much more
 difficult than getting in."
 Brad Leithauser



Re: RFC: UTF-8 perl (not Perl) source code

2011-08-08 Thread John E. Malmberg

On 8/8/2011 5:25 AM, Nicholas Clark wrote:

On Sun, Aug 07, 2011 at 01:44:23PM -0700, Father Chrysostomos wrote:

If I change the quotes at the top of the *.c files in the perl source code to 
use UTF-8 instead of Latin-1, would anyone object?

I have my text editor set to use UTF-8 by default, so it complains whenever I 
open those files.


I have a vague memory (possibly wrong) that there was a VMS editor that
choked (in some fashion) on UTF-8. [And if so, I guess because it assumes
that source code is ISO-8859-1, and no C1 control characters]


The VMS DECTerm terminal emulator (X11 based) locks up from the output 
of the unicode tests while running Perl.


VMS basically only supports ASCII and the 8 bit ISO character sets for 
terminal output for the non-internationalized versions.


I have not worked with the internationalized variants to know if they 
have the same issues.


-John
malmberg@encompasserve.organization
Personal Opinion Only



Re: RFC: UTF-8 perl (not Perl) source code

2011-08-08 Thread Nicholas Clark
On Sun, Aug 07, 2011 at 01:44:23PM -0700, Father Chrysostomos wrote:
> If I change the quotes at the top of the *.c files in the perl source code to 
> use UTF-8 instead of Latin-1, would anyone object?
> 
> I have my text editor set to use UTF-8 by default, so it complains whenever I 
> open those files.

I have a vague memory (possibly wrong) that there was a VMS editor that
choked (in some fashion) on UTF-8. [And if so, I guess because it assumes
that source code is ISO-8859-1, and no C1 control characters]

Nicholas Clark