Re: case mangling and binary strings

2004-11-03 Thread Adam Thomason
On Tue, 2 Nov 2004 13:33:47 -0500, Dan Sugalski [EMAIL PROTECTED] wrote: At 1:16 PM -0500 11/2/04, Sam Ruby wrote: Making ICU optional, at least. It's too problematic on too many platforms, and just turns into a big headache. It seemed like a good idea at the time, and while it's still

Re: case mangling and binary strings

2004-11-03 Thread Leopold Toetsch
Dan Sugalski [EMAIL PROTECTED] wrote: I expect I'll put together a Unicode charset that uses ICU to do its thing, and go from there. We certainly need Unicode support, so it's not like we can't do it. (And we still don't have a better option, unfortunately) ICU 3.0 should be out AFAIK. This

Re: case mangling and binary strings

2004-11-03 Thread Leopold Toetsch
Dan Sugalski [EMAIL PROTECTED] wrote: At 6:16 PM +0100 11/2/04, Leopold Toetsch wrote: If it's pure binary pitch a fit. If it has an encoding attached, continue, Yeah, that's the plan. I'd like to add another entry to the internal API: OPTIONAL_INTERNAL_EXCEPTION which works like

Re: case mangling and binary strings

2004-11-03 Thread Leopold Toetsch
Sam Ruby [EMAIL PROTECTED] wrote: Python has two data types: str and unicode. Python's unicode features are probably not really good examples generally. Ongoing discussion in Python lists seem to indicate that there a rather rough edges still. - Sam Ruby leo

case mangling and binary strings

2004-11-02 Thread Dan Sugalski
Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? (You may assume that we have strings which are explicitly marked at least Unicode, so there

Re: case mangling and binary strings

2004-11-02 Thread Nicholas Clark
On Tue, Nov 02, 2004 at 11:53:08AM -0500, Dan Sugalski wrote: Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? (You may assume that we

Re: case mangling and binary strings

2004-11-02 Thread Matt Fowles
Dan~ I vote for doing nothing in the up/down case options as those are frequently just used to get a cannonical form for comparison. Although I could understand an argument for throwing an exception... Matt On Tue, 2 Nov 2004 11:53:08 -0500, Dan Sugalski [EMAIL PROTECTED] wrote: Okay, here's

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 12:11 PM -0500 11/2/04, Matt Fowles wrote: Dan~ I vote for doing nothing in the up/down case options as those are frequently just used to get a cannonical form for comparison. Although I could understand an argument for throwing an exception... People better not be using binary data as a

Re: case mangling and binary strings

2004-11-02 Thread Sam Ruby
Dan Sugalski wrote: Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? (You may assume that we have strings which are explicitly marked at

Re: case mangling and binary strings

2004-11-02 Thread James Mastros
Dan Sugalski wrote: Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? (You may assume that we have strings which are explicitly marked at

Re: case mangling and binary strings

2004-11-02 Thread Nicholas Clark
On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that correspond to us-ascii and perform a us-ascii uppercase on them. The remaining bytes are left alone. I'd prefer parrot not to

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that correspond to us-ascii and perform a us-ascii uppercase on them. The remaining

Re: case mangling and binary strings

2004-11-02 Thread Leopold Toetsch
Dan Sugalski [EMAIL PROTECTED] wrote: Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? If it's pure binary pitch a fit. If it has an

Re: case mangling and binary strings

2004-11-02 Thread Sam Ruby
Dan Sugalski wrote: At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that correspond to us-ascii and perform a us-ascii uppercase on

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 6:16 PM +0100 11/2/04, Leopold Toetsch wrote: Dan Sugalski [EMAIL PROTECTED] wrote: Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? If

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 1:16 PM -0500 11/2/04, Sam Ruby wrote: Dan Sugalski wrote: At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that correspond to us-ascii

Re: case mangling and binary strings

2004-11-02 Thread Sam Ruby
Dan Sugalski wrote: At 1:16 PM -0500 11/2/04, Sam Ruby wrote: Dan Sugalski wrote: At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it operates is to take the range of bytes that

Re: case mangling and binary strings

2004-11-02 Thread Dan Sugalski
At 1:42 PM -0500 11/2/04, Sam Ruby wrote: Dan Sugalski wrote: At 1:16 PM -0500 11/2/04, Sam Ruby wrote: Dan Sugalski wrote: At 5:43 PM + 11/2/04, Nicholas Clark wrote: On Tue, Nov 02, 2004 at 12:35:26PM -0500, Sam Ruby wrote: However, str has an upper() method defined on it. The way it

Re: case mangling and binary strings

2004-11-02 Thread Ben Morrow
Quoth [EMAIL PROTECTED] (Dan Sugalski): Okay, here's a question for everyone to hash out. Assuming I have a parrot string which is explicitly marked as a binary string... What should happen when it's told to upcase/downcase/titlecase itself? (You may assume that we have strings which

Re: case mangling and binary strings

2004-11-02 Thread Jeff Clites
On Nov 2, 2004, at 10:46 AM, Dan Sugalski wrote: At 1:42 PM -0500 11/2/04, Sam Ruby wrote: I don't care if Parrot uses ICU on any platform. I do care that Parrot supports utf-8 on every platform. Ah, OK. Yes, we will support all the unicode encodings, as well as the unicode character set, on all