Dan Bron wrote:
>> ¿J602a discriminate  Spanish? 
> 
> It is more accurate to say J discriminates (for) ASCII.  Unicode is a
> second class citizen.  

I don't think it is right to say that unicode is a second class citizen 
in J.

The core assumption is that text in a literal is in unicode utf8 format, 
which is a de facto industry standard for plain text.

That some utf8 characters take more than one byte is well understood and 
should not be a problem - just use ucpcount to get character count if 
desired, or convert to the alternative 2-byte unicode datatype. Both 
utf8 and 2-byte unicode datatypes are useful in programming.

> I've been struggling with this myself, recently (I'm trying to create a
> quick ad-hoc report but am hampered because my data is in Unicode but I
> don't know enough about that to diagnose the issue or normalize the data).
>  I wish a character was a character was a character.

What exactly are you struggling with? Can you provide a simple example?

----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Reply via email to