Dan Bron wrote: >> ¿J602a discriminate Spanish? > > It is more accurate to say J discriminates (for) ASCII. Unicode is a > second class citizen.
I don't think it is right to say that unicode is a second class citizen in J. The core assumption is that text in a literal is in unicode utf8 format, which is a de facto industry standard for plain text. That some utf8 characters take more than one byte is well understood and should not be a problem - just use ucpcount to get character count if desired, or convert to the alternative 2-byte unicode datatype. Both utf8 and 2-byte unicode datatypes are useful in programming. > I've been struggling with this myself, recently (I'm trying to create a > quick ad-hoc report but am hampered because my data is in Unicode but I > don't know enough about that to diagnose the issue or normalize the data). > I wish a character was a character was a character. What exactly are you struggling with? Can you provide a simple example? ---------------------------------------------------------------------- For information about J forums see http://www.jsoftware.com/forums.htm
