Sorry for troubling u 
 But if i try a \w match on this String AO?nC 
 which is a unicode string i get only 4 matches 
 Iam using the Applet that is  a part of jakarta site 
 Morever is there any way out where i can unicode characters in the Browser 
 Is there any editor where i can enter UTF-8 Characters in the Browser 


-----Original Message-----
From: Daniel F. Savarese [mailto:[EMAIL PROTECTED]]
Sent: Thursday, January 10, 2002 8:44 PM
To: ORO Users List
Subject: Re: Doubt about ORO 



In message <[EMAIL PROTECTED]>,
Chan
dramouli Kharidehal writes:
>How do i detect Unicode  Characters using the ORO package 
>For example usign \w i can detect all the ASCII characters 
>How about the characters beyond the ASCII Unicode Characters 

As I said before:

>\d matches based on Character.isDigit() and \w matches based on
>Character.isLetterOrDigit() or '_'.  So, you see, it's all Unicode based
>on Java's interpretation of how to classify characters with some attempt
>to remain true to Perl (e.g., the inclusion of '_' in \w).  Even though
>the source is somewhat inscrutable, these types of questions can be
>answered by looking at the source.

If you skimmed it the first time, please reread the first sentence of
my original reply which explicitly answers your question.  Since
Character.isLetterOrDigit(), or any other Character.isFoo()
method, is not restricted to ASCII, neither is \w, \d, or any of the
other character set specifiers.  Also pay special attention to the last
sentence of my original reply :)

daniel



--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to