Hello. I write a module that parses a character class including grouping, intersection, union, and removal (subtraction), according to Unicode Regular Expression (e.g. [A & B], [A-Z - XYZ]) and converts it into a regular expression in Perl.
For example, [A-Z & C-S & K-V] can be used as well as [K-S]. Well, this module cannot do such optimization but utilizes perl regex syntax (?! ) and (?= ). tarball http://homepage1.nifty.com/nomenclator/perl/Unicode-Regex-Set-0.01.tar.gz html-pod http://homepage1.nifty.com/nomenclator/perl/Unicode-Regex-Set.html see also (UTR #18) http://www.unicode.org/unicode/reports/tr18/ Thank you, SADAHIRO Tomoyuki