On Sat, Aug 11, 2012 at 4:23 PM, Mikel Artetxe <[email protected]> wrote:

> 3) Develop a Java port of it. Probably the best solution but, obviously,
>>> the hardest one to implement...
>>>
>> Haven't really looked into that as I consider JNI a better solution.
>> But, it's all hash maps and hash sets, so maybe not that hard to convert.
>>
> Again, regex is a significant feature and apparently only Java 7 and newer
>> gets that right.
>>
>
> As far as I know java.util.regex is available since early versions of Java
> (you can look 
> here<http://docs.oracle.com/javase/tutorial/essential/regex/>for more details 
> about what it offers). But perhaps Java 7 introduces some
> significant improvements in this field, I don't know...
>

Java regex before Java 7 worked with ASCII-only semantics. The newer regex
engine has many more Unicode features and a new flag
http://docs.oracle.com/javase/7/docs/api/java/util/regex/Pattern.html#UNICODE_CHARACTER_CLASSto
enable proper Unicode handling.

Only the newer Java 7 engine with the flag would be ICU compatible, from
what I could read.

-- Tino Didriksen
------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to