Hi,

I'd like to use re2c to parse some natural language text.  I think I
can make it work as it is now with a little hackery, since it
recognizes UTF-8 byte sequences, but it would be better if it
understood Unicode natively.  Are there any plans to improve Unicode
support?  I haven't looked at the internals yet, but an obvious
approach would be to add a byte->char conversion layer and modify the
regex machinery to work on chars (Unicode code points) instead of
bytes.  Any idea how much work that would be?
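
To make the byte->char idea concrete, here is a rough sketch of what such a conversion layer might do, written in Python only for brevity.  It is not based on re2c's internals; `decode_utf8` is a hypothetical name, and the sketch simply turns a UTF-8 byte string into a stream of code points that a char-based matcher could consume.

```python
def decode_utf8(data: bytes):
    """Yield Unicode code points from UTF-8 bytes.

    Hypothetical helper for illustration; not part of re2c.
    """
    # Smallest code point allowed for each sequence length (rejects overlongs).
    min_for_extra = (0x0, 0x80, 0x800, 0x10000)
    i, n = 0, len(data)
    while i < n:
        b0 = data[i]
        if b0 < 0x80:                # 1-byte sequence (ASCII)
            cp, extra = b0, 0
        elif 0xC2 <= b0 <= 0xDF:     # lead byte of a 2-byte sequence
            cp, extra = b0 & 0x1F, 1
        elif 0xE0 <= b0 <= 0xEF:     # lead byte of a 3-byte sequence
            cp, extra = b0 & 0x0F, 2
        elif 0xF0 <= b0 <= 0xF4:     # lead byte of a 4-byte sequence
            cp, extra = b0 & 0x07, 3
        else:
            raise ValueError(f"invalid lead byte at offset {i}")
        if i + extra >= n:
            raise ValueError(f"truncated sequence at offset {i}")
        for j in range(1, extra + 1):
            b = data[i + j]
            if (b & 0xC0) != 0x80:   # continuation bytes look like 10xxxxxx
                raise ValueError(f"invalid continuation byte at offset {i + j}")
            cp = (cp << 6) | (b & 0x3F)
        if cp < min_for_extra[extra] or 0xD800 <= cp <= 0xDFFF or cp > 0x10FFFF:
            raise ValueError(f"invalid code point at offset {i}")
        yield cp
        i += extra + 1


# Example: two characters, three bytes in, two code points out.
code_points = list(decode_utf8("né".encode("utf-8")))
```

Python's built-in `bytes.decode("utf-8")` already does this, of course; the point is only to show the kind of layer that would sit between the input buffer and a matcher working on code points.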

I'd also like to be able to generate JavaScript code so I can embed a
parser in a web page.  Any idea how much effort that would take?

Thanks,

Gregg

_______________________________________________
Re2c-general mailing list
Re2c-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/re2c-general
