On Friday, 20 November 2009, Григорий Фучеджи wrote:
> I think this would work
> (0x20 0x70..0xFF) | (0x21 0x00..0x8F)
> But I think there should be better solution

I've finally understood that this conversion is not valid:
  #x2070-#x218F  !=   (0x20 0x70..0xFF) | (0x21 0x00..0x8F)

The former is a range of Unicode code points written in hexadecimal, not a 
sequence of bytes. Its first code point, #x2070, is the symbol ⁰, and in 
UTF-8 it is encoded as the three bytes "0xE2 0x81 0xB0", not as "0x20 0x70".
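
To make the difference concrete, here is a small Python check (just an
illustration, nothing to do with Ragel itself). It encodes the code point
0x2070 as UTF-8 and compares that with naively splitting the number into a
high byte and a low byte, which is what the quoted expression does. The
variable names are made up for this example.

```python
code_point = 0x2070  # U+2070 SUPERSCRIPT ZERO

# Real UTF-8 encoding of the code point: three bytes.
encoded = chr(code_point).encode("utf-8")
print(" ".join(f"0x{b:02X}" for b in encoded))  # 0xE2 0x81 0xB0

# Naive split of the code point number into high and low byte.
naive = bytes([code_point >> 8, code_point & 0xFF])
print(" ".join(f"0x{b:02X}" for b in naive))    # 0x20 0x70

# As UTF-8 input, the naive bytes are a space followed by the letter "p".
print(repr(naive.decode("utf-8")))              # ' p'
```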

A converter utility:

  http://www.ltg.ed.ac.uk/~richard/utf-8.cgi?input=C0&mode=hex


Of course it would be great if Ragel would accept Unicode code points in 
hexadecimal directly in the .rl files, so I could write:

  0x2070..0x218F

instead of having to convert it into many hexadecimal byte ranges.
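
Until something like that exists, the conversion can be done by a small
script. The following is only a sketch in Python, under these assumptions:
the input is UTF-8, the function names (utf8_encode, utf8_byte_ranges,
format_alternatives) are invented for this example, surrogates are not
filtered out, and the output is printed in the informal notation used in
this thread, so it should be checked before pasting it into an .rl file.
The idea is to split the code point range until, within each piece, every
byte position varies independently of the others.

```python
def utf8_encode(cp):
    """Encode one code point as a list of UTF-8 byte values (no surrogate check)."""
    if cp < 0x80:
        return [cp]
    if cp < 0x800:
        return [0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)]
    if cp < 0x10000:
        return [0xE0 | (cp >> 12), 0x80 | ((cp >> 6) & 0x3F), 0x80 | (cp & 0x3F)]
    return [0xF0 | (cp >> 18), 0x80 | ((cp >> 12) & 0x3F),
            0x80 | ((cp >> 6) & 0x3F), 0x80 | (cp & 0x3F)]


def utf8_byte_ranges(lo, hi):
    """Split code points lo..hi into sequences of (min_byte, max_byte) pairs."""
    # First split where the encoded length changes (1, 2, 3 or 4 bytes).
    for boundary in (0x7F, 0x7FF, 0xFFFF):
        if lo <= boundary < hi:
            return utf8_byte_ranges(lo, boundary) + utf8_byte_ranges(boundary + 1, hi)
    # Then split until the trailing bytes cover complete sub-ranges.
    for i in (1, 2, 3):
        mask = (1 << (6 * i)) - 1
        if (lo & ~mask) != (hi & ~mask):
            if lo & mask != 0:
                return (utf8_byte_ranges(lo, lo | mask)
                        + utf8_byte_ranges((lo | mask) + 1, hi))
            if hi & mask != mask:
                return (utf8_byte_ranges(lo, (hi & ~mask) - 1)
                        + utf8_byte_ranges(hi & ~mask, hi))
    # Now each byte position is an independent range.
    return [list(zip(utf8_encode(lo), utf8_encode(hi)))]


def format_alternatives(sequences):
    """Render the sequences as '(b b..b b..b) | (...)' text."""
    def fmt(a, b):
        return f"0x{a:02X}" if a == b else f"0x{a:02X}..0x{b:02X}"
    return " | ".join(
        "(" + " ".join(fmt(a, b) for a, b in seq) + ")" for seq in sequences
    )


print(format_alternatives(utf8_byte_ranges(0x2070, 0x218F)))
# (0xE2 0x81 0xB0..0xBF) | (0xE2 0x82..0x85 0x80..0xBF) | (0xE2 0x86 0x80..0x8F)
```

So for this particular range three alternatives are enough, but wider ranges
that cross the 1, 2, 3 and 4 byte boundaries produce more of them.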

Regards.



-- 
Iñaki Baz Castillo <[email protected]>

_______________________________________________
ragel-users mailing list
[email protected]
http://www.complang.org/mailman/listinfo/ragel-users
