El 2018-11-13 16:29, Kevin Brubeck Unhammer escribió:
mansur <6688...@gmail.com> čálii:

Hello!

There are so many symbols that are not recognized by Apertium's tagger and not marked in any way. For example, apertium-tat does not recognize the
following symbols:
_ @ % ~ |
and many others.

Is it possible to use some special tag (^_/_<unknown>$) for such cases?

Yes, just give them analyses in tat.dix, e.g.:

<e><re>[_@%~|]</re><p><l/><r><s n="symb"/></r></p></e>

(untested)

I generally use <sym> for that, but there are a lot of Unicode symbols
and it's impossible to list them all in the .dix file, there should be
some kind of builtin for that I think.

Fran


_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to