El 2018-11-13 15:10, mansur escribió:
Hello!

There are so many symbols that are not recognized by Apertium's tagger
and not marked in any way. For example, apertium-tat does not
recognize the following symbols:
_ @ % ~ |

and many others.

Is it possible to use some special tag (^_/_<unknown>$) for such
cases?

Without tagging it is difficult to process Apertium's output.
Streamparser also leaves such cases in "blank" variable. Maybe you can
give some recommendations?


I agree that it would be nice to have a mode that does explicit tokenisation
of non-whitespace symbols and does not leave them in the blanks.

Could you file an issue for that?

Fran


_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to