El 2018-11-13 15:10, mansur escribió:
Hello!
There are so many symbols that are not recognized by Apertium's tagger
and not marked in any way. For example, apertium-tat does not
recognize the following symbols:
_ @ % ~ |
and many others.
Is it possible to use some special tag (^_/_<unknown>$) for such
cases?
Without tagging it is difficult to process Apertium's output.
Streamparser also leaves such cases in "blank" variable. Maybe you can
give some recommendations?
I agree that it would be nice to have a mode that does explicit
tokenisation
of non-whitespace symbols and does not leave them in the blanks.
Could you file an issue for that?
Fran
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff