El 2020-06-13 17:09, Jonathan Washington escribió:
On Sat, Jun 13, 2020, 11:50 Francis Tyers <fty...@prompsit.com> wrote:

El 2020-06-13 15:20, Tino Didriksen escribió:
I would like everyone to read and seriously consider this thread
and
give your opinion. This meanders a bit, so please read it all.


Here is a non-exhaustive list of potential pitfalls of using the
"surface
form is a tag" thing. As far as I understand the objective is to be
able
to
put the original surface form in the output translation as an
unknown
token
instead of the lemma.

Could you provide disambiguated analyses of each of these?  It's hard
to picture what the problem is for people who can't at least do the
relevant tokenisation in their heads.  (I'm not familiar with the
example in (2)—I imagine other people are similarly uninformed about
the other examples.)


infrastruktuurontwikkelingsplan - ^infrastruktuurontwikkelingsplan/infrastruktuur<n><cmp>+ontwikkeling<n><cmp>+plan<n><sg>$

chawe - ^chawe/chi<pr>+<px2sg>wech<n><rel>$

dárselo - ^dárselo/dar<vblex><inf>+le<prn><enc><p3><mf><sp>+lo<prn><enc><p3><nt>$

Fran


_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to