On Fri, Mar 27, 2020 at 09:58:53AM +0800, 杨伟哲 wrote:
>
> Of course, as a Chinese student, I would also be very happy to work
> on the CJK. We can keep communicating about the tweaks of the plan
> and the other details.

Awesome, could you perhaps then put together even a small example of how
apertium currently tokenises any Chinese language and how that would be
improved? If there is no existing apertium dictionary, you can make a toy
example with just a handful of words; this would be very interesting.

-- 
Doktor Tommi A Pirinen, Computational Linguist,
<https://flammie.github.io/purplemonkeydishwasher/>,
Universität Hamburg, Hamburger Zentrum für Sprachkorpora <http://hzsk.de>.
CLARIN-D Entwickler. President of ACL SIGUR SIG for Uralic languages
<http://gtweb.uit.no/sigur/>.
I tend to follow inline-posting style in desktop e-mail messages.
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff