Hi, I am this week on hliday with low internet availability so only few quick points. Firstly I strogly recommend joining #apertium IRC channel, I think even non-mentors will have useful clues. For the tokenisation problem I think the main resource is to understand various unicode technical reports that describe tokenisations and a C++ library like ICU, and then how apertium currently does tokenisations and how this projects code will interact, especially for the last point many other people in IRC know it better than me.
Regards, On Thu, Feb 27, 2020 at 01:45:09PM +0800, 杨伟哲 wrote: > Hi Francis and Flammie, > > I’m interested in the “Robust tokenisation in lttoolbox”[1] GSoC project. > And > currently I’m writing the proposal. > > I have completed the code challenge listed in the project, which has been > put > on Pastebin[2]. However, I’m not quite clear where this project starting > with. > And I will be much appreciate if you could list somewhere (e.g. GitHub repo > related to this project) for me to get started with. I will also try to > learn > and solve issues there if possible. > > Bio: I’m Chinese undergraduate in Software Engineering. In my freshman > year, I > joined the high-performance computing center[3] of the university as a > research > assistant. Through research and learning during the period, I have a deep > understanding of software architecture and open source projects. > > > [1] > http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/Robust_tokenisation > > [2] https://github.com/GavinWz/Apertium > > [3] http://cs.wfu.edu.cn/2014/0603/c1227a33048/page.htm > > > Regards, > > Weizhe Yang > _______________________________________________ > Apertium-stuff mailing list > Apertium-stuff@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/apertium-stuff -- Regards, Flammie <https://flammie.github.io> (Please note, that I will often include my replies inline instead of top or bottom of the mail)
signature.asc
Description: PGP signature
_______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff