El dt 12 de 03 de 2013 a les 10:55 +0000, en/na Jimmy O'Regan va escriure: > On 11 March 2013 18:04, sphinx jiang <[email protected]> wrote: > > Hi, > > > > I would like to suggest an idea for Apertium GSOC program. Several days age > > I talked to Jimmy, and was enlightened by the idea "Segmentation by itself". > > > > Sorry, I wasn't clear enough. The idea is "segmentation". I said that > segmentation by itself would probably make a good project, where "by > itself" was intended to mean that the project would just be > segmentation. > > In practice, you will also have to work on a language pair where this > can be used. zh_ZH-zh_TW is a perfect candidate, because segmentation > is not strictly necessary for this language pair - i.e., you use it to > demonstrate that segmentation is working, without _needing_ to. In > that regard, you will need to also allot some time to developing that > language pair, though it will not be the primary focus of the project.
So this would be for languages where word boundaries are not written ... Chinese/Thai/Lao/Khmer/Burmese etc. ? Yes, that could be interesting. But, if it was the case that the project would be for just segmentation, then ideally it would be tested on more than one language. Fran ------------------------------------------------------------------------------ Symantec Endpoint Protection 12 positioned as A LEADER in The Forrester Wave(TM): Endpoint Security, Q1 2013 and "remains a good choice" in the endpoint security space. For insight on selecting the right partner to tackle endpoint security challenges, access the full report. http://p.sf.net/sfu/symantec-dev2dev _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
