Dear Sir/Ma’am,

This is to inform you that I have completed the robust tokenisation coding challenge. The solution treats a character as alphabetic if its Unicode general category is Lu (Letter, uppercase), Ll (Letter, lowercase), Lm (Letter, modifier) or Lo (Letter, other), and as non-alphabetic otherwise.

The link to the solution is given below:
https://github.com/git-ayush-pradhan/Apertium_gsoc

Thanks and regards,
Ayush Pradhan
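
P.S. For readers of the list, here is a minimal sketch of the classification rule described above. It assumes Python and its standard unicodedata module. The names is_alphabetic and tokenise are illustrative only and are not taken from the linked repository, and emitting each non-alphabetic, non-space character as its own token is an assumption about the challenge, not a statement of what the submitted solution does.

```python
import unicodedata

# The four general categories named in this mail.
LETTER_CATEGORIES = {"Lu", "Ll", "Lm", "Lo"}


def is_alphabetic(ch):
    """Return True if ch belongs to one of the four letter categories."""
    return unicodedata.category(ch) in LETTER_CATEGORIES


def tokenise(text):
    """Split text into runs of alphabetic characters.

    Assumption: every non-alphabetic character that is not whitespace
    becomes a token of its own, and whitespace is dropped.
    """
    tokens = []
    current = []
    for ch in text:
        if is_alphabetic(ch):
            current.append(ch)
            continue
        # A non-alphabetic character ends the current word, if any.
        if current:
            tokens.append("".join(current))
            current = []
        if not ch.isspace():
            tokens.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens


print(tokenise("Hello, world!"))
```

Because the check relies on the general category and not on an ASCII range, accented letters and letters from other scripts are grouped into words in the same way as plain Latin letters.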
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff