Hi Utku, Just to follow up on Hèctor's points, for the coding challenge, we'd want to see a prototype of the transducer. It doesn't have to have any real level of usefulness, but it should solve some of the types of problems you expect to encounter. That is, you should be about to implement a little bit of the morphotactics and morphophonology, even if for only a couple words.
The same goes for any GSoC student looking to work on a transducer. I'm also curious about the corpus. Not just how big it is, but whether it is available. It's certainly not a requirement for GSoC that it be open; if it needs to be kept private, it can be kept between you and your mentors. But it would be good to document these things early on. -- Jonathan 14 apr 2021, Ç. tarixində 13:50 tarixində Hèctor Alòs i Font <hectora...@gmail.com> yazdı: > > Hi Utku, > > Your proposal seems interesting. > > 1. Did you take look to apertium-ell ? How much could it help? > 2. In your proposal, you speak about a corpus. You intend to reach 80% > coverage. From what kind of corpus are you speaking? How much Romeyka is > written? > 3. Could you explain on what you understand by "modelling allomorphy"? Is > that Apertium's morphological disambiguation? > 4. Could you also explain how do you intend to tag "content phenomenon"? > 5. I couldn't find anything about your coding challenge. The coding challenge > is a must. It shows that you know to install and have a basic understanding > of Apertium. > > Hèctor > > Missatge de Utku Turk <utkuturkb...@gmail.com> del dia dc., 14 d’abr. 2021 a > les 15:58: >> >> Hi, >> >> My name is Utku Türk. I am a linguistics student at Boğaziçi University, >> Turkey. I want to attend GSoC with a Romeyka morphological analyzer project. >> >> Romeyka is one of the many Modern Greek dialects spoken in Asia Minor. It >> has no NLP footprint, and I believe it is an important first step for >> Quantitative Language Contact and Dialectology studies. Its morphology and >> lexicon are heavily influenced by Ancient Greek, Turkish, and Laz. >> >> The following link[1] is my draft for the GSoC proposal. Any feedback is >> very much appreciated! >> >> [1]: >> https://docs.google.com/document/d/1CJrD7TRJvFKKD5qsW_fnLdNbk1iQ2t4_dim3MA3g4_E/edit?usp=sharing >> _______________________________________________ >> Apertium-stuff mailing list >> Apertium-stuff@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/apertium-stuff > > _______________________________________________ > Apertium-stuff mailing list > Apertium-stuff@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/apertium-stuff _______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff