hi im interested in 1.12 ud and apertium integration. How do i get started ?
On Fri, 15 Feb, 2019, 17:34 Priyank Modi, <priyankmod...@gmail.com> wrote: > Hi everyone, > I am really interested in working on 1.2(for hin-pan(Hindi-Punjabi)) if > it's open. Could you please connect me to the POC for the language pair. If > that's not available, could I work on 1.15? I want to get started right > away, I have already gone through the wiki pages for both and just want to > get connected to the mentor and access to the repository (for hin-pan, as > the latest commit had a bug, I need to revert back to check the functioning) > > On Mon 28 Jan, 2019, 10:43 PM Francis Tyers <fty...@prompsit.com wrote: > >> Here is my run-down on the current GSOC ideas page: >> >> 1.1 Anaphora resolution for machine translation >> >> Nice project idea, but not sure in 3 months. >> >> 1.2 Bring a released language pair up to state-of-the-art quality >> >> Always needed >> >> 1.3 Robust tokenisation in lttoolbox >> >> Up for grabs, we need this >> >> 1.4 Adopt an unreleased language pair >> >> Always needed >> >> 1.5 Extend lttoolbox to have the power of HFST >> >> I think getting this one is unlikely and requires more than 3 months. >> >> 1.6 Robust recursive transfer >> >> Keep, this would be really great. I got asked to run a workshop on >> Apertium >> recently and then unasked when they found out that the formalisms >> didn't >> actually create parse trees :) >> >> 1.7 Extend weighted transfer rules >> >> There is ongoing work in this, it would need to be supervised carefully: >> >> https://github.com/sevilaybayatli/apertium-ambiguous >> >> I would say a nice project would be to really use this on a new language >> pair >> >> 1.8 Improvements to the Apertium website >> >> Not sure >> >> 1.9 User-friendly lexical selection training >> >> I think getting this one is unlikely and requires more than 3 months. >> Also has >> been tried several times without luck. >> >> 1.10 Light alternative format for all XML files in an Apertium >> language pair >> >> I'm not sure about this one. >> >> 1.11 Bilingual dictionary enrichment via graph completion >> >> There is code for this, it was a GSOC project last year but wasn't >> merged, I'm >> not sure how well it works. >> >> 1.12 UD and Apertium integration >> >> This is a very useful project. If we can take advantage of UD corpora we >> can >> make supervised taggers for around 70% of our languages. >> >> 1.13 Add weights to lttoolbox >> >> This was done last year. A nice project would be to actually make use of >> it. >> >> 1.14 Improving language pairs mining Mediawiki Content Translation >> postedits >> 1.15 Unsupervised weighting of automata >> >> Open >> >> 1.16 Improvements to UD Annotatrix >> >> This is a really useful tool. >> >> 1.17 apertium-separable language-pair integration >> >> Agree, but I think that it should not just be apertium-separable, but >> perhaps >> something like "upgrade a language pair to use all the latest apertium >> tricks" >> >> 1.18 Create FST-based module for disambiguating >> >> I like this idea, but I'm not sure three months is enough time, without >> someone >> who really knows what they are doing with both the FST library and >> apertium. >> >> 1.19 Python API/library for Apertium >> >> This was mostly done right? I think this is still a really important >> project >> >> 1.20 TIPP functionality for Apertium >> >> Not sure >> >> There is a lot of functionality that is not used widely that could be >> really >> used to improve performance of language pairs. >> >> * apertium-separable >> * weights in lttoolbox >> * weighted transfer >> >> Fran >> >> >> _______________________________________________ >> Apertium-stuff mailing list >> Apertium-stuff@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/apertium-stuff >> > _______________________________________________ > Apertium-stuff mailing list > Apertium-stuff@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/apertium-stuff >
_______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff