Hi,
I'm sorry, I don't know GSoC well. Maybe some other people on the
mailing list can answer you better.
Regards,
Aure SÉGUIER
Head of the IT department
Congrès permanent de la lenga occitana
Phone: +33 (0)5 32 00 00 64
Website: www.locongres.org
Address: La Ciutat - Creem!, 5-7 rue de la Fontaine, 64000 Pau
facebook <https://www.facebook.com/lo.congres>
twitter <https://twitter.com/locongres>
linkedin
<https://www.linkedin.com/company/congres-permanent-de-la-lenga-occitane/>
instagram <https://www.instagram.com/locongres/>
On 26/01/2025 at 00:03, Dhruva Kumar Kaushal (B22AI017) via
Apertium-stuff wrote:
Hi Aure,
I have been working with Apertium as a developer for a month and have
tried to solve two issues on GitHub. I am hoping to apply for GSoC.
Could you please advise me on what to do and which projects to target,
so that I can keep contributing afterwards too?
I look forward to your reply. Thanks.
On Wed, Jan 22, 2025 at 6:53 PM Aure Séguier <a.segu...@locongres.org>
wrote:
Hi,
For the Occitan language, we would need to change the dialect
management to something like what Catalan and Nynorsk have. But we
can't mentor this, as nobody in my company knows how to do
it. That's why we didn't propose it for GSoC. I think we will
need to do it ourselves, taking the time to understand how.
But if someone is able and willing to mentor this, it would be
great to have it done through GSoC.
Regards
Aure SÉGUIER
On 22/01/2025 at 11:58, Kevin Brubeck Unhammer wrote:
Hi,
Unfortunately I won't be able to mentor this summer.
But if others are able to mentor, I think most of my thoughts from last
year still need work:
https://sourceforge.net/p/apertium/mailman/apertium-stuff/thread/87r0i4qvnv.fsf%40fsfe.org/#msg58729086
More language data in particular is always relevant.
Another cool idea would be a way to query a small
language model from transfer. E.g. when matching genitives we'd like to
decide between keeping the genitive, rewriting with a preposition (which
preposition?) or rewriting to a compound – these kinds of rules tend to
end up as long lists that use little linguistic insight, i.e. a better
fit for learnt models than for rules. Would it be possible to use a
CPU-only LM trained on the target language and do something like
<lm query="…"/> to decide which rewrite rule to apply, or to find the
antecedent of some word? It would be an interesting project to find out
how much help, if any, we can get from a model that's small enough to
run on the systems Apertium currently runs on.
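To make the idea concrete, here is a minimal sketch of choosing between
rewrites by target-language LM score. Everything in it is an assumption for
illustration: Apertium has no <lm> element today, the BigramLM class and
choose_rewrite function are hypothetical names, and a toy add-one-smoothed
bigram model with an English toy corpus stands in for whatever small model
would really be used. Scores are averaged per transition so that shorter
candidates (such as compounds) are not favoured just for being short.

```python
import math
from collections import Counter


class BigramLM:
    """Toy add-one-smoothed bigram model; a stand-in for a real small LM."""

    def __init__(self, sentences):
        self.unigrams = Counter()
        self.bigrams = Counter()
        for sentence in sentences:
            toks = ["<s>"] + sentence.lower().split() + ["</s>"]
            self.unigrams.update(toks)
            self.bigrams.update(zip(toks, toks[1:]))
        self.vocab_size = len(self.unigrams)

    def avg_logprob(self, sentence):
        """Average log-probability per bigram transition (length-normalised)."""
        toks = ["<s>"] + sentence.lower().split() + ["</s>"]
        total = 0.0
        for prev, cur in zip(toks, toks[1:]):
            # Add-one smoothing so unseen bigrams get a small non-zero mass.
            num = self.bigrams[(prev, cur)] + 1
            den = self.unigrams[prev] + self.vocab_size
            total += math.log(num / den)
        return total / (len(toks) - 1)


def choose_rewrite(lm, candidates):
    """Return the name of the candidate rewrite the LM scores highest."""
    return max(candidates, key=lambda name: lm.avg_logprob(candidates[name]))


# Hypothetical target-language training data (English used as a toy example).
CORPUS = [
    "the roof of the house is red",
    "the door of the house is open",
    "the roof of the car is wet",
]

# Hypothetical outputs of three competing transfer rules for one genitive.
CANDIDATES = {
    "genitive": "the house's roof is red",
    "preposition": "the roof of the house is red",
    "compound": "the houseroof is red",
}

if __name__ == "__main__":
    lm = BigramLM(CORPUS)
    print(choose_rewrite(lm, CANDIDATES))
```

In a real pipeline the transfer module would generate the candidates itself
and only ask the model to rank them; whether a model small enough for
Apertium's usual deployments ranks them well is exactly the open question.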
best regards,
Kevin Brubeck Unhammer
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff