Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-27 Thread Hèctor Alòs i Font
Awesome, thanks, Tino! Missatge de Tino Didriksen del dia dj., 27 d’ag. 2020 a les 15:41: > First run is now online at https://apertium.projectjj.com/gsoc2020/ > > Collected for elmurod1202, hectoralos, khannatanmai, priyankmodiPM. > Collection period is 2020-05-04 through 2020-08-31. > > --

Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-27 Thread Tino Didriksen
First run is now online at https://apertium.projectjj.com/gsoc2020/ Collected for elmurod1202, hectoralos, khannatanmai, priyankmodiPM. Collection period is 2020-05-04 through 2020-08-31. -- Tino Didriksen On Sat, 22 Aug 2020 at 14:21, Tino Didriksen wrote: > As for previous years, I will

Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-27 Thread Hèctor Alòs i Font
Hi Tino, My commits can be already collected. As said, that is all commits in: https://github.com/apertium/apertium-frp https://github.com/apertium/apertium-fra https://github.com/apertium/apertium-fra-frp username: hectoralos I'd collect everything since June 1st (the first official day of

Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-22 Thread Elmurod Kuriyozov
Hi. My contribution is being reported/documented here . Repos: - Apertium-Uzb - Apertium-Tur-Uzb - Apertium-Tur

Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-22 Thread Priyank Modi
I've submitted all commits in these repos . *Username** :* priyankmodiPM Still got some stuff to do, will be done in a few days. Regards, Priyank On Sat, Aug 22, 2020 at 8:24 PM Hèctor Alòs i Font

Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-22 Thread Hèctor Alòs i Font
Thank you very much, Tino! My stuff are all commits of the user hectoralos in: https://github.com/apertium/apertium-frp https://github.com/apertium/apertium-fra https://github.com/apertium/apertium-fra-frp ... but it is not ready yet. I have to work a few more days. Hèctor Missatge de Tino

Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-22 Thread Tanmai Khanna
My contributions are documented here . *तन्मय खन्ना * *Tanmai Khanna* On Sat, Aug 22, 2020 at 5:52 PM Tino Didriksen wrote: > As for previous years, I will run a code collection for GSoC changes. > > I just need to

[Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-22 Thread Tino Didriksen
As for previous years, I will run a code collection for GSoC changes. I just need to know who and what. Usernames and repos. -- Tino Didriksen ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net

[Apertium-stuff] GSoC 2020 Final Proposal

2020-03-31 Thread TEJASVI CHEBROLU
Hello Everyone, My final GSoC proposal "Adopt an Unreleased Language Pair: Hindi - Telugu" is up on the Apertium Wiki. The link is http://wiki.apertium.org/wiki/Chebrolutejasvi/GSOC_2020_proposal:_Hindi-Telugu. Any last-minute suggestions or comments would be greatly appreciated. Regards,

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-27 Thread Rajarshi Roychoudhury
I have modified the proposal for better explanation of the process. Kindly give a look at it. The bilingual dictionary needs some work to be done, I didn't time to complete it as I was busy determining the sentiment tag . I will try to incorporate it as soon as possible. Please suggest if any

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-27 Thread Rajarshi Roychoudhury
The sentiment tags will help to form more detailed and diverse patterns which can help to form better rules to disambiguate, lexical selection and reorder . As far as those languages where sentiwordnet does not exist, a linguist will be able to determine sentiment polarity. Since i have the

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-27 Thread Tanmai Khanna
Hey I have one doubt, The examples given for mistranslation, I didn't quite understand how sentiment analysis would fix those. Also what about languages for which a SentiWordNet doesn't exist? Thanks and Regards, Tanmai On Fri, Mar 27, 2020 at 3:56 PM Rajarshi Roychoudhury <

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-27 Thread Rajarshi Roychoudhury
Hi, I have finished writing my proposal , wrote a code on how to do sentiment analysis with character embedding as a coding challenge, added words to monolingual and bilingual dictionaries and designed a constraint grammar. I am working on building the bidix and lrx files for now.. Would be very

Re: [Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-24 Thread Katherine Wang
Hi Daniel, Thank you for your help. Unfortunately, I will not be joining Apertium for GSOC 2020. Please feel free to remove my account to free up available username. Thank you and best wishes, Katherine Wang On Mon, Mar 16, 2020 at 9:56 PM Daniel Swanson wrote: > A randomly generated password

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-23 Thread Tino Didriksen
"A randomly generated password for Rroychoudhury has been sent to rroychoudhu...@gmail.com." -- Tino Didriksen On Mon, 23 Mar 2020 at 03:10, Rajarshi Roychoudhury < rroychoudhu...@gmail.com> wrote: > I have completed writing my gsoc proposal, can I get a wiki account? > > Username:

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-22 Thread Rajarshi Roychoudhury
I have completed writing my gsoc proposal, can I get a wiki account? Username: rroychoudhury email: rroychoudhu...@gmail.com On Fri, Mar 6, 2020, 21:40 Rajarshi Roychoudhury wrote: > One is .odt format , the other in .pdf. Kindly give it a read and give > suggestions. > Best, > Rajarshi > > On

[Apertium-stuff] GSoC 2020 proposal: French-Arpitan

2020-03-20 Thread Hèctor Alòs i Font
Hi all, I've put a preliminary proposal for GSoC 2020 at http://wiki.apertium.org/wiki/Hectoralos/GSOC_2020_proposal:_French-Arpitan The goal is to create a bidirectional translator between French and Arpitan. Arpitan (often called Franco-Provençal) is an endangered and heavily under-resourced

Re: [Apertium-stuff] [GSoC 2020] Requesting a Wiki Account

2020-03-18 Thread Daniel Swanson
A randomly generated password for Chinmayk has been sent to chinmay.kunki...@gmail.com. It can be changed on the change password page upon logging in. On Wed, Mar 18, 2020 at 2:16 PM Chinmay Kunkikar wrote: > I am interested in working on - Adopt an unreleased language pair > [mar-kon] - and

[Apertium-stuff] [GSoC 2020] Requesting a Wiki Account

2020-03-18 Thread Chinmay Kunkikar
I am interested in working on - Adopt an unreleased language pair [mar-kon] - and wish to submit a proposal for the same. I am requesting a wiki account with username as chinmayk. Thanks ___ Apertium-stuff mailing list

Re: [Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Daniel Swanson
A randomly generated password for Katherinew has been sent to katheri...@nyu.edu. It can be changed on the change password page upon logging in. On Tue, Mar 17, 2020 at 12:50 AM Katherine Wang wrote: > Hi Daniel, > 2. You can probably tell from my last name that I know am familiar with >

Re: [Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Katherine Wang
Hi Daniel, 2. You can probably tell from my last name that I know am familiar with Chinese (but I have limited knowledge) so unfortunately, my skills would only lie in French-English. 1. If you think I could join your team, username: katherine ? or katherinew ? I don't really have a preference,

Re: [Apertium-stuff] Gsoc 2020

2020-03-16 Thread Daniel Swanson
A randomly generated password for Shrey1608 has been sent to modishrey...@gmail.com. It can be changed on the change password page upon logging in. On Tue, Mar 17, 2020 at 12:46 AM Shrey Modi wrote: > Hello Daniel > I am applying for gsoc 2020 and i am working on one of the ideas so can i >

[Apertium-stuff] Gsoc 2020

2020-03-16 Thread Shrey Modi
Hello Daniel I am applying for gsoc 2020 and i am working on one of the ideas so can i get a wiki account? I would like the username shrey1608 ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net

Re: [Apertium-stuff] GSOC 2020

2020-03-16 Thread Shubham Dikshit
Thank you for your help. On Tue, Mar 17, 2020 at 9:29 AM Daniel Swanson wrote: > A randomly generated password for Shubham16 > > has > been sent to iamsds...@gmail.com. It can be changed on the *change > password

Re: [Apertium-stuff] GSOC 2020

2020-03-16 Thread Daniel Swanson
A randomly generated password for Shubham16 has been sent to iamsds...@gmail.com. It can be changed on the *change password * page upon logging in. On Mon, Mar 16, 2020

Re: [Apertium-stuff] GSOC 2020

2020-03-16 Thread Shubham Dikshit
Hi, I would like my username to be shubham16 or shubham1011 On Tue, Mar 17, 2020 at 9:17 AM Daniel Swanson wrote: > Hi Shubham, > > What would you like your username to be? > > Daniel > > On Mon, Mar 16, 2020 at 11:44 PM Shubham Dikshit > wrote: > >> Hi, >> I have applied to Apertium in GSOC

Re: [Apertium-stuff] GSOC 2020

2020-03-16 Thread Daniel Swanson
Hi Shubham, What would you like your username to be? Daniel On Mon, Mar 16, 2020 at 11:44 PM Shubham Dikshit wrote: > Hi, > I have applied to Apertium in GSOC 2020 with the title of the project: > Indian Language Parsing. > And would like to request for a WIki-Account. > Thank you >

[Apertium-stuff] GSOC 2020

2020-03-16 Thread Shubham Dikshit
Hi, I have applied to Apertium in GSOC 2020 with the title of the project: Indian Language Parsing. And would like to request for a WIki-Account. Thank you ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net

Re: [Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Daniel Swanson
Hi Katherine, What would you like as your username? Two things to note about your proposed project: 1. We do pretty much everything on Github rather than SourceForge now, so the link you want is https://github.com/apertium/apertium-fra-eng 2. We generally prefer to focus on low-resource language

[Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Katherine Wang
Name: Katherine Wang E-mail address: katheri...@nyu.edu Interested in the Following Projects. *Would like to connect with the mentor leading the unreleased language pair [fra-eng] please.* 1. Adopt an unreleased language pair [fra-eng]

Re: [Apertium-stuff] GSOC 2020

2020-03-12 Thread Saurabh Rai
Shrey, He is already in the IRC. He needs to get in contact with the mentors regarding the same. On Thu, Mar 12, 2020, 5:19 PM Shrey Modi wrote: > Hey kalyfa nice to know that you are partici[pating in gsoc and want to > contribute.You can contact mentors through freenode irc >

Re: [Apertium-stuff] GSOC 2020

2020-03-12 Thread Shrey Modi
Hey kalyfa nice to know that you are partici[pating in gsoc and want to contribute.You can contact mentors through freenode irc http://webchat.freenode.net/?channels=apertium On Thu, 12 Mar 2020 at 12:53, Abdullahi Muhammad Khalifa < kalyfacl...@gmail.com> wrote: > Hi Apertium. I am kalyfa,

[Apertium-stuff] GSOC 2020

2020-03-12 Thread Abdullahi Muhammad Khalifa
Hi Apertium. I am kalyfa, looking forward to contribute to GSOC Apertium(Apertium English - Hausa). i am ready to submit my coding challenge but couldnt contact my mentors: my mentors are Mikel Forcada and Memdu Gokirmak. Thanks ___ Apertium-stuff

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-06 Thread Rajarshi Roychoudhury
One is .odt format , the other in .pdf. Kindly give it a read and give suggestions. Best, Rajarshi On Fri, 6 Mar 2020 at 21:15, Francis Tyers wrote: > El 2020-03-06 15:35, Scoop Gracie escribió: > > Sending it as .odt would be great. > > > > On Fri, Mar 6, 2020, 07:27 Rajarshi Roychoudhury > >

Re: [Apertium-stuff] GSOC-2020

2020-03-06 Thread Francis Tyers
El 2020-03-06 15:27, Himanshu choudhary escribió: Thank you. I will look into it. On Fri, Mar 6, 2020, 2:03 PM Kevin Brubeck Unhammer wrote: Himanshu choudhary čálii: Also can you please give me an overview of how and where I can use the neural methods in Apertium. I am really

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-06 Thread Francis Tyers
El 2020-03-06 15:35, Scoop Gracie escribió: Sending it as .odt would be great. On Fri, Mar 6, 2020, 07:27 Rajarshi Roychoudhury wrote: Then how should I send the file. I don't know if there is anyone to mentor this since this is not from the list of ideas mentioned . On Fri, Mar 6, 2020,

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-06 Thread Scoop Gracie
Sending it as .odt would be great. On Fri, Mar 6, 2020, 07:27 Rajarshi Roychoudhury wrote: > Then how should I send the file. I don't know if there is anyone to mentor > this since this is not from the list of ideas mentioned . > > On Fri, Mar 6, 2020, 20:49 Francis Tyers wrote: > >> El

Re: [Apertium-stuff] GSOC-2020

2020-03-06 Thread Himanshu choudhary
Thank you. I will look into it. On Fri, Mar 6, 2020, 2:03 PM Kevin Brubeck Unhammer wrote: > Himanshu choudhary > > čálii: > > > Also can you please give me an overview of how and where I can use the > > neural methods in Apertium. I am really interested to work on those > > projects. > >

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-06 Thread Rajarshi Roychoudhury
Then how should I send the file. I don't know if there is anyone to mentor this since this is not from the list of ideas mentioned . On Fri, Mar 6, 2020, 20:49 Francis Tyers wrote: > El 2020-03-06 08:40, Rajarshi Roychoudhury escribió: > > Hi, > > I have written my idea in the file attached .

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-06 Thread Francis Tyers
El 2020-03-06 08:40, Rajarshi Roychoudhury escribió: Hi, I have written my idea in the file attached . It is just the idea , not the project proposal . Kindly read the idea and give feedback on whether this can be a feasible GSoC project. Best, Rajarshi Please do not use proprietary formats

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-06 Thread Rajarshi Roychoudhury
Hi, I have written my idea in the file attached . It is just the idea , not the project proposal . Kindly read the idea and give feedback on whether this can be a feasible GSoC project. Best, Rajarshi On Fri, 28 Feb 2020 at 06:31, Rajarshi Roychoudhury < rroychoudhu...@gmail.com> wrote: > Here

Re: [Apertium-stuff] GSOC-2020

2020-03-06 Thread Kevin Brubeck Unhammer
Himanshu choudhary čálii: > Also can you please give me an overview of how and where I can use the > neural methods in Apertium. I am really interested to work on those > projects. https://github.com/apertium/lttoolbox/issues/78 is one such, though that alone probably isn't a GsoC-sized task.

Re: [Apertium-stuff] GSOC-2020

2020-03-05 Thread Himanshu choudhary
Hi, Thanks for the reply, I totally understand the need for the languages, even I have done and published some research work on Indian Languages. That's why I got interested and want to contribute to Apertium. As we have to use rule-based learning so as a native speaker I can only work on Hindi

Re: [Apertium-stuff] GSOC-2020

2020-03-05 Thread Jonathan Washington
Hi Himanshu, Apertium generally isn't looking for "some developer" to write MT systems. The community is usually looking for people who know a language and care about it. Also, there are over 150 million speakers of those languages combined, so I'm not sure how rare it is to find developers who

Re: [Apertium-stuff] GSOC-2020

2020-03-05 Thread Himanshu choudhary
Ok thanks On Fri, Mar 6, 2020, 1:49 AM Scoop Gracie wrote: > It needs to be rule based. > > On Thu, Mar 5, 2020, 12:06 Himanshu choudhary < > himanshuchoudhary_bt2...@dtu.ac.in> wrote: > >> Hi, >> >> I just want to ask can't we use Neural machine translation or >> unsupervised machine

Re: [Apertium-stuff] GSOC-2020

2020-03-05 Thread Scoop Gracie
It needs to be rule based. On Thu, Mar 5, 2020, 12:06 Himanshu choudhary < himanshuchoudhary_bt2...@dtu.ac.in> wrote: > Hi, > > I just want to ask can't we use Neural machine translation or unsupervised > machine translation rather than rule-based learning for the task "Apertium >

[Apertium-stuff] GSOC-2020

2020-03-05 Thread Himanshu choudhary
Hi, I just want to ask can't we use Neural machine translation or unsupervised machine translation rather than rule-based learning for the task "Apertium English--Hausa/Igbo/Swahili/Tigrinya/Yoruba". As I got some open-source data for some of these languages and I believe neural machine

Re: [Apertium-stuff] GSOC-2020

2020-03-04 Thread Mikel L. Forcada
Dear Himanshu: I am copying the message to the project list (apertium-stuff), just in case someone else wants to follow up. You mention 5 African language pairs. Which language pair are you proficient in? If you think you are well equipped to tackle the building of a rule-based (not

Re: [Apertium-stuff] GSOC-2020

2020-02-28 Thread Himanshu choudhary
Ok, thanks On Fri, Feb 28, 2020, 10:06 PM Sevilay Bayatlı wrote: > Hello, > > In this link http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code > you can choose the idea to contribute to the project. > > Sevilay > > On Fri, 28 Feb 2020, 19:14 Himanshu choudhary, < >

Re: [Apertium-stuff] GSOC-2020

2020-02-28 Thread Sevilay Bayatlı
Hello, In this link http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code you can choose the idea to contribute to the project. Sevilay On Fri, 28 Feb 2020, 19:14 Himanshu choudhary, < himanshuchoudhary_bt2...@dtu.ac.in> wrote: > My name is Himanshu, I am currently studying at Delhi

[Apertium-stuff] GSOC-2020

2020-02-28 Thread Himanshu choudhary
My name is Himanshu, I am currently studying at Delhi College of Engineering India. I want to know how I can contribute to Apertium. I am highly interested in machine translation tasks and want to work on an open-source project. I had also addressed some same issues in my research papers as in

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Rajarshi Roychoudhury
Here are some published papers on how character embeddings are used for classification. https://www.google.com/url?sa=t=web=j=https://arxiv.org/abs/1810.03595=2ahUKEwiu-ajdgvPnAhXXxzgGHQAWA3cQFjAVegQIDBAB=AOvVaw0LQ60M-KXtk-NGyAoVqmeU https://lsm.media.mit.edu/papers/tweet2vec_vvr.pdf

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Kevin Brubeck Unhammer
Tino Didriksen čálii: > One major issue specific to Apertium is that the source information is no > longer available in the target generation step. It might make sense to have something like this right after bilingual dictionary lookup (as an alternative or complement to lrx-proc). Perhaps a

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Tanmai Khanna
How exactly can characters predict sentiment? Don’t you still need some training data for pairs? English, Hindi, Bangla aren’t really low resource languages. Anyway, we can continue this discussion on the IRC so that it’ll be easier and more people can contribute to the discussion. Tanmai

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Rajarshi Roychoudhury
To answer the question on how to analyse sentiment on low resource language , I think character embedding would be the best option. The words in the corpus is not exhaustive but the number of unique characters is certainly well deterministic. We can figure out the embedding weight for each

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Rajarshi Roychoudhury
As I mentioned earlier, I would like to work on English-Hindi or English-Bengali translation, the dataset can be obtained from sentiwordnet for Indian languages, https://amitavadas.com/sentiwordnet.php which is by far the most resourceful dataset available for sentiment analysis.It contains data

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Tanmai Khanna
Hi, I have a few questions about this: 1. How would you analyse the sentiment of the source text? Considering the language pairs that Apertium deals with are low resource languages. 2. As Tino mentions, is there a problem of sentiment loss in Apertium? Any examples of this? 3. Doesn't the

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Rajarshi Roychoudhury
The effect won't be very evident on simple sentences, I think it would be more effective on sentences where choice of words can decide the efficiency of translation. It's not about if "Watch out" could be " be careful" , it's about choosing words that can retain the urgency in "watch out".

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Scoop Gracie
So, "Watch out!" Could become "Be careful"? On Thu, Feb 27, 2020, 10:13 Rajarshi Roychoudhury wrote: > It is not just about minimizing loss of sentiment , it is about using > that information for better translation. A very trivial example would be > that for some situations , sentences can

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Rajarshi Roychoudhury
It is not just about minimizing loss of sentiment , it is about using that information for better translation. A very trivial example would be that for some situations , sentences can project a strong sentiment and simple translation may not always yield the best result. However if we can use the

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Tino Didriksen
My first question would be, is this actually a problem for rule-based machine translation? I am not a linguist, but given how RBMT works I can't really see where sentiment would be lost in the process, especially because Apertium is designed for related languages where sentiment is mostly the

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Rajarshi Roychoudhury
I just need to know which libraries are used(if any STL) to store the words and how the translation is actually done. I plan to use an ordered map to store the word as key and sentiment value as value . I can choose the one with best sentiment by running an iterative search. Or a better idea would

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Scoop Gracie
Oh okay. That should be fine. On Thu, Feb 27, 2020, 08:24 Rajarshi Roychoudhury wrote: > No I just need python to get the result, which can be written in a text > file and read using c++. It won't depend on python. > > On Thu, Feb 27, 2020, 21:52 Scoop Gracie wrote: > >> Oh, okay. So Python

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Rajarshi Roychoudhury
No I just need python to get the result, which can be written in a text file and read using c++. It won't depend on python. On Thu, Feb 27, 2020, 21:52 Scoop Gracie wrote: > Oh, okay. So Python would not be needed at runtime? > > On Thu, Feb 27, 2020, 08:20 Rajarshi Roychoudhury < >

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Scoop Gracie
Oh, okay. So Python would not be needed at runtime? On Thu, Feb 27, 2020, 08:20 Rajarshi Roychoudhury wrote: > I just need to write the dictionary I would get in python in a file and > read it using c++. I guess I can use a map to solve my purpose. > > On Thu, Feb 27, 2020, 21:40 Scoop Gracie

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Rajarshi Roychoudhury
I just need to write the dictionary I would get in python in a file and read it using c++. I guess I can use a map to solve my purpose. On Thu, Feb 27, 2020, 21:40 Scoop Gracie wrote: > I believe it must use C++, so nltk won't work. > > On Wed, Feb 26, 2020, 23:17 Rajarshi Roychoudhury < >

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Scoop Gracie
I believe it must use C++, so nltk won't work. On Wed, Feb 26, 2020, 23:17 Rajarshi Roychoudhury wrote: > Formally i present my idea in this form: > From my understanding of RBMT , > > The RBMT system contains: > >- a *SL morphological analyser* - analyses a source language word and >

Re: [Apertium-stuff] GSoC 2020

2020-02-27 Thread Shubham Dikshit
Hi Hèctor, After looking all the work which has been done I felt that it is still lagging and wanted to contribute more in bengali language for apertium if I get selected Thanking you Shubham Dikshit On Thu, 27 Feb 2020, 13:37 Hèctor Alòs i Font, wrote: > Hi Shubham, > > There has been done

Re: [Apertium-stuff] GSoC 2020

2020-02-27 Thread Hèctor Alòs i Font
Hi Shubham, There has been done some work for Bangla in Apertium, although seemingly not very much. In any case, it is worth taking a look. For instance: https://github.com/apertium/apertium-ben https://github.com/apertium/apertium-bn-hi

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Rajarshi Roychoudhury
Formally i present my idea in this form: >From my understanding of RBMT , The RBMT system contains: - a *SL morphological analyser* - analyses a source language word and provides the morphological information; - a *SL parser* - is a syntax analyser which analyses source language

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Scoop Gracie
It is absolutely fine to use languages you are most comfortable with. On Wed, Feb 26, 2020, 22:18 Rajarshi Roychoudhury wrote: > I need to study more about RBMT to develop an idea of how to preserve > sentiment while translating, which I think can increase the efficiency of > translation. It

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Rajarshi Roychoudhury
I need to study more about RBMT to develop an idea of how to preserve sentiment while translating, which I think can increase the efficiency of translation. It will also help my research , thank you so much for suggesting it. Also, will it be okay if I work on languages I am comfortable with? Say

[Apertium-stuff] GSoC 2020

2020-02-26 Thread Shubham Dikshit
Hi, I am Shubham Dikshit and student of IMS Engineering College and I was looking at the possible GSoC 2020 projects and I found the "Adopt an unreleased language pair" to be something that I would be interested in working on. I looked through the possible language pairs and I thought I would

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Scoop Gracie
I think it is worth looking into, it is just that anything that needs a neural network is not possible. I'm sure sentiment translation is possible in RBMT too. On Wed, Feb 26, 2020, 21:58 Rajarshi Roychoudhury wrote: > Ok,then I wont pursue this idea and will look for one in the idea list . > >

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Rajarshi Roychoudhury
Ok,then I wont pursue this idea and will look for one in the idea list . On Thu, 27 Feb 2020 at 11:10, Scoop Gracie wrote: > The main problem is that I don't believe there is a way to send > information down the pipeline without breaking stuff. > > On Wed, Feb 26, 2020, 21:37 Rajarshi

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Scoop Gracie
The main problem is that I don't believe there is a way to send information down the pipeline without breaking stuff. On Wed, Feb 26, 2020, 21:37 Rajarshi Roychoudhury wrote: > Thank you so much for the feedback,i will try to think of any other way of > doing this without using neural networks

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Rajarshi Roychoudhury
Thank you so much for the feedback,i will try to think of any other way of doing this without using neural networks or propose a new project http://wiki.apertium.org/wiki/Apertium_for_Dummies#The_units_of_translation is an excellent starting point for beginners, however it would be very helpful if

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Scoop Gracie
I'm not an expert in this, but given the non-neural nature of Apertium, this does not seem feasible to me, at least in the way you described. On Wed, Feb 26, 2020, 21:02 Rajarshi Roychoudhury wrote: > Hi, > I am Rajarshi Roychoudhury,a second year undergraduate student at Jadavpur >

[Apertium-stuff] GSOC 2020 idea

2020-02-26 Thread Rajarshi Roychoudhury
Hi, I am Rajarshi Roychoudhury,a second year undergraduate student at Jadavpur University,Kolkata,India.I have done many projects in Natural Language Processing,mainly focussing on sentiment analysis and machine translation. Most of the machine translation have no explicit preservation on the

[Apertium-stuff] GSoC 2020

2020-02-26 Thread TEJASVI CHEBROLU
Hello, My name is Tejasvi Chebrolu (chebrolutejasvi on Apertium Freenode). I am a student of Computational Linguistics at the International Institute of Information Technology, Hyderabad. I was looking at the possible GSoC 2020 projects and I found the "Adopt an unreleased language pair" to be

Re: [Apertium-stuff] GSoC 2020

2020-02-26 Thread Saurabh Rai
m Saurabh (srbhr), Computer Science Undergraduate, in my Pre-Final (3rd) year. I want to work on the Idea *Improving language pairs by mining MediaWiki Content Translation postedits*. ( http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/automatic-postediting ). I have looked onto the

[Apertium-stuff] GSOC 2020 : automatic-postediting

2020-02-26 Thread Saurabh Rai
Hi, I am Saurabh (srbhr), Computer Science Undergraduate, in my Pre-Final (3rd) year. I want to work on the Idea Improving language pairs by mining MediaWiki Content Translation postedits. ( http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code/automatic-postediting ). I have looked onto

Re: [Apertium-stuff] GSoC 2020

2020-02-26 Thread Tomohiro Akazawa
Jonathan, Yes, I would have a look at the resources more thoroughly and see what I can do. Thank you very much for your advice. -- Tomohiro 2020年2月26日(水) 22:50 Jonathan Washington : > Hi Tomohiro, > > Actually, my point was that there is still a lot to be done. The work I > pointed you to is a

Re: [Apertium-stuff] GSoC 2020

2020-02-26 Thread Jonathan Washington
Hi Tomohiro, Actually, my point was that there is still a lot to be done. The work I pointed you to is a proof of concept more than anything, and it has not been integrated into Apertium. If I were you, and interested in participating in GSoC, I would have a look at those resources and try to

Re: [Apertium-stuff] GSoC 2020

2020-02-26 Thread Tomohiro Akazawa
Hi Jonathan, thank you for your feedback. there seem to be enough implementations for Japanese. -- Tomohiro 2020年2月26日(水) 22:26 Jonathan Washington : > Hi Tommi, all, > > A couple years ago, a Swarthmore student implemented an algorithm for > tokenisation of spaceless orthographies using

Re: [Apertium-stuff] GSoC 2020

2020-02-26 Thread Jonathan Washington
Hi Tommi, all, A couple years ago, a Swarthmore student implemented an algorithm for tokenisation of spaceless orthographies using morphological transducers. She used a fork of a prototype Japanese transducer developed by another of my students to evaluate it. The work is available at the

Re: [Apertium-stuff] GSoC 2020

2020-02-26 Thread Tomohiro Akazawa
Thank you for your reply. If "improving the support of Japanese on Apertium" could be a new project on GSoC, I would find the problems of the current version of Apertium and figure out the solutions for them. Thank you. 2020年2月26日(水) 0:47 Tommi A Pirinen : > Hi all, > one thing that might be

Re: [Apertium-stuff] GSoC 2020

2020-02-25 Thread Tommi A Pirinen
Hi all, one thing that might be worth considering ia improving support of Japanese in Apertium, is that we currently do not have any good generic solution for the word-tokenisation, this affects especially languages like Japanese where a space- and punct-based tokenisation is much more suboptimal

Re: [Apertium-stuff] GSoC 2020

2020-02-23 Thread Tomohiro Akazawa
Thank you for your reply. Considering there are many resources for English and Japanese, possibly I should change my plan . Thank you On Sun, 23 Feb 2020, 23:58 Hèctor Alòs i Font, wrote: > Hi Tomohiro, > > Maybe it is not the 2019 version of the application form, but the 2020 one > (if Apertium

Re: [Apertium-stuff] GSoC 2020

2020-02-23 Thread Hèctor Alòs i Font
Hi Tomohiro, Maybe it is not the 2019 version of the application form, but the 2020 one (if Apertium is elected by Google as a partner organisation) should not be very different of this one: http://wiki.apertium.org/wiki/Top_tips_for_GSOC_applications Essentially, for a pair like English and

Re: [Apertium-stuff] GSoC 2020

2020-02-23 Thread NITESH Bhargav
I would like to contribute in this. On Sun, Feb 23, 2020, 4:57 PM Tomohiro Akazawa wrote: > Hello. > My name is Tomohiro and I am a student of the University of Tokyo in Japan. > Seeing the Apertium's idea list for GSoC 2020, I found "Adopt an > unreleased language pair" interesting. > Do

[Apertium-stuff] GSoC 2020

2020-02-23 Thread Tomohiro Akazawa
Hello. My name is Tomohiro and I am a student of the University of Tokyo in Japan. Seeing the Apertium's idea list for GSoC 2020, I found "Adopt an unreleased language pair" interesting. Do you think it is possible to make the language pair between English and Japanese? Thank you very much.

Re: [Apertium-stuff] GSoC 2020 Ideas Page

2020-02-21 Thread Daniel Swanson
Hi Apertiumers, I've removed the ones that I'm aware of being completed last year and I've organized the page into some subsections. Daniel On Fri, Feb 21, 2020 at 9:40 AM Flammie A Pirinen wrote: > On Fri, Feb 21, 2020 at 03:10:40PM +0100, Tino Didriksen wrote: > > Apertium is in GSoC 2020!

Re: [Apertium-stuff] GSoC 2020 Ideas Page

2020-02-21 Thread Flammie A Pirinen
On Fri, Feb 21, 2020 at 03:10:40PM +0100, Tino Didriksen wrote: > Apertium is in GSoC 2020! > > Time to update the > http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code page. > > What projects were actually completed to the mentors' satisfaction last > year? I think at least

Re: [Apertium-stuff] GSoC 2020 Ideas Page

2020-02-21 Thread Sevilay Bayatlı
Congurlation! Regarding weighted transfer rules, we have plan to finish it during next few mouths, it's better don't include it as a task for GSoc, what do you think? Sevilay On Fri, 21 Feb 2020, 17:12 Tino Didriksen, wrote: > Apertium is in GSoC 2020! > > Time to update the >

[Apertium-stuff] GSoC 2020 Ideas Page

2020-02-21 Thread Tino Didriksen
Apertium is in GSoC 2020! Time to update the http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code page. What projects were actually completed to the mentors' satisfaction last year? What new projects do people want to add? -- Tino Didriksen

Re: [Apertium-stuff] Gsoc 2020

2019-12-09 Thread ilnar . salimzianov
Hi Kiran, thank you for you interest in the Apertium project! To be honest, it is a bit early to propose projects for GSoC, since GSoC 2020 hasn't even been announced :) In the meantime, I encourage you to read [1] and [2] (and other documentation on the wiki), and learn more about Apertium

Re: [Apertium-stuff] Gsoc 2020

2019-12-05 Thread kiran srigiri
Can a mentor help me with proposal and other things such that this gets selected On Thu, 5 Dec 2019, 19:44 kiran srigiri, wrote: > I want to propose a new language pair project for Gsoc 2020, translation > from English to Hausa > ___ Apertium-stuff

[Apertium-stuff] Gsoc 2020

2019-12-05 Thread kiran srigiri
I want to propose a new language pair project for Gsoc 2020, translation from English to Hausa ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff