Re: [Apertium-stuff] How should I start contributing to Apertium

2017-12-10 Thread Jonathan Washington
Hi Paridhi, I assume you're looking to get involved in Google Code-In? You can find a slightly out-of-date list of the locations of Indic language resources in Apertium here: http://wiki.apertium.org/wiki/Indic_languages In addition to those resources listed, we have a basic Gujarati transducer

Re: [Apertium-stuff] enquiry for python API for Apertium

2018-05-07 Thread Jonathan Washington
itional feature that you specifically want. > Any updates on this? -- Jonathan > > > Regards > Arghya Bhattacharya > > On Mon, 30 Apr 2018 at 18:26 Jonathan Washington < > jonathan.n.washing...@gmail.com> wrote: > >> Hi Arghya, >> >> Could yo

Re: [Apertium-stuff] beta.apertium.org

2018-06-06 Thread Jonathan Washington
Hi Sushain, Hèctor, I think there's another issue being suggested, so I opened this: https://github.com/apertium/apertium-html-tools/issues/316 -- Jonathan 2018-05-24 2:57 GMT+06:00 Sushain Cherivirala : > Hi Hector, > > For future reference, technical suggestions for >

Re: [Apertium-stuff] enquiry for python API for Apertium

2018-04-30 Thread Jonathan Washington
Hi Arghya, Could you clarify a few things about the survey? - How should I understand the difference between "like" and "enjoy"? - Could you give a hypothetical coding example of each of the lttoolbox models? - Does "UD-compatible style" refer to CoNLL-U format, or does it have to do with tag

Re: [Apertium-stuff] Lttoolbox tasks - use HFST instead?

2018-02-18 Thread Jonathan Washington
Hi Tino, I don't oppose the idea of adding support for compiling dix-format lexicons with HFST and implementing it for existing language modules, but we'd still need to process the transducers with lt-proc because of the tokenisation bug.¹ If the HFST tokenisation bug were to disappear, then

Re: [Apertium-stuff] Possessive Determiners Inflected for Possessor and Possessed

2017-12-26 Thread Jonathan Washington
alysis would be: > tyacha: ^to+ch > > (The tag is because 'tya' can be followed by any postposition, and > there is also a distinct nominative form of the determiner.) > > Again, thanks to everyone for the clarification. And happy holidays! > > > Shardul C. > >

Re: [Apertium-stuff] [Apertium-turkic] turkic.apertium.org

2018-01-24 Thread Jonathan Washington
Hi Mansur, I did see your first message, but didn't get a chance to look at it until now. I'm not sure what the problem was, but I updated the software to the latest version from github and restarted it, and it seems to be working okay now. Don't hesitate to let us know if you encounter further

Re: [Apertium-stuff] Command line for listing Apertium branches and other Git problems

2018-04-09 Thread Jonathan Washington
Hi all, 2018-04-09 19:40 GMT-04:00 Sushain Cherivirala : > Bernard, > [...] > > Interesting. I don't know when it was created > > > It was deployed late last December. The website/server code itself is > identical > to the code that has been on apertium.org for many years.

[Apertium-stuff] apertium-ambiguous for new language pairs

2018-11-18 Thread Jonathan Washington
Hi everyone, Sevilay has done a lot of work on apertium-ambiguous, which allows for weighted ambiguous transfer rules. She's implemented it for the kaz-tur pair, but needs people to test it for other pairs. This part is crucial for her dissertation work. The documentation is here:

Re: [Apertium-stuff] GSoC proposal Turkish-Kazakh pair

2019-04-04 Thread Jonathan Washington
Сәлем Данияр, A couple important note, before I look at your proposal is the following: Evaluation for RBMT isn't to verify that every rule works, but to have a passing "testvoc": http://wiki.apertium.org/wiki/Testvoc . We have a particular strategy for performing testvoc for morphologically

Re: [Apertium-stuff] Apertium Wiki Account for GSoC proposal

2019-04-06 Thread Jonathan Washington
It looks like this user name is either already taken or someone created the account for you and forgot to inform the list. -- Jonathan On Wed, Apr 3, 2019, 01:10 Lokendra Singh wrote: > Can someone help me with creating an account. I am working on Python > API/library for Apertium. > > I would

Re: [Apertium-stuff] beta.apertium.org

2019-03-30 Thread Jonathan Washington
Hi Sushain, I think Hèctor means that in both cases there are two translation pairs for the same two languages on GitHub. His question, then, is which one(s) show up on the site? -- Jonathan сб, 30 мар. 2019 г. в 02:56, Sushain Cherivirala : > Hi Hèctor, > > It should be using whatever is

Re: [Apertium-stuff] How to upload my GSOC proposal in wiki

2019-04-08 Thread Jonathan Washington
ertium.org/wiki/User:Rupjyoti/Proposal > > thanks and regards. > > On Sun, Apr 7, 2019 at 8:23 PM Rupjyoti Baruah Res. Scholar, Dept. of > Comp. Engg. IIT (BHU) wrote: > >> I am not getting where to provide username? My username is Rupjyoti. >> >> regards >&g

Re: [Apertium-stuff] Proposal

2019-03-26 Thread Jonathan Washington
Hi Mohit, Could you give the URL to your application draft on the wiki? That would help us mentors a lot :) -- Jonathan пн, 25 мар. 2019 г. в 16:30, Mohit Raj : > hi all > I have submit my proposal on Morphological analyzer of Magahi > User name - Mohitraj > Please give your valuable feedback

Re: [Apertium-stuff] Account on Apertium wiki

2019-02-28 Thread Jonathan Washington
Done! -- Jonathan чт, 28 февр. 2019 г. в 23:02, Daniel Swanson : > I would also like a wiki account for GSoC. > > Username: popcorndude > > Daniel > > On Thu, Feb 28, 2019 at 12:09 AM ashwath s via Apertium-stuff < > apertium-stuff@lists.sourceforge.net> wrote: > >> thanks >> >> On Thu, 28 Feb,

[Apertium-stuff] phonology across word boundaries for HFST generator?

2019-02-28 Thread Jonathan Washington
Hi all, I have some students trying to trigger alternations across word boundaries like the following: inh japỹ / ijapỹ inh tũ / isũ These alternations consistently triggered with certain common words that end in "nh". They're using lexc/twol for the morphological generator. Our first

Re: [Apertium-stuff] morph segmentation with Apertium

2019-03-06 Thread Jonathan Washington
Yes, Mikel, you're right—I was thinking that too. Adding in that morpheme boundary should be trivial. -- Jonathan ср, 6 мар. 2019 г. в 16:28, Mikel L. Forcada : > Cool! > > Mikel > > P.S. Actually, why not щетка- -да- -ғы (щетка+locative+adjectivizer)? > > El 6/3/19 a les 17:32, Francis Tyers

Re: [Apertium-stuff] morph segmentation with Apertium

2019-03-06 Thread Jonathan Washington
Hi Antonio, I have something mostly working, but have a few questions about what specifically you're after. I guess to start things out, for поезд:поезд>{D}{A}:поезде, do you prefer поез>де or поезд>де, or something else? -- Jonathan ср, 6 мар. 2019 г. в 11:33, Francis Tyers : > El 2019-03-06

Re: [Apertium-stuff] Reusing Apertium linguistic data and licensing

2019-03-16 Thread Jonathan Washington
чт, 14 мар. 2019 г. в 04:01, Tino Didriksen : > On Thu, 14 Mar 2019 at 05:54, Jonathan Washington < > jonathan.n.washing...@gmail.com> wrote: > >> By "agree on CC-BY-SA", I assume you mean "agree to dual-license the code >> as CC-BY-SA in addition

Re: [Apertium-stuff] Reusing Apertium linguistic data and licensing

2019-03-13 Thread Jonathan Washington
ср, 13 мар. 2019 г. в 08:44, Tino Didriksen : > Unfortunately, Apertium licensing is messy. Most things are under GPLv2+, > some things are GPLv3+, and a few are CC-BY-SA. > > - apertium-eng is GPLv3+ > - apertium-cat is GPLv2+ (note 2) > - apertium-eng-cat is GPLv3+ > > So any derivative of

Re: [Apertium-stuff] Fwd: RBMT from Kazakh to Turkish

2019-03-14 Thread Jonathan Washington
Сәлем, Данияр! Қауымымызға қош келдіңіз! Thanks for getting in touch with Ilnar and the rest of the Apertium community about your project idea. Memduh is right that Kazakh-to-Turkish MT is receiving a lot of attention right now in Apertium, and an additional project on it would likely create a

[Apertium-stuff] Fwd: CfP: 2nd Workshop on Technologies for MT of Low Resource Languages (LoResMT 2019 at MT Summit XVII)

2019-03-21 Thread Jonathan Washington
abetically) Alina Karakanta FBK-Fondazione Bruno Kessler Atul Kr. Ojha Panlingua Language Processing LLP/Jawaharlal Nehru University Chao-Hong Liu ADAPT Centre, Dublin City University Jonathan Washington Swarthmore College Nathaniel Oco National University (P

[Apertium-stuff] Assamese-Bengali

2019-02-09 Thread Jonathan Washington
сб, 9 февр. 2019 г. в 09:02, Rupjyoti Baruah Res. Scholar, Dept. of Comp. Engg. IIT (BHU) via Apertium-stuff : > Sir > Hi Rupjyoti, First of all, not everyone on this list is male, so "sir" is probably not an appropriate way to address the list. > I am in trouble in Machine Translation of the

Re: [Apertium-stuff] Assamese-Bengali

2019-02-10 Thread Jonathan Washington
dix analyzer.bin* > > Error: Cannot not open file 'apertium-asm.morph.dix'. > > I am unable to find the file 'apertium-asm.morph.dix'. > How to solve this?I am frustrated .Help me please. > > On Sun, Feb 10, 2019 at 10:13 AM Jonathan Washington < > jonathan.n.washing...@gmai

[Apertium-stuff] GSoC flyer translations

2019-01-29 Thread Jonathan Washington
Hi everyone, As most of you know, Apertium is applying to participate in Google Summer of Code this year. As part of the application effort every year, we translate Google's official GSoC flyer into as many languages as possible. A repository where we're collecting translations is here:

[Apertium-stuff] apertium-separable (was Re: Current GSOC ideas)

2019-01-28 Thread Jonathan Washington
пн, 28 янв. 2019 г. в 15:32, Hèctor Alòs i Font : > Missatge de Francis Tyers del dia dl., 28 de gen. > 2019 a les 21:13: > >> >> There is a lot of functionality that is not used widely that could be >> really >> used to improve performance of language pairs. >> >> * apertium-separable >> *

Re: [Apertium-stuff] GSoC 2019 project discussion - Unsupervised weighting of automata

2019-04-06 Thread Jonathan Washington
nce Date;" > > Thanks for noting that. > > Best Regards, > Amr Keleg > -- > *From:* Jonathan Washington > *Sent:* Saturday, April 6, 2019 3:52:56 PM > *To:* Amr Mohamed Hosny Anwar; Francis Tyers; > tommi.antero.piri...@uni-hambur

Re: [Apertium-stuff] Gsoc Proposal

2019-04-06 Thread Jonathan Washington
Salom, Ogabek, I think if you complete the coding challenge, you'll have a better idea of what work is required and will have more specific questions about what your work plan should look like. You might also consider looking at previously successful workplans for translation modes:

Re: [Apertium-stuff] How to upload my GSOC proposal in wiki

2019-04-07 Thread Jonathan Washington
Could you provide the username you'd like on the wiki? -- Jonathan On Sun, Apr 7, 2019, 04:29 Rupjyoti Baruah Res. Scholar, Dept. of Comp. Engg. IIT (BHU) via Apertium-stuff wrote: > Respected Mentors, > I want to upload my GSOC proposal in wiki list via apertium. As like the > user Mohitraj.

Re: [Apertium-stuff] Apertium wiki Account

2019-04-07 Thread Jonathan Washington
Hi Neerav, It's great that there is a Braj corpus. That *is* a computational research, and would help a lot with the development of other resources. I have one question: did you complete a coding challenge? I don't see that with your application. -- Jonathan On Mon, Mar 25, 2019, 16:35

Re: [Apertium-stuff] GSoC 2019 project discussion - Unsupervised weighting of automata

2019-04-07 Thread Jonathan Washington
e at faculty of engineering, Ain > Shams University. > Additionally, I am giving weekly tutorials to undergraduate students as a > teacher assistant. > > Amr > ------ > *From:* Jonathan Washington > *Sent:* Sunday, April 7, 2019 7:14:37 AM > *To:* Amr Moh

[Apertium-stuff] Fwd: 2nd CfP: LoResMT 2019 (Workshop on Technologies for MT of Low Resource Languages) at MT Summit XVII

2019-04-27 Thread Jonathan Washington
tically) Alina Karakanta FBK-Fondazione Bruno Kessler Atul Kr. Ojha Panlingua Language Processing LLP/Jawaharlal Nehru University Chao-Hong Liu ADAPT Centre, Dublin City University Jonathan Washington Swarthmore College Nathaniel Oco National Uni

Re: [Apertium-stuff] Web Dictionary Propose

2019-04-28 Thread Jonathan Washington
Hi Vinicius, This is impressive, especially the level of integration with git. The main problem is that the purpose of a web dictionary editor is for people who have linguistic knowledge but not knowledge of git or xml or the like to contribute linguistic data—and perhaps also for advanced

Re: [Apertium-stuff] GSOC: Annotatrix

2019-04-24 Thread Jonathan Washington
Hi Alyaxey, We have some thoughts on your proposal. Would you mind joining #apertium or #_u-dep on freenode to discuss them? -- Jonathan пн, 22 апр. 2019 г. в 08:16, Alyaxey Yaskevich : > Hello, colleagues! > > My proposal is about development of Annotatrix project ( >

Re: [Apertium-stuff] GSOC19 proposal

2019-04-10 Thread Jonathan Washington
I'm having trouble finding your work on the coding challenge. Could you point us to it? -- Jonathan вт, 9 апр. 2019 г. в 12:20, Nikolay Aleksandrov : > Yes, added here : http://wiki.apertium.org/wiki/User:Qavan > > пн, 8 апр. 2019 г. в 21:29, Francis Tyers : > >> El 2019-04-08 19:25,

Re: [Apertium-stuff] Portuguese-Catalan (almost) ready for packaging

2019-08-29 Thread Jonathan Washington
m-pt-ca. So I'll add now the > old AUTHORS information to the new package. > > Hèctor > > Missatge de Jonathan Washington del dia > dj., 29 d’ag. 2019 a les 5:59: > >> Hi Hèctor, >> >> Do I understand that apertium-por-cat is not a continuation of the code

Re: [Apertium-stuff] GCI 2019 Mentorship

2019-08-27 Thread Jonathan Washington
Hi Andi, The main requirement on our side for being a mentor is that you've contributed before and you're capable of mentoring some subset of tasks. So there should be no problem :) Of course this is all contingent on Apertium being accepted this year. If we're accepted, we'll reach out and

Re: [Apertium-stuff] Portuguese-Catalan (almost) ready for packaging

2019-08-28 Thread Jonathan Washington
Hi Hèctor, Do I understand that apertium-por-cat is not a continuation of the code from apertium-pt-ca? I would have expected that the code is a continuation (or branch) of the old repo. If this were the case, pushing the new code to the apertium-pt-ca repo and renaming it to apertium-por-cat

Re: [Apertium-stuff] TransducerHasWrongTypeException

2019-09-04 Thread Jonathan Washington
Hi Zanga, Normally we don't use HFST transducers (.hfst) in bilingual modules. Is there a reason you'd like to use it instead of lttoolbox transducers (.bin)? If you do want to use it in place of the lttoolbox transducer, as Fran suggests, Makefile.am is definitely where you'd want to specify

Re: [Apertium-stuff] genv*dix.py need conversion

2019-09-12 Thread Jonathan Washington
Thanks, Kevin. Yes, that's the approach I use. It's not well-documented, but it's pretty straightforward. Do you know anything about the metalrx.py script that Tino mentioned? -- Jonathan On Thu, Sep 12, 2019, 13:47 Kevin Brubeck Unhammer wrote: > Jonathan Washington > čálii: >

Re: [Apertium-stuff] genv*dix.py need conversion

2019-09-12 Thread Jonathan Washington
I'm curious about metalrx.py. Where is it used? What does it do? Is it documented anywhere? I've been using an xslt file I found in a Sámi pair for creating lrx from metalrx. -- Jonathan On Thu, Sep 12, 2019, 02:19 Tino Didriksen wrote: > Maybe I can do it myself, then. 2to3 did most of

Re: [Apertium-stuff] Adding invariant prefixes

2019-09-17 Thread Jonathan Washington
On Tue, Sep 17, 2019, 14:58 Kevin Brubeck Unhammer wrote: > > The upside is that you can combine words without listing everything > twice. If you've only got one prefix, the HFST-like method is probably > better. If you're combining lots, compounding may be worth considering. > We can and do

Re: [Apertium-stuff] Ôdp: [Pd: Implementing apertium-html-tools]

2019-07-21 Thread Jonathan Washington
Shouldn't it work without needing a two-letter code? -- Jonathan On Sun, Jul 21, 2019, 09:58 Grzegorz Kulik wrote: > Hi Sushain, > > thank you for your answer, it pointed me in the right direction. Turns out > Silesian wasn't defined at all in localization.js or in min.js. In 'var > languages'

Re: [Apertium-stuff] Ôdp: Ôdp: [Pd: Implementing apertium-html-tools]

2019-07-21 Thread Jonathan Washington
ode: > > var languages = {'af': 'Afrikaans',... > > and then you link it to the three-letter one > > var iso639Codes = {'afr': 'af',... > > I don't know about the others but I've got a pol-szl pair, so I have to do > it like that. :) > > Greg > > We niydzie

[Apertium-stuff] recursive transfer mode name

2019-07-22 Thread Jonathan Washington
Hi everyone, The recursive transfer GSoC project is progressing nicely, and it's anticipated that a release will occur at the end of GSoC. It's very exciting! We'd like to add debug mode support to Apertium for recursive transfer, via the following file:

[Apertium-stuff] question about -z and null flush behaviour

2019-07-16 Thread Jonathan Washington
Hi all, I understand that -z was added to a number of Apertium and Apertium-related programs, allowing them to continue to accept input and simply flush buffers when a null character is encountered. Question 1. Is it right that Apertium itself does not follow this behaviour? To use an existing

Re: [Apertium-stuff] Diccionario portugués y Acordo Ortográfico

2019-07-16 Thread Jonathan Washington
Oi Hèctor, Nós concordamos que as formas anteriores ao AO devem continuar a ser analisadas pelo analisador. A escolha entre as duas opções depende de como os falantes de Portugal se sentem sobre a ortografia pré-AO. Se muitas pessoas ainda antecipam as formas antigas, faria sentido manter

Re: [Apertium-stuff] Lexical selection rule learning questions

2019-06-13 Thread Jonathan Washington
чт, 13 июн. 2019 г. в 20:54, Francis Tyers : > El 2019-06-13 22:34, Danielle Rossetti Dos Santos escribió: > > Hello, > > > > I'm working with the monolingual transfer rule learning code and have > > a few questions: > > > > 1. I see some language pairs used to have a multi mode (such as in > >

[Apertium-stuff] Special Issue on Machine Translation for Low-Resource Languages (MT Journal)

2019-11-01 Thread Jonathan Washington
EDITORS (Listed alphabetically) • Alina Karakanta (FBK-Fondazione Bruno Kessler) • Audrey N. Tong (NIST) • Chao-Hong Liu (ADAPT Centre/Dublin City University) • Ian Soboroff (NIST) • Jonathan Washington (Swarthmore College) • Oleg Aulov (NIST) • Xiaobing Zhao (Minzu University of China) Machine

Re: [Apertium-stuff] Error with apertium-init

2019-11-02 Thread Jonathan Washington
Hi Kiran, Try python3? -- Jonathan On Sat, Nov 2, 2019, 21:45 kiran srigiri wrote: > $~/Desktop/apertium eng-hau prototype$ python apertium-init.py eng-hau > File "apertium-init.py", line 34 > SyntaxError: Non-ASCII character '\xca' in file apertium-init.py on line > 34, but no encoding

Re: [Apertium-stuff] [Apertium-contact] Utilgjengelige språk

2019-11-03 Thread Jonathan Washington
Hi Nebojsa, First of all, my apologies for not being able to respond in Norwegian. I would definitely have preferred to. One reason other languages would be greyed out is because Apertium has a limited number of released language pairs with English. To use unreleased translation pairs, you can

Re: [Apertium-stuff] Contact

2019-11-06 Thread Jonathan Washington
e.com/archive/2018/task/6465621755691008/ >> >> >> On Thu, 7 Nov 2019, 07:21 Jonathan Washington, >> wrote: >>> >>> Hi Kiran, >>> >>> A number of us on this list have mentored GSoC before. What's the >>> question? Feel free to

Re: [Apertium-stuff] Contact

2019-11-06 Thread Jonathan Washington
Hi Kiran, A number of us on this list have mentored GSoC before. What's the question? Feel free to ask on #apertium as well. -- Jonathan вт, 5 нояб. 2019 г. в 07:39, kiran srigiri : > > Any Google summer of code - Gsoc mentor I can talk with? > ___

Re: [Apertium-stuff] Updating the Apertium 2-page brochure

2019-11-06 Thread Jonathan Washington
> The same with the block diagram. >> >> Mikel >> >> >> El 5/11/19 a les 16:53, Jonathan Washington ha escrit: >> > Hi Mikel, >> > >> > I've been thinking recently that we also should put the Apertium SVG >> > logo i

Re: [Apertium-stuff] Updating the Apertium 2-page brochure

2019-11-05 Thread Jonathan Washington
Hi Mikel, I've been thinking recently that we also should put the Apertium SVG logo in the repos somewhere. I wonder if this sort of thing should be grouped together (maybe in a "promotional materials" repo?) or separately. What do other people familiar with Apertium's GitHub organisation

[Apertium-stuff] 2nd CFP: Special Issue on Machine Translation for Low-Resource Languages (MT Journal)

2019-11-21 Thread Jonathan Washington
deadline: 25 Feb 2020. GUEST EDITORS (Listed alphabetically) • Alina Karakanta (FBK-Fondazione Bruno Kessler) • Audrey N. Tong (NIST) • Chao-Hong Liu (ADAPT Centre/Dublin City University) • Ian Soboroff (NIST) • Jonathan Washington (Swarthmore College) • Oleg Aulov (NIST) • Xiaobing Zhao (Minzu

[Apertium-stuff] call for mentors for Google Code-In

2019-10-24 Thread Jonathan Washington
Greetings, Apertiumers! Our application for Google Code-In is due on Monday (the 28th). If you'd like to be a mentor, please do the following before then: 1. Add your name to tasks that you can mentor at the Task Ideas page: http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in 2. If

Re: [Apertium-stuff] Special Issue on Machine Translation for Low-Resource Languages (MT Journal)

2019-11-20 Thread Jonathan Washington
to submit, then please get in touch. -- Jonathan пт, 1 нояб. 2019 г. в 22:21, Jonathan Washington : > > Hi all, > > Below please find a revised CFP for the Machine Translation Special > Issue on MT for Low-Resource Languages. > > = > CALL FOR PAPERS: Machine Translation

Re: [Apertium-stuff] Special Issue on Machine Translation for Low-Resource Languages (MT Journal)

2019-11-26 Thread Jonathan Washington
. -- Jonathan вт, 26 нояб. 2019 г. в 08:26, Jonathan Washington : > > I can take care of it in an hour or two. Thanks for the reminder, Ilnar, and > for the organisational help, Sevilay! > > -- > Jonathan > > On Tue, Nov 26, 2019, 06:59 Sevilay Bayatlı wrote: >&g

Re: [Apertium-stuff] Special Issue on Machine Translation for Low-Resource Languages (MT Journal)

2019-11-26 Thread Jonathan Washington
to submit a title, list of authors, and > a short description. > > Sevilay > > > On Wed, Nov 20, 2019 at 8:05 PM Jonathan Washington < > jonathan.n.washing...@gmail.com> wrote: > >> Hi all, >> >> This is just a reminder that the expression

Re: [Apertium-stuff] Help

2019-11-14 Thread Jonathan Washington
Hi Kiran, Did you try the modes fix that Tino suggested? What are the contents of your modes.xml file? -- Jonathan On Mon, Nov 11, 2019, 21:45 kiran srigiri wrote: > Not working ! > > -Kiran srigiri > > On Thu, 7 Nov 2019, 22:41 Ngadou Yopa, wrote: > >> Hey Srigiri >> >> Try this `echo

Re: [Apertium-stuff] Kurmanji apertium data upload

2019-12-21 Thread Jonathan Washington
Hi Sakine, If I understand right, you'd like to use a web interface to analyse some Kurmancî data? If so, you can try this: https://beta.apertium.org/?choice=kmr#analyzation If that's not what you're after, could you clarify? -- Jonathan On Thu, Dec 19, 2019, 19:31 Sakine Cabuk Balli wrote:

Re: [Apertium-stuff] GSoC 2020

2020-02-26 Thread Jonathan Washington
Hi Tommi, all, A couple years ago, a Swarthmore student implemented an algorithm for tokenisation of spaceless orthographies using morphological transducers. She used a fork of a prototype Japanese transducer developed by another of my students to evaluate it. The work is available at the

Re: [Apertium-stuff] Apertium elections coming up.

2020-02-26 Thread Jonathan Washington
Thanks for calling the election, Mikel. It was pointed out on IRC during GCI that it was time to run an election again, but everyone was busy with GCI so it didn't receive more attention. +1 to Tino's points. I'll be running for PMC again, so I'll recuse myself from involvement with the

Re: [Apertium-stuff] GSoC 2020

2020-02-26 Thread Jonathan Washington
lementations for Japanese. > > -- > Tomohiro > > 2020年2月26日(水) 22:26 Jonathan Washington : > >> Hi Tommi, all, >> >> A couple years ago, a Swarthmore student implemented an algorithm for >> tokenisation of spaceless orthographies using morphological t

Re: [Apertium-stuff] Apertium elections coming up.

2020-02-26 Thread Jonathan Washington
I just invited you to have write access to the Apertium phenny repo :) This is an important discussion, and one that's come up before, but it won't be resolved in time for the election. So, for now, the easiest work-around is to make sure everyone who thinks and who other contributors think

Re: [Apertium-stuff] GSOC-2020

2020-03-05 Thread Jonathan Washington
Hi Himanshu, Apertium generally isn't looking for "some developer" to write MT systems. The community is usually looking for people who know a language and care about it. Also, there are over 150 million speakers of those languages combined, so I'm not sure how rare it is to find developers who

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-06 Thread Jonathan Washington
And since - Tino's running for president, and - taking a census like this is technically the election committee's responsibility (I believe), The election committee should probably verify Tino's work. -- Jonathan пт, 6 мар. 2020 г. в 07:05, Tino Didriksen : > Combined list of everyone I can

Re: [Apertium-stuff] Need your guidance for GSoC 2020

2020-02-23 Thread Jonathan Washington
w translation pair of Pahari > language with other languages. I believe that my work will be beneficial for > the society and also for the future developments. > Looking forward for precious feedbacks. > > Thanks and regards, > Arzoo > > On Fri, Feb 21, 2020 at 1:48 AM Jon

Re: [Apertium-stuff] GSoC--Apertium Website Development

2020-02-23 Thread Jonathan Washington
Hi Mohit, As with all GSoC projects, the best way to show your interest in the project and your ability to complete it is to start working on it. In this case, this would mean starting by tackling a few open issues in apertium-html-tools (https://github.com/apertium/apertium-html-tools/issues)

Re: [Apertium-stuff] GSOC: Ideas for Google Summer of Code Page Up-to-Date?

2020-02-23 Thread Jonathan Washington
Hi Nadia, The page is still being edited to be up-to-date. The anaphora resolution project was completed last year, and will need to be removed from the ideas page. This year a related project idea might be to integrate anaphora resolution into several Apertium translation pairs. Another

Re: [Apertium-stuff] Apertium elections coming up.

2020-02-26 Thread Jonathan Washington
That's a good start for a discussion about revising committer status in the bylaws, and maybe for use of a definition for the election, but it also raises a few questions: - How do we decide who's a member of the GitHub org? - How do we decide who has write access to an Apertium repo? These are

Re: [Apertium-stuff] Apertium elections coming up.

2020-02-26 Thread Jonathan Washington
> […] > Giving access rights to new Committers. > > Cheers > > Mikel > > > El 26/2/20 a les 17:22, Jonathan Washington ha escrit: > > That's a good start for a discussion about revising committer status in > the bylaws, and maybe for use of a definition for the electio

Re: [Apertium-stuff] Apertium GSoC 2020? Deadline Feb 5th

2020-02-04 Thread Jonathan Washington
Hi all, Thanks to those of you who have offered to help with GSoC. Everyone else interested in mentoring, please let me know. Also note the following draft of information for the application: wiki.apertium.org/wiki/Google_Summer_of_Code/Application_2020 -- Jonathan пн, 3 февр. 2020 г. в

Re: [Apertium-stuff] Apertium GSoC 2020? Deadline Feb 5th

2020-01-28 Thread Jonathan Washington
I've been eyeing the deadline, and it's been on my to-do list to bring it up. But I've also been busy with other things and kind of waiting for the last of the GCI admin stuff to be out of the way. In any case, I'm willing to be an org admin for GSoC—it's a little bit lower pressure than for GCI

Re: [Apertium-stuff] Apertium GSoC 2020? Deadline Feb 5th

2020-02-04 Thread Jonathan Washington
recognise them under a different name or username. -- Jonathan вт, 4 февр. 2020 г. в 15:00, James sandy : > > Hello Jonathan, > > I’m interested to mentor for Gsoc this year with Apertium > > On Tuesday, February 4, 2020, Jonathan Washington > wrote: >> >> Hi all,

Re: [Apertium-stuff] Need your guidance for GSoC 2020

2020-02-20 Thread Jonathan Washington
Other places to check are here: http://wiki.apertium.org/wiki/Contributing and here: http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code , the latter of which needs some updates. -- Jonathan чт, 20 февр. 2020 г. в 14:52, Sevilay Bayatlı : > > Hi Arzoo, > > Welcome to Apertium, you

Re: [Apertium-stuff] Apertium GSoC 2020? Deadline Feb 5th

2020-02-08 Thread Jonathan Washington
at 12:35 PM Tommi A Pirinen > wrote: >> >> On Tue, Feb 04, 2020 at 01:51:38PM -0500, Jonathan Washington wrote: >> >> > Everyone else interested in mentoring, please let me know. >> >> I can co-mentor this year again but with very randomly varying

Re: [Apertium-stuff] Bootstrapping a new language pair – Hanging Problem

2020-01-14 Thread Jonathan Washington
Hi Zanga, I believe you're running into a know issue with the tagger as apertium-init sets it up: https://github.com/apertium/apertium-init/issues/31 The fix is to add -x to the apertium-tagger lines in your bilingual modes.xml. A work-around is to just remove the apertium-tagger blocks

Re: [Apertium-stuff] PMC proposal

2020-04-09 Thread Jonathan Washington
n, it just seems fairly popular. > > On Thu, Apr 9, 2020, 10:45 Jonathan Washington > wrote: >> >> Why Material and not some other standard? >> >> -- >> Jonathan >> >> 8 apr 2020, Ç. tarixində 16:02 tarixində Samuel Sloniker >> yazdı: >&

Re: [Apertium-stuff] PMC proposal

2020-04-09 Thread Jonathan Washington
Why Material and not some other standard? -- Jonathan 8 apr 2020, Ç. tarixində 16:02 tarixində Samuel Sloniker yazdı: > > (for the next PMC) > http://wiki.apertium.org/wiki/PMC_proposals/Use_Material_Design > ___ > Apertium-stuff mailing list >

Re: [Apertium-stuff] Apertium offline translator Android app has gone ... offline.

2020-04-09 Thread Jonathan Washington
7 apr 2020, Ç.a. tarixində 05:44 tarixində Tino Didriksen < m...@tinodidriksen.com> yazdı: > Well, that makes sense from a security point of view. And that is a good > reason to revive the native port, which I talked about on IRC but never > wrote about to the mailing list. > > IRC logs: > >

Re: [Apertium-stuff] Lexical Selection

2020-04-13 Thread Jonathan Washington
I believe the multitrans script in lex-tools ( https://github.com/apertium/apertium-lex-tools) makes it possible to get all versions of the translation by expanding the dictionary and skipping lexical selection. So you'd get two sentences output for this particular example: - The season is more

Re: [Apertium-stuff] Where do I find the dictionaries

2020-04-21 Thread Jonathan Washington
Btw, dpkg -L shows the following on my Debian system: /usr/share/apertium/apertium-swe/apertium-swe.swe.dix It could probably work for what you want. -- Jonathan On Tue, Apr 21, 2020, 12:29 Jonathan Washington < jonathan.n.washing...@gmail.com> wrote: > Hi Per, > > To add to w

Re: [Apertium-stuff] Bylaws Overhaul Proposal

2020-04-21 Thread Jonathan Washington
On Mon, Apr 20, 2020, 17:57 Samuel Sloniker wrote: > 1. I oppose allowing the Secretary and Treasurer to be the same person. > What's your reasoning for this? -- Jonathan 2. Using additional criteria for tiebreaking could easily turn into a > rigged election. > 3. I believe there should be a

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-21 Thread Jonathan Washington
The main thing I worry about here is lrx rules. Currently a lot of pairs have rules that match e.g. tags="adj", but not necessarily tags="adj.*". So something that's normally hargle might now be hargle, and that means the lrx rule won't match. Since we want this to be backwards-compatible

Re: [Apertium-stuff] Where do I find the dictionaries

2020-04-21 Thread Jonathan Washington
Hi Per, To add to what Daniel said, language data installed from apt is put in system directories as root, and is not good for doing dev work. As a fairly up-to-date Apertium language data developer, I don't know the path of system-installed language data off the top of my head (you can always

Re: [Apertium-stuff] Bylaws Overhaul Proposal

2020-04-21 Thread Jonathan Washington
the US Constitution.  > > On Tue, Apr 21, 2020, 08:56 Jonathan Washington < > jonathan.n.washing...@gmail.com> wrote: > >> >> >> On Mon, Apr 20, 2020, 17:57 Samuel Sloniker >> wrote: >> >>> 1. I oppose allowing the Secretary and T

Re: [Apertium-stuff] PMC proposal

2020-04-10 Thread Jonathan Washington
tarixində Samuel Sloniker yazdı: > > I added some more pros as well as a few cons. > http://wiki.apertium.org/wiki/PMC_proposals/Use_Material_Design#In_detail > > On Thu, Apr 9, 2020 at 10:52 AM Jonathan Washington > wrote: >> >> That's not a good reason to choose so

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-13 Thread Jonathan Washington
um elections coming up. > > Pièce(s) jointes(s) probable(s)> > > > > So far, these people have indicated they want to run for PMC members: > > - Jonathan Washington > > - Francis Tyers > > - Tino Didriksen > > - Scoop Gracie (pseudonym) > > - Tanmai Kh

Re: [Apertium-stuff] Election Results

2020-04-04 Thread Jonathan Washington
> - Sushain K. Cherivirala > - Tino Didriksen > - Mikel L. Forcada > - Xavi Ivars > - Tanmai Khanna* > - Jonathan Washington > > * Due to participation in GSoC, Tanmai Khanna's appointment will be delayed. No one has been selected for GSoC yet (we're in the mentor selection stag

Re: [Apertium-stuff] PMC proposal

2020-04-02 Thread Jonathan Washington
Not everywhere observes the same days as the U.S. Different areas and peoples have their own traditions, just like they have their own languages. https://en.wikipedia.org/wiki/Massacre_of_the_Innocents#Feast_day -- Jonathan On Thu, Apr 2, 2020, 11:59 Scoop Gracie wrote: > Huh? > > On Thu,

Re: [Apertium-stuff] Bylaws Overhaul Proposal

2020-04-22 Thread Jonathan Washington
If you don't like the diff that GitHub offers, you can clone the repos and use whatever diff tool you prefer. The git executable itself has several diff modes too, including a word diff (which sounds like what you're after?). -- Jonathan On Wed, Apr 22, 2020, 01:14 Tino Didriksen wrote: > On

Re: [Apertium-stuff] Secondary Tag Prefixes

2020-05-09 Thread Jonathan Washington
Speaking as a language developer,¹ I prefer concise, textual tags. E.g., I don't think is good—it clogs the stream with verbosity, as Fran points out. On the other hand, I don't mind symbols here and there, like <§agent>. But I don't think this is a good secondary tag, unless we make it very

[Apertium-stuff] Fwd: CFP: Resources and Representations for Under-resourced Languages and Domains

2020-09-05 Thread Jonathan Washington
Apertiumers, See the following information about a workshop of relevance. -- Jonathan = Dear ALL, We are organizing RESOURCEFUL-2020 (RESOURCEs and Representations For Under-resourced Languages and Domains) which will be collocated with the Eighth Swedish Language Technology Conference

Re: [Apertium-stuff] Fixing Phonological Processes

2020-09-15 Thread Jonathan Washington
I ran into this recently too: https://pytwolc.readthedocs.io/en/latest/index.html I haven't looked at who wrote it, or looked closely at the whole thing, but it looks fairly thorough. -- Jonathan On Tue, Sep 15, 2020, 10:57 Francis Tyers wrote: > El 2020-09-14 07:27, Flammie A Pirinen

Re: [Apertium-stuff] Fixing Phonological Processes

2020-08-29 Thread Jonathan Washington
Hi Zanga, Given the highly agglutinative nature of Yao morphology, using dix to model it is probably not a great option. Also, as you and Hèctor have concluded, the morphophonology will be much easier to model using twol. Given the extent to which the morphology involves prefixes, lexc (what we

Re: [Apertium-stuff] Infinite loop in testvoc???

2020-08-15 Thread Jonathan Washington
Gabriel, Could you clarify what you mean by "language barriers"? -- Jonathan On Sat, Aug 15, 2020, 13:40 Medina, Gabriel wrote: > Does this have something to do with the language barriers for all > languages on the online translator? > > On Sat, Aug 15, 2020 at 1:03 PM Marc Riera Irigoyen < >

Re: [Apertium-stuff] let's move the mailing lists to sourcehut

2020-09-23 Thread Jonathan Washington
One other question: Will it be possible to move existing apertium-stuff (and PMC, etc) archives to the new location? Or would we be starting over with those archives? -- Jonathan On Mon, Sep 21, 2020, 10:15 Francis Tyers wrote: > El 2020-09-21 15:07, Tino Didriksen escribió: > > On Mon, 21

[Apertium-stuff] comments sought on Iraqi Türkman ISO code

2020-10-02 Thread Jonathan Washington
Dear colleagues (apologies for cross-posting), Sevilay (CCed) and I have submitted an application to the ISO 639-3 registrar for a new three-letter code for Sevilay's native language, Iraqi Türkman, to be added to the standard: https://iso639-3.sil.org/request/2020-039 The registration authority

  1   2   >