Re: [Apertium-stuff] Requesting Proposal Review

2024-03-31 Thread Daniel Swanson
Hi Ahmed, My biggest question after reading your proposal is how will the suggestion system work? What will people submit? Where will the suggestions be sent or stored? Daniel On Sun, Mar 31, 2024 at 8:55 PM Ahmed Elhossieny < ahmedamrelhossi...@gmail.com> wrote: > Dear Apertium GSoC Mentors,

Re: [Apertium-stuff] Request for Wiki Account

2024-03-28 Thread Daniel Swanson
The user account for Pedromanic (talk) has been created. password sent by email Daniel On Thu, Mar 28, 2024 at 2:29 PM Pedro Manicardi Soares via Apertium-stuff < apertium-stuff@lists.sourceforge.net> wrote: > Dear Apertium admins, > > My name is Pedro Manicardi and I am writing to request the

Re: [Apertium-stuff] Can't connect to Apertium IRC

2023-06-30 Thread Daniel Swanson
Hi Aure, [image: image.png] I just checked, and if I have "Status" selected, as in this picture, I get the behavior you described, but if I then click on the "#apertium" button, everything works fine. Daniel On Fri, Jun 30, 2023 at 8:16 PM Aure Séguier wrote: > Hi > > Since Apertium IRC

Re: [Apertium-stuff] Changes to apertium-preprocess-transfer

2023-06-28 Thread Daniel Swanson
which begins at line > 12933 or 15492. In addition, preprocessing often removes comments from rule > headers, so that even if you have the right line, it is not easy to find the > rule in the source code. I should put this as issues, but I have always been > lazy. > > Hèctor

Re: [Apertium-stuff] Changes to apertium-preprocess-transfer

2023-06-27 Thread Daniel Swanson
On Tue, Jun 27, 2023 at 5:18 AM Kevin Brubeck Unhammer wrote: > > > And I just wanted to let you all know, in case someone was depending > > on those. To compensate, I added a check to apertium-lint which can > > report roughly the same information: > > What's the recommended way of installing

[Apertium-stuff] Changes to apertium-preprocess-transfer

2023-06-26 Thread Daniel Swanson
Greetings Apertiumers! I recently identified a way that apertium-preprocess-transfer was being rather inefficient and today I fixed it, so tomorrow you all should be able to update to apertium 3.9.4 and see some improved compile times for any pairs not using apertium-recursive, with speedups

Re: [Apertium-stuff] Tagset Standardization

2023-06-07 Thread Daniel Swanson
:22 PM Daniel Swanson wrote: > > Yes, most of our tools assume that tags are position independent, but > I've come across a handful of languages that treat some tags as > position dependent, and I was more hoping to make it official to make > it less likely that we

Re: [cg] Re: Problems with WITH

2023-06-05 Thread Daniel Swanson
types. >> NOMAPPED prevents the rule from running on cohorts that have a dependency >> parent. > > -- > You received this message because you are subscribed to the Google Groups > "Constraint Grammar" group. > To unsubscribe from this group and stop receivi

Re: [Apertium-stuff] Style preferences in web interface

2023-05-13 Thread Daniel Swanson
Hi Juan, The only step I can see that you're missing is that the arg-spa makefile doesn't install the preference files. Here are the relevant lines in nno-nob: https://github.com/apertium/apertium-nno-nob/blob/master/Makefile.am#LL270C34-L270C34 Daniel On Sat, May 13, 2023 at 2:06 PM Juan Pablo

Re: [Apertium-stuff] GSoC

2023-05-08 Thread Daniel Swanson
replied off-list On Mon, May 8, 2023 at 4:55 PM Eiji Miyamoto wrote: > > Hello, I was wondering how should I contact with my mentor during GSoC? > > Thank you, > ___ > Apertium-stuff mailing list > Apertium-stuff@lists.sourceforge.net >

Re: [Apertium-stuff] Two changes in behavior when updating to the last version of apertium-all-dev

2023-05-04 Thread Daniel Swanson
Hi Juan,
  $ echo "El Papa desea jubilarse" | apertium -d ./ spa-arg
  LO Papa deseya chubilar-se
  $ echo "Lo Papa deseya chubilar-se" | apertium -d ./ arg-spa
  El Papá desea jubilarse
I'm not reproducing the pronoun issue. As for the strange capitalization coming from the postgenerator, no one has

Re: [Apertium-stuff] Wiki

2023-04-03 Thread Daniel Swanson
Done - you should have received an email with your password. On Mon, Apr 3, 2023 at 9:18 PM Fares M. Basosy wrote: > > Fares_Basousy > thank you! > > On Sun, Apr 2, 2023, 9:23 PM Daniel Swanson > wrote: >> >> Hi Fares, >> >> What would you like as a

Re: [Apertium-stuff] Wiki

2023-04-02 Thread Daniel Swanson
Hi Fares, What would you like as a username? Daniel On Sun, Apr 2, 2023 at 2:50 PM Fares M. Basosy wrote: > > Hi, My Name's Fares i'm a computer Engineering undergraduate, nice to meet > you all > can i get an account on wiki?

Re: [Apertium-stuff] Regarding Rule Visualization Tool in Apertium GSOC 2023

2023-04-01 Thread Daniel Swanson
Hi Enock, I've written comments on your proposal below, but in writing them I had the broader realization that I don't think this project is actually particularly feasible for GSoC. I added it to the ideas list on the spur of the moment 2 years ago and never really considered what it would

Re: [Apertium-stuff] GSoc candidate: Interested in Project: Develop a prototype MT system for a strategic language pair

2023-03-30 Thread Daniel Swanson
Hi Pratham, I'm a bit confused. Your repo appears to actually be a coding challenge for the localization project, not the prototype translation project. Daniel On Thu, Mar 30, 2023 at 11:15 AM pratham bhanushali wrote: > > Hello. > My name is Pratham Bhanushali. I am a third-year Information

Re: [Apertium-stuff] wiki

2023-03-08 Thread Daniel Swanson
The user account for Eiji (talk) has been created. On Wed, Mar 8, 2023 at 10:31 AM Eiji Miyamoto wrote: > > Hello, I would like to have an account on wiki for proposal and could you > make mine? > UserName: Eiji > email:motopo...@gmail.com > > cheers, >

Re: [Apertium-stuff] Tagset Standardization

2023-03-07 Thread Daniel Swanson
a version of lt-proc -g that accepts the tags in any order, which might be helpful for reducing generation errors, though it may turn out to be too much of a slowdown for production. Daniel On Tue, Mar 7, 2023 at 1:58 PM Kevin Brubeck Unhammer wrote: > > Daniel Swanson > čálii: > >
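One way to picture what "tags in any order" means in this thread is to compare two analyses by lemma plus the collection of tags rather than by the tag sequence. The sketch below is only an illustration of that idea, not how lt-proc is implemented; the helper names and the sample analyses are hypothetical.

```python
import re


def split_lu(lu):
    """Split 'lemma<t1><t2>' into ('lemma', ['t1', 't2'])."""
    lemma = lu.split("<", 1)[0]
    tags = re.findall(r"<([^<>]+)>", lu)
    return lemma, tags


def same_analysis(a, b):
    """Treat two analyses as equal if lemma and tags match, ignoring tag order."""
    lemma_a, tags_a = split_lu(a)
    lemma_b, tags_b = split_lu(b)
    return lemma_a == lemma_b and sorted(tags_a) == sorted(tags_b)
```

For example, `same_analysis("casa<n><f><pl>", "casa<n><pl><f>")` holds under this comparison, while a position-dependent comparison of the raw strings would treat the two as different.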

Re: [Apertium-stuff] Tagset Standardization

2023-03-07 Thread Daniel Swanson
On Tue, Mar 7, 2023 at 6:07 AM Kevin Brubeck Unhammer wrote: > > Daniel Swanson > čálii: > > > Greetings Apertiumers! > > > > This morning I set out to change the Ancient Hebrew analyzer from > > Latin script to Hebrew script (a task I don't wish upon any

[Apertium-stuff] Tagset Standardization

2023-03-06 Thread Daniel Swanson
Greetings Apertiumers! This morning I set out to change the Ancient Hebrew analyzer from Latin script to Hebrew script (a task I don't wish upon anyone) and in the process produced a search-and-replace tool that understands the structure of several of our source files:

Re: [Apertium-stuff] GSoC '23 application

2023-03-06 Thread Daniel Swanson
host but I'm facing some problems > with the installation of packages (Error while installing packages using > "pipenv install --dev" ). Maybe because of the python version used in the > project. My python version is 3.11 > How should I proceed! > > Thank you! > Faiz

Re: [Apertium-stuff] GSoC '23 application

2023-03-06 Thread Daniel Swanson
I have created the account and sent the password by email. There are various things listed that we would want such a project to support. Have you tried to get any of them working on the command line? Daniel On Mon, Mar 6, 2023 at 4:15 AM Faiz Alam wrote: > > Hey, > My name is Faiz Alam and I

Re: [Apertium-stuff] GSoC'23

2023-03-05 Thread Daniel Swanson
ince I think there's something > missing from me to be feel that hard? > > On Sun, Mar 5 2023 at 11:41:07 AM -0500, Daniel Swanson > wrote: > > Account created and password sent by email. You can find past proposals here: > https://wiki.apertium.org/wiki/Category:Student

Re: [Apertium-stuff] GSoC'23

2023-03-05 Thread Daniel Swanson
It's my first time to participate in open source and first time to be asked > for a task so it will be great for me to participate with you > > On Sun, Mar 5, 2023, 4:24 PM Daniel Swanson > wrote: >> >> What would you like your username to be? I can make you an account. >

Re: [Apertium-stuff] GSoC'23

2023-03-05 Thread Daniel Swanson
or not? >> >> On Sun, Mar 5, 2023, 2:59 AM Daniel Swanson >> wrote: >>> >>> Put it on github, share the link, and then get started writing your work >>> plan. >>> >>> Do you have a wiki account? >>> >>> Daniel >>

Re: [Apertium-stuff] GSoC'23

2023-03-04 Thread Daniel Swanson
> What should I do next? > > On Wed, Mar 1, 2023, 11:27 PM Daniel Swanson > wrote: >> >> I don't care how you implement it, just that the editor extensions >> (any of >> https://microsoft.github.io/language-server-protocol/implementors/tools/) >> can connec

Re: [Apertium-stuff] GSoC'23

2023-03-01 Thread Daniel Swanson
cific API to be fetched or any general one? > Also, I will write it on Node since it's the technology I'm using. > > Last question, any problem if I used any library like axios or got or you > need just node and nothing else? > > On Wed, Mar 1, 2023, 4:48 PM Daniel Swanson

Re: [Apertium-stuff] GSOC2023

2023-03-01 Thread Daniel Swanson
You're certainly welcome to submit pull requests on the Japanese repository, but due to the tokenization problems, that probably shouldn't be your entire coding challenge, since we also need to see that you can work on that aspect of the project. Daniel On Wed, Mar 1, 2023 at 9:49 AM Eiji

Re: [Apertium-stuff] GSoC'23

2023-03-01 Thread Daniel Swanson
t so I need to know information about it > if it is still available) > > > On Tue, Feb 28 2023 at 11:41:26 PM -0500, Daniel Swanson > wrote: > > I'm not sure what the status of the Annotatrix project is, but I can give you > a coding challenge for the capitalization project

Re: [Apertium-stuff] GSoC'23

2023-02-28 Thread Daniel Swanson
I'm not sure what the status of the Annotatrix project is, but I can give you a coding challenge for the capitalization project, which is to fork a translation pair of your choice and modify the makefile and modes.xml so that capitalization is in the pipeline but doesn't do anything yet. Daniel

Re: [Apertium-stuff] Introduce myself and Asking for a wiki account

2023-02-27 Thread Daniel Swanson
Password sent by email. On Mon, Feb 27, 2023 at 12:59 PM Ahmed Siam wrote: > > Hello > > I am Ahmed Siam > Computer Science student at Ain Shams University, Egypt. > > Native Language: Arabic > Second Language: English > Timezone: UTC+2 > IRC Username: AhmedSiam > GitHub Username: ahmedsiam0 > >

Re: [Apertium-stuff] Fwd: Re : GSOC 2023

2023-02-24 Thread Daniel Swanson

Re: [Apertium-stuff] Fwd: Re : GSOC 2023

2023-02-24 Thread Daniel Swanson
Hi Khushi, Yeah, that sounds like a good project to me. Next steps would be opening a pull request on https://github.com/apertium/apertium-mar-hin and requesting a wiki account to write your workplan. Daniel On Fri, Feb 24, 2023 at 4:09 AM Khushi - <12khushi...@gmail.com> wrote: > > >

Re: [Apertium-stuff] Extra output from lt-proc

2023-02-24 Thread Daniel Swanson
The issue is that %[%+cl1%] and %[%+cl9%] are not mentioned in apertium-yao.yao.twoc, and thus you get non-matching pairs. Daniel On Fri, Feb 24, 2023 at 7:57 AM Zanga Chimombo wrote: > > Gurus, > > I am getting extra output from lt-proc. From the lexc file >

Re: [Apertium-stuff] Applying for GSOC 2023 projects

2023-02-23 Thread Daniel Swanson
Hi Lahari, For translation pairs, Hindi-English has been tried several times without success. I would suggest considering Hindi-Telugu. For other project ideas or places to get started, you can check the wiki page for each idea and do the coding challenge. If an idea is missing a coding

Re: [Apertium-stuff] Suggest some beginner friendly issues

2023-02-01 Thread Daniel Swanson
Hi Kishor, See here: https://github.com/apertium/organisation Daniel On Wed, Feb 1, 2023 at 5:33 AM Kishor Hange wrote: > > Hey,I am beginner, can anyone suggest me beginner friendly issues to work on.

Re: [Apertium-stuff] Fwd: [GSoC Mentors] GSoC 2023 open for org applications January 23 - February 7

2023-01-19 Thread Daniel Swanson
for GSoC. Daniel On Thu, Jan 19, 2023 at 11:33 AM Peter Blackman wrote: > > On 19/01/2023 15:51, Daniel Swanson wrote: > > Greetings Apertiumers! > > > > I've started updating the ideas list: > > https://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code >

Re: [Apertium-stuff] Fwd: [GSoC Mentors] GSoC 2023 open for org applications January 23 - February 7

2023-01-19 Thread Daniel Swanson
Greetings Apertiumers! I've started updating the ideas list: https://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code We're currently at 27 ideas, several of which could use additional people willing to mentor and a few of which could use a better writeup. I'll be working on adding further

Re: [Apertium-stuff] Capitalization Handling

2022-12-29 Thread Daniel Swanson
han > > 27 dek 2022, Ç.a. tarixində 10:33 tarixində Daniel Swanson > yazdı: > > > > Greetings Apertiumers! > > > > For anyone testing this, I've now also added -w/--dictionary-case to > > apertium-{transfer,interchunk,postchunk} which makes the >

Re: [Apertium-stuff] Old Catalan morphological analyser

2022-12-28 Thread Daniel Swanson
Is this what you're looking for? https://sourceforge.net/p/apertium/svn/HEAD/tree/incubator/apertium-oldca-XX/ Daniel On Wed, Dec 28, 2022 at 11:27 AM Mikel L. Forcada wrote: > > Dear Apertiumers, > > I have searched but I haven't been able to recover the Apertium > dictionary for old Catalan,

[Apertium-stuff] apertium-lint: A Linter for Apertium Files

2022-12-27 Thread Daniel Swanson
Greetings Apertiumers! I have yet another announcement for you all. Tino Didriksen and I have at long last vanquished one of the Python package managers (actually a messy amalgamation of 2 of them, but it's best not to dwell on that point) to bring you apertium-lint! To install apertium-lint:

Re: [Apertium-stuff] Capitalization Handling

2022-12-27 Thread Daniel Swanson
i Font del dia ds., 24 de des. > 2022 a les 14:12: >> >> Looks very good, Daniel. Thanks in advance. I'll try to test in the next >> days in the pairs I maintain. >> Merry Christmas/Hanukkah/New Year/*. >> Hèctor >> >> Missatge de Daniel Swanson del dia dv

[Apertium-stuff] Capitalization Handling

2022-12-22 Thread Daniel Swanson
Greetings Apertiumers! I have two updates to report: First, I have rewritten the postgenerator (again), this time as part of apertium-separable (and so not breaking the old one, unlike last time), and in such a way that postgenerator rules can both match on lemma and tags in addition to surface

Re: [Apertium-stuff] Bootstrapping a new language pair: lttoolbox version condition

2022-08-14 Thread Daniel Swanson
The issue is that by default apertium-init assumes that both modules have a post-generator, but Hindi doesn't. Everything should be correct if you re-initialize the directory but add the option --no-pgen2. Daniel On Sun, Aug 14, 2022 at 8:41 AM Anil Singh wrote: > I did that. Bootstrapping the

[Apertium-stuff] New Paradigm Generation Tool

2022-06-30 Thread Daniel Swanson
Greetings Apertiumers! I have just created a new program in lttoolbox which generates paradigms. It is called lt-paradigm and it is similar to lt-expand and hfst-expand except that it takes a pattern as input and only lists paths matching that pattern. For example, here it is generating all

Re: [Apertium-stuff] Apertium PMC Election: Census & Candidates

2022-04-19 Thread Daniel Swanson
I'd be willing to do election admin again. Daniel On Tue, Apr 19, 2022 at 9:06 AM Tanmai Khanna wrote: > Hi, > I'd like to apply to be part of the PMC again. It's been great to see the > management side of things with Apertium and would really love to see > through our plan to create a formal

[Apertium-stuff] Shared Dictionary Components (For Even Faster Compilation)

2022-03-30 Thread Daniel Swanson
Greetings Apertiumers! It occurred to me recently that one of the slowest parts of dictionaries to compile (regular expressions) tends to overlap the parts that are most often similar across different languages (email addresses, dates, numbers), so I added a tool for merging binary files and have

Re: [Apertium-stuff] translation error

2022-03-03 Thread Daniel Swanson
Silvia Lombardini via Apertium-stuff < apertium-stuff@lists.sourceforge.net> wrote: > Here we have "partorisca"! > > > Il giovedì 3 marzo 2022, 15:06:54 CET, Daniel Swanson < > awesomeevildu...@gmail.com> ha scritto: > > > Hi Silvia, > > Could you

Re: [Apertium-stuff] translation error

2022-03-03 Thread Daniel Swanson
Hi Silvia, Could you provide a sentence where this happens so we can more effectively diagnose the problem? Daniel On Thu, Mar 3, 2022 at 6:42 AM Silvia Lombardini via Apertium-stuff < apertium-stuff@lists.sourceforge.net> wrote: > I often noticed that "for" is translated in Italian with

Re: [Apertium-stuff] Automatically change first-person to third-person

2022-02-14 Thread Daniel Swanson
The link in that earlier email is dead, so I can't see what the original script was doing, but based on the name it might have just been replacing with , in which case, if you still have that script, you could just edit it to replace with . Daniel On Mon, Feb 14, 2022 at 6:20 AM Per Tunedal

Re: [Apertium-stuff] TM not working for English-Santali pair

2022-01-02 Thread Daniel Swanson
You need to tell it to look for files in the right directory with -d . This worked for me:
  $ echo "Welcome to Wikipedia. Hello" | apertium -d . -m eng-sat.tmx -o eng-sat eng-sat
  ᱣᱤᱠᱤᱯᱤᱰᱤᱭᱟ ᱨᱮ ᱥᱟᱹᱜᱩᱱ ᱫᱟᱨᱟᱢ. *Hello
On Sun, Jan 2, 2022 at 6:16 AM Prasanta Hembram < prasantahembram...@gmail.com>

Re: [Apertium-stuff] Left elisions: recognition and blanks

2022-01-02 Thread Daniel Swanson
Would putting the first element in or the second element in rather than work? Then one of the elements is distinct in combination from what it is on its own, making it postblank or preblank will insert an extra space after or before, respectively. Daniel On Sun, Jan 2, 2022 at 5:01 AM Hèctor

Re: [Apertium-stuff] Left elisions: post-generation

2022-01-02 Thread Daniel Swanson
Unhammer and I have made efforts in this direction at https://github.com/apertium/lttoolbox/issues/42, but it looks like we never finished getting it working. A possible, but probably slightly crazy, approach would be to use HFST to insert before every final vowel in the monodix. Daniel On

Re: [Apertium-stuff] Ôdp: Questions about lexical selection

2021-12-21 Thread Daniel Swanson
; there. > > Thank you! > Greg > > We wtorek, 21 gru 2021 ô godzinie 09:25, Hèctor Alòs i Font > (hectora...@gmail.com) pisze: > > > > Missatge de Daniel Swanson del dia dt., 21 de > des. 2021 a les 7:57: > > Hi Greg, > > The file where you want to write rules

Re: [Apertium-stuff] Questions about lexical selection

2021-12-20 Thread Daniel Swanson
Hi Greg, The file where you want to write rules for this is https://github.com/apertium/apertium-pol/blob/master/apertium-pol.pol.rlx If you want something like "tacy is before ", you could get that with SELECT DET IF (0 DET) (0 NOUN) (1 NOUN) ; Daniel On Mon, Dec 20, 2021 at 1:40 PM

Re: [Apertium-stuff] Thoughts on UDPipe, Apertium modules and translation system for Interslavic

2021-12-16 Thread Daniel Swanson
, but I should be able to manage that pretty soon (the hardest part of any project is starting). Daniel On Thu, Dec 16, 2021 at 7:17 PM Daniel Swanson wrote: > > Greetings Apertiumers! > > Figuring out how to incorporate UD parsers into Apertium pipelines is > something that's been

Re: [Apertium-stuff] Thoughts on UDPipe, Apertium modules and translation system for Interslavic

2021-12-16 Thread Daniel Swanson
Greetings Apertiumers! Figuring out how to incorporate UD parsers into Apertium pipelines is something that's been on my todo list for a while, but with the unfortunate property that it keeps getting sidelined by projects that have deadlines. With regards to your specific issue, here are the

Re: [Apertium-stuff] Question about regtest

2021-12-10 Thread Daniel Swanson
Hi Xavi, [/option] is a delimiter separating multiple golds, but it was simpler for some of the code to treat it as a terminator, which would be the reason for that particular diff. Daniel On Fri, Dec 10, 2021 at 2:03 PM Xavi Ivars wrote: > > Hi Daniel, Tino, > > I've been running
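The difference between treating `[/option]` as a delimiter and as a terminator can be sketched as follows. This is an illustrative toy, not apertium-regtest's actual code; the function names are hypothetical. It shows why reading a file written one way and saving it the other way yields a diff even though the list of golds is unchanged.

```python
MARK = "[/option]"


def parse_golds(text):
    """Accept the marker either between golds or after each gold."""
    return [gold for gold in text.split(MARK) if gold]


def write_as_delimiter(golds):
    """Marker only between golds: a[/option]b"""
    return MARK.join(golds)


def write_as_terminator(golds):
    """Marker after every gold: a[/option]b[/option]"""
    return "".join(gold + MARK for gold in golds)
```

Both serializations parse back to the same golds, so the extra trailing marker changes the file's text without changing its meaning.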

Re: [Apertium-stuff] Need help setting up the Apertium code base on macOS

2021-12-01 Thread Daniel Swanson
I should also mention that if you still run into problems, you'll probably get faster help by joining the IRC chat: https://wiki.apertium.org/wiki/IRC On Thu, Dec 2, 2021 at 1:02 AM Daniel Swanson wrote: > > Hi Faiz, > > As the message at the top of that page says, you likely a

Re: [Apertium-stuff] Need help setting up the Apertium code base on macOS

2021-12-01 Thread Daniel Swanson
Hi Faiz, As the message at the top of that page says, you likely actually want this page: https://wiki.apertium.org/wiki/Apertium_on_Mac_OS_X Based on that, it looks like you want the following:
  # install a couple more dependencies
  sudo port install icu cmake boost gperftools
  # install Apertium

Re: [Apertium-stuff] Problem with letter case in interchunk (or after)

2021-10-09 Thread Daniel Swanson
e seen. We > cannot read any tag of the header (I don't know why). So, I don't see how to > put a condition in interchunk for the postchunk, except, maybe, changing the > name of the chunk. > > Missatge de Daniel Swanson del dia ds., 9 d’oct. > 2021 a les 22:32: >

Re: [Apertium-stuff] Problem with letter case in interchunk (or after)

2021-10-09 Thread Daniel Swanson
You could set the case of the chunk pseudolemma or else append a tag to it and then condition on that in postchunk. On Sat, Oct 9, 2021 at 3:28 PM Hèctor Alòs i Font wrote: > > In the interchunk stage I am adding a word, which happens to be very often at > the beginning of a sentence. So I face

Re: [Apertium-stuff] Mentors

2021-09-27 Thread Daniel Swanson
Hi Rohit, The easiest way is to join IRC (https://wiki.apertium.org/wiki/IRC) and start asking questions. Who to talk to will depend largely on what you're interested in doing. Daniel On Mon, Sep 27, 2021 at 2:59 PM Rohit Rout wrote: > > Can someone plz tell how to talk to mentors and which

Re: [Apertium-stuff] Regression Testing Now Operational

2021-07-30 Thread Daniel Swanson
> Cool, I've been working with some language pairs I know, I have few > questions on cli usage and stuff: > > Commonly when I test things I get like: > > > Corpus 1 of 5: deu-fin-pending > > 11/27 (40.74%) tests pass (11/11 (100.0%) match gold) > > so I start up cli and see: > > deu-fin 1 of 1 >

Re: [Apertium-stuff] Regression Testing Now Operational

2021-07-27 Thread Daniel Swanson
On Tue, Jul 27, 2021 at 8:47 AM Flammie A Pirinen wrote: > > Am Fri, Jul 23, 2021 at 08:14:01PM -0500 schrieb Daniel Swanson: > > > At the beginning of last week, 47 languages and pairs had a meaningful > > 'make test', and several of those were failing. As of today, 365 repo

[Apertium-stuff] Regression Testing Now Operational

2021-07-23 Thread Daniel Swanson
Greetings Apertiumers! At the beginning of last week, 47 languages and pairs had a meaningful 'make test', and several of those were failing. As of today, 365 repos have 'make test' and virtually all are passing. These tests can be run with apertium-regtest, which can be installed from the

Re: [Apertium-stuff] Regression Testing System Proposal

2021-07-09 Thread Daniel Swanson
: popcorndude) On Tue, Mar 2, 2021 at 6:54 PM Daniel Swanson wrote: > > Greetings Apertiumers! > > TL;DR: I'm designing a regression testing framework and hope to use it > to improve testing across all of Apertium if no one objects. > > I have recently been designing a regress

Re: [Apertium-stuff] Usage of mfn tag in the Hindi dictionary

2021-05-05 Thread Daniel Swanson
The forms in question are used in the kok-hin bidix, so that would need to be updated too. I've been thinking about how to write a script to update all uses of a tag and I think next week or the week after I might have time to actually finish that, which sounds like it might be of use here.

Re: [Apertium-stuff] std::exception in es-ca

2021-04-15 Thread Daniel Swanson
In the command that you copied, note the $1 and $2. Try removing $2 and changing $1 to -g On Thu, Apr 15, 2021 at 5:55 AM Egea Piñeiro Helena wrote: > > No, please, for some reasons I need to still use this versions. I understand > there is no support but just in case those bugs had happened to

Re: [Apertium-stuff] GSoC Website Improvements - Proposal Draft

2021-04-03 Thread Daniel Swanson
Some things that stick out to me: Am I correct in reading week 3 as adding a Romanian translation of the interface? On weeks 5, 6, and 7, could you clarify what you're referring to? On weeks 8 and 9, what would we gain by rewriting the interface in a different framework? And, of course,

Re: [Apertium-stuff] Bitrotted releases redux

2021-03-30 Thread Daniel Swanson
-es-gl has its own dictionaries, but depends on -spa's .rlx file. On Tue, Mar 30, 2021 at 11:59 AM Xavi Ivars wrote: > > > > Missatge de Tino Didriksen del dia dl., 29 de març > 2021 a les 22:00: >> >> https://github.com/apertium/organisation/issues/23 is still open, and these >>

Re: [Apertium-stuff] Morphological analyser for Apertium

2021-03-22 Thread Daniel Swanson
Hi Gourab, My recommendation would be to evaluate the current status of -ben and -bn-en in terms of corpus coverage and WER and then incorporate into your proposal what those numbers are now and how much you think you can improve them. A pull request to one of the repositories involved would also

Re: [Apertium-stuff] apertium-stuff

2021-03-16 Thread Daniel Swanson
Hi Arun, We currently work pretty much entirely with text rather than audio and video. Have you taken a look at https://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code ? Daniel On Tue, Mar 16, 2021 at 11:12 AM hi wrote: > > Hi I am Arun.V; > > I have 3 years of

[Apertium-stuff] Regression Testing System Proposal

2021-03-02 Thread Daniel Swanson
Greetings Apertiumers! TL;DR: I'm designing a regression testing framework and hope to use it to improve testing across all of Apertium if no one objects. I have recently been designing a regression testing framework for Apertium language modules and translation pairs so that we can have a

Re: [Apertium-stuff] Begin of sentence

2021-02-17 Thread Daniel Swanson
Neither LRX nor transfer currently have a way of matching the beginning of a text. On Wed, Feb 17, 2021 at 10:23 AM Hèctor Alòs i Font wrote: > Is there any form to match a "begin of sentence" in lexical selection or > in transfer? In transfer, usually the point of the previous sentence is >

Re: [Apertium-stuff] Need help for changing the default pipeline in spa-arg mode

2021-01-04 Thread Daniel Swanson
Looking at what apertium-init generates, it looks like configure.ac is fine (covered by monolingual) so you just need to change Makefile.am. You can find the relevant changes by searching https://github.com/apertium/apertium-init/blob/master/bilingual-module/Makefile.am for rlx Looks like you

Re: [Apertium-stuff] Getting Started

2020-12-09 Thread Daniel Swanson
Hi Shubham, Are you familiar with any languages besides English? https://wiki.apertium.org/wiki/Apertium_New_Language_Pair_HOWTO and https://wiki.apertium.org/wiki/Contributing_to_an_existing_pair have information on contributing to translation pairs. You can also join us on IRC for quicker

Re: [Apertium-stuff] An easy tool to report bad translations and propose alternatives

2020-12-05 Thread Daniel Swanson
One thing that occurs to me is to have the form check the edit distance between the output and the suggestion and if they're too far apart notify the user, because it seems rather unlikely that the translator would get every single word wrong. That might cut down on useless results a little bit
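A minimal sketch of the check suggested above, assuming a word-level Levenshtein distance and an arbitrary threshold of 0.5. The function names and the threshold are hypothetical and not part of any existing Apertium tool.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or lists of words)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        cur = [i]
        for j, y in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (x != y)))  # substitution
        prev = cur
    return prev[-1]


def suggestion_too_different(mt_output, suggestion, threshold=0.5):
    """True if more than `threshold` of the words would have to change.

    The 0.5 default is an assumption chosen for illustration only.
    """
    out_words = mt_output.split()
    sug_words = suggestion.split()
    longest = max(len(out_words), len(sug_words), 1)
    return edit_distance(out_words, sug_words) / longest > threshold
```

A form could call `suggestion_too_different` before accepting a submission and ask the user to confirm when it returns True, rather than rejecting the suggestion outright.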

Re: [Apertium-stuff] Setting up a new language pair

2020-11-25 Thread Daniel Swanson
Hi Christian, One of the following pages is probably what you're looking for: https://wiki.apertium.org/wiki/Apertium_New_Language_Pair_HOWTO https://wiki.apertium.org/wiki/How_to_bootstrap_a_new_pair Daniel On Wed, Nov 25, 2020 at 3:35 PM Christian Chiarcos via Apertium-stuff <

Re: [Apertium-stuff] let's move the mailing lists to sourcehut

2020-09-23 Thread Daniel Swanson
https://sourceforge.net/p/forge/documentation/Mailing%20List%20Archives/ It looks like at least exporting them is possible. On Wed, Sep 23, 2020 at 9:40 AM Francis Tyers wrote: > El 2020-09-23 15:03, Jonathan Washington escribió: > > One other question: > > > > Will it be possible to move existing

Re: [Apertium-stuff] Fixing Phonological Processes

2020-09-11 Thread Daniel Swanson
Hi Zanga, The best introduction to writing twol rules that I'm aware of is at https://wikis.swarthmore.edu/ling073/Morphological_generator The #apertium and #hfst IRC channels are also good places. Daniel On Fri, Sep 11, 2020 at 8:19 AM Zanga Chimombo wrote: > Hello again, > > I've had a bit

Re: [Apertium-stuff] Fixing Phonological Processes

2020-08-31 Thread Daniel Swanson
Hi Héctor, The Wamesa analyser was written by me as a class project. I originally wrote it in lexc and then converted to lexd for comparison and then left it that way because one of the initial design goals of lexd was to get rid of everything that annoyed me about lexc. Daniel On Mon, Aug 31,

Re: [Apertium-stuff] How useful is eliminating trimming for language developers?

2020-05-25 Thread Daniel Swanson
Hi Apertiumers, > Wasn't there a "separable"-based solution that looked good though? Besides trimming and not trimming, I would like to suggest a third alternative. As of yesterday, apertium-separable can read and merge multiple source files. I suggest moving MWEs from monodixes to -separable

Re: [Apertium-stuff] Secondary Tag Prefixes

2020-05-10 Thread Daniel Swanson
On Sun, May 10, 2020 at 6:15 AM Flammie A Pirinen wrote: > > I don't personally find apertium stream format readable, if I need to > make sense of it I will anyways have to preprocess a lot, enough that > I'd say apertium stream format need visualisation scripts to be > readable. It's not very

Re: [Apertium-stuff] List of verbs

2020-04-24 Thread Daniel Swanson
e same thing except it has () around the lemma so it can refer to it later and .* to match whatever tags there may be. \1 then replaces the line with the contents of the first (), i.e. the lemma. On Fri, Apr 24, 2020 at 10:19 AM Daniel Swanson wrote: > If you do want them mixed together, repl

Re: [Apertium-stuff] List of verbs

2020-04-24 Thread Daniel Swanson
If you do want them mixed together, replacing with (|||) should do it. On Fri, Apr 24, 2020 at 10:02 AM Alberto Navalón Lillo < alberto.navalonli...@gmail.com> wrote: > For me this works: > lt-expand apertium-swe.swe.dix | grep -E "[^<:>]+:[^<:>]+" | sed > -E 's/[^<:>]+:([^<:>]+).*/\1/g' | sort

Re: [Apertium-stuff] How do I get a list of lemmas for nouns

2020-04-22 Thread Daniel Swanson
Hi Per, If I understand correctly, this might give what you want:
  lt-expand apertium-swe.swe.dix | grep -E "[^<:>]+:[^<:>]+" | sed -E 's/[^<:>]+:([^<:>]+).*/\1/g' | uniq
lt-expand lists all the forms, grep finds all the ones where the first tag is , sed gets rid of everything but the lemma, and
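The same filtering can be sketched in Python for readers who prefer to post-process lt-expand output in a script. This assumes each line has the shape `surface:lemma<tag1><tag2>...`; the tag name `n` is an assumption, since the archived message lost its tags, and the function name and sample lines are hypothetical.

```python
import re


def lemmas_with_first_tag(lines, tag="n"):
    """Collect lemmas whose first tag is `tag`, in order of first appearance.

    Assumed input shape: surface:lemma<tag1><tag2>...
    Unlike `uniq`, this drops all repeats, not only adjacent ones.
    """
    pattern = re.compile(r"^[^<:>]+:([^<:>]+)<" + re.escape(tag) + r">")
    seen = set()
    result = []
    for line in lines:
        match = pattern.match(line.strip())
        if match and match.group(1) not in seen:
            seen.add(match.group(1))
            result.append(match.group(1))
    return result
```

To use it on real output, the lines could be read from standard input, for instance by passing `sys.stdin` as `lines` in a small wrapper script fed by `lt-expand`.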

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-21 Thread Daniel Swanson
the generator*, so that we can use the >>> benefits of the source analysis and then before translation, we discard it >>> and use the source surface form. >>> >>> Currently the source surface form is discarded at the tagger. This is >>> where the

Re: [Apertium-stuff] Where do I find the dictionaries

2020-04-20 Thread Daniel Swanson
You can get the Swedish monodix from https://github.com/apertium/apertium-swe or by running 'apertium-get swe' On Mon, Apr 20, 2020 at 2:51 PM Per Tunedal wrote: > Hi, > I'm a bit rusty, not having used Apertium for a long time. > > I would like to get a dictionary containing Swedish lemmas,

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-20 Thread Daniel Swanson
> Another way of putting this is that it looks like a technical solution > in search of a problem, rather than a problem description in search > of a solution. To me the most obvious thing to do with it is to put markup information in secondary tags as a way of solving the superblank reordering

Re: [Apertium-stuff] Election Results

2020-04-04 Thread Daniel Swanson
> Committer, why was my vote counted? >>> >> >> Maybe I miss something. You were a candidate, weren't you? You were in >> the census and you got a voter code, didn't you? Of course, your vote was >> counted, and the Election Board even accepted the amendme

Re: [Apertium-stuff] Election Results

2020-04-04 Thread Daniel Swanson
f I was not eligible to run due to not being a > Committer, why was my vote counted? > > On Sat, Apr 4, 2020 at 11:55 AM Daniel Swanson > wrote: > >> Hi Apertiumers! >> >> The election proceedings are now complete and the votes have been tallied >> as follows:

[Apertium-stuff] Election Results

2020-04-04 Thread Daniel Swanson
Hi Apertiumers! The election proceedings are now complete and the votes have been tallied as follows: Votes: 41 For president : - Tino Didriksen 9 - Francis Tyers 30 For members : - Sushain K. Cherivirala 18 - Tino Didriksen 28 - Mikel L. Forcada 29 - Scoop Gracie (pseudonym) 4 - Xavi Ivars 20 -

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-28 Thread Daniel Swanson
I think you could reasonably consider it consistent, just with primary information having an empty prefix, which makes sense, given that it is primary. On Sat, Mar 28, 2020 at 6:00 PM Scoop Gracie wrote: > Oh, okay, that makes sense. I was also thinking it might make it easier > for humans to

Re: [Apertium-stuff] PMC election: Proclamation of the candidates.

2020-03-25 Thread Daniel Swanson
Barring technical difficulties, I plan to send them out tomorrow morning. On Wed, Mar 25, 2020 at 1:56 PM Scoop Gracie wrote: > > Not sure when the ballots are supposed to go out, but I don't have one yet. > > On Wed, Mar 25, 2020, 10:08 Hèctor Alòs i Font wrote: >> >> There's nothing to the

Re: [Apertium-stuff] Lexd: a transducer compiler for prefixes and stuff

2020-03-22 Thread Daniel Swanson
On Mon, Feb 3, 2020 at 8:28 AM Kevin Brubeck Unhammer wrote: > > Daniel Swanson > čálii: > > > https://github.com/mr-martian/lexd > > That's really interesting! I see it depends on lttoolbox, would it make > sense to include it in lttoolbox perhaps? > > Can

Re: [Apertium-stuff] Registration for wiki account

2020-03-21 Thread Daniel Swanson
A randomly generated password for Srj31 has been sent to sourabhr...@gmail.com. It can be changed on the change password page upon logging in. On Sat, Mar 21, 2020 at 12:58 AM Sourabh Raj wrote: > Hi, > I am working on Improving the Bengali-Hindi language pair and wish to > submit a proposal

Re: [Apertium-stuff] Get an account on wiki

2020-03-20 Thread Daniel Swanson
A randomly generated password for Weizhe has been sent to gavinwzma...@gmail.com. It can be changed on the change password page upon logging in. On Fri, Mar 20, 2020 at 10:02 PM 杨伟哲 wrote: > Hi list, > > My proposal is about to be completed, now I want to get an account on > wiki to > submit

Re: [Apertium-stuff] [GSoC 2020] Requesting a Wiki Account

2020-03-18 Thread Daniel Swanson
A randomly generated password for Chinmayk has been sent to chinmay.kunki...@gmail.com. It can be changed on the change password page upon logging in. On Wed, Mar 18, 2020 at 2:16 PM Chinmay Kunkikar wrote: > I am interested in working on - Adopt an unreleased language pair > [mar-kon] - and

Re: [Apertium-stuff] Apertium PMC elections

2020-03-17 Thread Daniel Swanson
>> for PMC members: >>> >>> - Tino Didriksen >>> - Scoop Gracie (pseudonym) >>> - Xavi Ivars >>> - Tanmai Khanna >>> - Mikel L. Forcada >>> - Francis Tyers >>> - Jonathan Washington >>> >>> These are standing fo

Re: [Apertium-stuff] GSOC 2020 | Requesting a Wiki Account

2020-03-16 Thread Daniel Swanson
> On Mon, Mar 16, 2020 at 11:34 PM Daniel Swanson < > awesomeevildu...@gmail.com> wrote: > >> Hi Katherine, >> >> What would you like as your username? >> >> Two things to note about your proposed project: >> 1. We do pretty much everything on Github
