Re: [Apertium-stuff] Apertium Python Module Names

2019-10-13 Thread Flammie A Pirinen
On Sun, Oct 13, 2019 at 12:27:10PM +0200, Tino Didriksen wrote: > https://www.debian.org/doc/packaging-manuals/python-policy/ch-module_packages.html#s-package_names > > The package python3-apertium must provide the Python module apertium, but > it provides apertium_core. I can fix this by either

Re: [Apertium-stuff] Willingness to participate in the project

2020-03-03 Thread Flammie A Pirinen
Hi, I am on holiday this week with low internet availability, so only a few quick points. Firstly, I strongly recommend joining the #apertium IRC channel; I think even non-mentors will have useful clues. For the tokenisation problem I think the main resource is to understand various Unicode technical

Re: [Apertium-stuff] Lexd: a transducer compiler for prefixes and stuff

2020-02-05 Thread Flammie A Pirinen
On Tue, Feb 04, 2020 at 12:55:55PM -0500, Daniel Swanson wrote: > > Do you have plans on doing tests > > on runtime efficiency, i.e. how fast it is to run the automata on texts? > > One thing that we found with flag diacritics on lexc is that it's > > kind of possible to abuse them to optimise

Re: [Apertium-stuff] GSoC 2020 Ideas Page

2020-02-21 Thread Flammie A Pirinen
On Fri, Feb 21, 2020 at 03:10:40PM +0100, Tino Didriksen wrote: > Apertium is in GSoC 2020! > > Time to update the > http://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code page. > > What projects were actually completed to the mentors' satisfaction last > year? I think at least

Re: [Apertium-stuff] Secondary Tag Prefixes

2020-05-10 Thread Flammie A Pirinen
On Fri, May 08, 2020 at 04:50:45PM +0200, Tino Didriksen wrote: > For khannatanmai's GSoC project, secondary tags will be implemented in a > backwards compatible manner. That is in itself indisputable. But, there is > a question of how the initial batch of secondary tags should look. > > I feel

Re: [Apertium-stuff] Registration for wiki page

2020-03-20 Thread Flammie A Pirinen
On Fri, Mar 20, 2020 at 11:34:22PM +0530, Ayush wrote: > Dear sir/ma’am, > This is to inform that I have successfully completed and submitted my > solution to coding task as assigned under the robust tokenisation. > Link for my solution to challenging task – >

Re: [Apertium-stuff] Registration for wiki page

2020-03-23 Thread Flammie A Pirinen
On Mon, Mar 23, 2020 at 04:46:06PM +0530, Ayush wrote: > Dear sir, > Actually I have quite reached nowhere while going through the lttoolbox. Can > you please help me with making of schedule for the proposal and also what all > things I would be working under for the task of robust tokenisation.

Re: [Apertium-stuff] Fixing Phonological Processes

2020-09-14 Thread Flammie A Pirinen
On Fri, Sep 11, 2020 at 03:18:44PM +0200, Zanga Chimombo wrote: > Hello again, > > I've had a bit of time to continue looking at this. I've copied over > something from: > https://github.com/apertium/apertium-lin/blob/master/apertium-lin.lin.twol > > %{K%}:k <=> :n :0 _ .#. ; > > But it's not

Re: [Apertium-stuff] How useful is eliminating trimming for language developers?

2020-05-25 Thread Flammie A Pirinen
On Mon, May 25, 2020 at 03:10:28PM +0530, Tanmai Khanna wrote: > *Disadvantages:* > 1. The monodix has some erroneous analyses - wrong surface forms, wrong > analyses, or even MWEs that aren't really MWEs and can be translated word > by word. These are currently removed since bidixes are more

Re: [Apertium-stuff] let's move the mailing lists to sourcehut

2020-09-21 Thread Flammie A Pirinen
On Sun, Sep 20, 2020 at 07:44:25PM -0700, Samuel Sloniker wrote: > I've also thought about the possibility of a forum? Forums can be nice, but I think they have a different function than a mailing list. > Discourse looks nice. > Is this discourse.mozilla.org? I really dislike that, last I was forced to

Re: [Apertium-stuff] let's move the mailing lists to sourcehut

2020-09-20 Thread Flammie A Pirinen
On Sun, Sep 20, 2020 at 06:04:35PM +0100, Francis Tyers wrote: > Sourcehut is a free/open-source "forge" type thing run by Drew DeVault. They > have > mailing lists. > [...] > What do people think? Excellent idea. One suggestion I have is to make sure they get archived as well as usable through

Re: [Apertium-stuff] An easy tool to report bad translations and propose alternatives

2020-12-05 Thread Flammie A Pirinen
On Sat, Dec 05, 2020 at 12:28:14PM +0300, Hèctor Alòs i Font wrote: > A Sardinian collaborator commented to me: "Wouldn't it be possible that > every time there are more possible translations these come out in a little > window where the user chooses the right solution, as in spell checkers"? >

[Apertium-stuff] Proper noun classification considered harmful

2021-02-01 Thread Flammie A Pirinen
Hi all, I've written a handful of apertium-fin-* prototypes and I usually end up spending way too much time with all the useless subclasses of proper nouns we have (cogs, ants, als, tops, orgs, and to top all that, sometimes ms and fs for some extra (mis)gendering). Could we just get rid of those

Re: [Apertium-stuff] Proper noun classification considered harmful

2021-02-12 Thread Flammie A Pirinen
I think I've come up with a solution that is minimally intrusive for existing work-flows and usages, and that is, allowing optionalising select tags for generation, i.e.: Analysing: echo London | lt-proc eng.automorf.bin ^London/London/London$ (I didn't even plan this it just happened to be

Re: [Apertium-stuff] Begin of sentence

2021-02-17 Thread Flammie A Pirinen
On Wed, Feb 17, 2021 at 07:22:36PM +0300, Hèctor Alòs i Font wrote: > Is there any form to match a "begin of sentence" in lexical selection or in > transfer? In transfer, usually the point of the previous sentence is used, > but I want to match even the beginning of the first sentence of the

Re: [Apertium-stuff] Proper noun classification considered harmful

2021-02-06 Thread Flammie A Pirinen
Thank you all for a lively discussion. I'll summarise here and reply to a few of the comments in a typical inline reply format. I think as a tl;dr we agree to some extent that these rich np annotation tags are specific to language pairs and steps in the pipeline and should not be hindering unrelated

Re: [Apertium-stuff] Regression Testing Now Operational

2021-07-30 Thread Flammie A Pirinen
[for some reason these all went to gmail spam...] On Tue, Jul 27, 2021 at 08:54:05AM -0500, Daniel Swanson wrote: > On Tue, Jul 27, 2021 at 8:47 AM Flammie A Pirinen wrote: > > > > perhaps this is a different version of python or libraries? I have > > python 3.9.1 on linu

Re: [Apertium-stuff] Regression Testing Now Operational

2021-07-30 Thread Flammie A Pirinen
On Fri, Jul 30, 2021 at 08:03:41AM -0500, Daniel Swanson wrote: > > > > Another question is that in a lot of expected files there seems to be > > all-capsed words for fin-* pairs, I am not sure how this has happened? > > I am guessing my apertium is older and some ICU changes have affected > > the

Re: [Apertium-stuff] Regression Testing Now Operational

2021-07-27 Thread Flammie A Pirinen
On Fri, Jul 23, 2021 at 08:14:01PM -0500, Daniel Swanson wrote: > At the beginning of last week, 47 languages and pairs had a meaningful > 'make test', and several of those were failing. As of today, 365 repos > have 'make test' and virtually all are passing. That's exciting, here's some first

Re: [Apertium-stuff] Tagset Standardization

2023-03-07 Thread Flammie A Pirinen
On Mon, Mar 06, 2023 at 03:35:45PM -0500, Daniel Swanson wrote: > This script could, without too much trouble, be expanded to cover the > rest of our source files, at which point I would like to propose that > we move towards greater standardization of our tagset: >