Re: [Apertium-stuff] Anaphora Resolution and Long Distance Agreement Resolution

2019-03-29 Thread Tanmai Khanna
nish/Catalan, currently they are not > ("Los cantantes que canta canta bien"). > > 3. Let's accept that we will deal only with the 3rd person. It is too > complicated to resolve: > * I'm tall > (gender?) > * You are tall > (gender? number?) > > 4. I cannot see

[Apertium-stuff] Anaphora Resolution and Long Distance Agreement Resolution

2019-03-29 Thread Tanmai Khanna
m.org/wiki/User:Khannatanmai If anyone has any comments, suggestions, criticism, ideas, I would really appreciate if you let me know as it'll help me make a stronger proposal and a better tool for Apertium during GSoC 2019. Thanks and Regards, Tanmai Khanna IRC: khannatanmai -- *Khan

Re: [Apertium-stuff] Anaphora Resolution and Long Distance Agreement Resolution

2019-03-31 Thread Tanmai Khanna
't want to be too ambitious either. I will also include analysis in the paper as to why the method of Saliency Scores will be the best method given the current situation. Thanks a lot Hector for your comments as they helped me realise the importance of a problem I was putting under the

[Apertium-stuff] Creation of an account on wiki

2019-03-03 Thread Tanmai Khanna
Hi, I'd also like an account on the wiki to start my proposal for GSoC. I'll be working on Anaphora Resolution. Thanks and Regards, Tanmai Khanna IRC Nick: khannatanmai ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https

[Apertium-stuff] GSoC Project: Anaphora Resolution

2019-03-04 Thread Tanmai Khanna
as I would really like to contribute to Apertium this summer. Please advise. Thanks and Regards, Tanmai Khanna IRC Id: khannatanmai -- *Khanna, Tanmai* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net

Re: [Apertium-stuff] Creation of an account

2019-03-04 Thread Tanmai Khanna
Hi, I also need an account on wiki, with the username: khannatanmai. Thanks! Tanmai Khanna On Mon, Mar 4, 2019 at 9:51 PM Ilnar Salimzianov wrote: > Hi Sandy, > > please tell which username you'd like to have on the wiki. > > Best, > > selimcan > > On 3/4/19 10:31 AM

[Apertium-stuff] GSoC 2019 Proposal First Draft

2019-03-16 Thread Tanmai Khanna
Hi, Here's the first draft of the proposal for the Anaphora Resolution project. If any of the mentors have feedback on it I would really appreciate it as I can make the required changes and submit the final one. Proposal: http://wiki.apertium.org/wiki/User:Khannatanmai Thanks and Regards, Tanmai

[Apertium-stuff] Anaphora Resolution for multiple language pairs

2019-08-02 Thread Tanmai Khanna
work on adapting the tool for the language pair :) Thanks and Regards, Tanmai Khanna -- *Khanna, Tanmai* apertium-eng-spa.spa-eng.arx Description: Binary data ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourc

Re: [Apertium-stuff] Anaphora Resolution for multiple language pairs

2019-08-03 Thread Tanmai Khanna
litic) in order to elect between the Catalan pronouns "el" and "ho". > > Hèctor > > Missatge de Tanmai Khanna del dia dv., 2 d’ag. > 2019 a les 17:48: > >> Hi all, >> The Anaphora Resolution module is currently being tested on the >> S

Re: [Apertium-stuff] Anaphora Resolution in Nightly

2019-08-12 Thread Tanmai Khanna
Hi, I've sent a PR: https://github.com/apertium/apertium/pull/55 with the changes required to make the ref tag clip-able(?). Really glad that the module works well! :) Thanks, Tanmai Khanna On Tue, Aug 13, 2019 at 1:36 AM Hèctor Alòs i Font wrote: > Thanks, Tino and Tanmai. Great work

[Apertium-stuff] Anaphora Resolution for Apertium

2019-08-25 Thread Tanmai Khanna
: khannatanmai), or even open issues on the repo. Thanks again, Tanmai Khanna -- *Khanna, Tanmai* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] Null-subject resolution

2019-11-04 Thread Tanmai Khanna
Hey Hèctor, I'm in the process of implementing conditional anaphora resolution where you can define multiple anaphors. We can define verbs as a type of anaphor and give the rules the appropriate scores to guess a reference(subject in this case). This should work decently because Anaphora

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-04 Thread Tanmai Khanna
tlı wrote: > >  > I help to run the election. > > Sevilay > >> On Thu, 5 Mar 2020, 06:15 Tanmai Khanna, wrote: >> I volunteer to help conduct the election. >> >> Tanmai >> >> Sent from my iPhone >> >>>> On 05-Mar-2020, at 02

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-04 Thread Tanmai Khanna
I volunteer to help conduct the election. Tanmai Sent from my iPhone > On 05-Mar-2020, at 02:40, Diogo wrote: > >  > I must say the fact that the second list was generated by me, but I'm not > eligible to vote according to those lists is quite ironic > > A quarta, 4/03/2020, 22:01, Tino

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Tanmai Khanna
Hi, I have a few questions about this: 1. How would you analyse the sentiment of the source text? Considering the language pairs that Apertium deals with are low resource languages. 2. As Tino mentions, is there a problem of sentiment loss in Apertium? Any examples of this? 3. Doesn't the

Re: [Apertium-stuff] GSOC 2020 idea

2020-02-27 Thread Tanmai Khanna
terface, the available languages for >> translation are beyond my knowledge. I am not sure if I am right, but >> Hindi/Bengali is probably not one of the languages to which an English word >> can be translated to. Correct me if I am wrong >> >> >> >

Re: [Apertium-stuff] Idiom translation module

2020-01-16 Thread Tanmai Khanna
"Is there anything this wouldn't cover?" Idioms may be contiguous but need not be frozen. They can take arguments and can be modified based on TAM and GNP. But that can be handled by separable if I'm not wrong. So yeah, seems like we have this handled. As constructions get larger and more

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-05 Thread Tanmai Khanna
Since it seems like there are enough people to help conduct the election, I’ll just put in my name for the candidacy formally. Thanks Tanmai Sent from my iPhone > On 05-Mar-2020, at 23:28, Scoop Gracie wrote: > >  > We now have three volunteers for the board and four candidates. I suppose

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-20 Thread Tanmai Khanna
Hey Francis, I agree that it does seem like a solution searching for a problem if we look at it in isolation. But it's important to look at this in the context of eliminating trimming. Chronologically, this project was first about and still is, about eliminating dictionary trimming. Modification

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-20 Thread Tanmai Khanna
in this project, I believe these documents provide enough as a proof of concept for this modification. I invite your comments, complaints and feedback regarding this modification. Thanks and Regards, Tanmai Khanna On Mon, Mar 30, 2020 at 12:21 AM Tanmai Khanna wrote: > Just to clarify, in this origi

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-20 Thread Tanmai Khanna
. >> > >> > To me the most obvious thing to do with it is to put markup >> > information in secondary tags as a way of solving the superblank >> > reordering problem. >> > >> >> Didn't we have a solution for this that was worked on over a

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-04-21 Thread Tanmai Khanna
gt; tag??). >> >> I guess this sort of worry is the sort of thing you're keeping track of >> so that it can be worked on? >> >> -- >> Jonathan >> >> On Mon, Apr 20, 2020, 14:52 Tanmai Khanna >> wrote: >> >>> In a nutshell, by using th

Re: [Apertium-stuff] Fwd: Announcing a multilingual COVID-19 health and safety poster-maker

2020-04-20 Thread Tanmai Khanna
Translated it to Hindi. Tanmai On Mon, Apr 20, 2020 at 12:54 PM Mikel L. Forcada wrote: > Dear Apertiumers: > > Translation Commons is seeking help to get a COVID-19 health poster > translated into as many languages as possible. > > If you would like to help, please check the information Fran

Re: [Apertium-stuff] Apertium elections coming up.

2020-03-13 Thread Tanmai Khanna
r, these people have indicated they want to run for PMC members: >> > - Jonathan Washington >> > - Francis Tyers >> > - Tino Didriksen >> > - Scoop Gracie (pseudonym) >> > - Tanmai Khanna >> > - Xavi Ivars >> > ...with Mikel L. Forcada as a

Re: [Apertium-stuff] GSOC 2020 idea

2020-03-27 Thread Tanmai Khanna
Hey I have one doubt, The examples given for mistranslation, I didn't quite understand how sentiment analysis would fix those. Also what about languages for which a SentiWordNet doesn't exist? Thanks and Regards, Tanmai On Fri, Mar 27, 2020 at 3:56 PM Rajarshi Roychoudhury <

Re: [Apertium-stuff] Lexical Selection

2020-04-03 Thread Tanmai Khanna
To show the translated text with multiple options due to polysemy would require Apertium to preserve all the polysemous forms of a word in the Lexical Unit. While this isn't possible as of now, we're working on a project to try and extend the Apertium stream such that we can include an arbitrary

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-28 Thread Tanmai Khanna
29, 2020 at 2:29 AM Scoop Gracie wrote: > Or = > > On Sat, Mar 28, 2020, 13:58 Scoop Gracie wrote: > >> That sounds like a great idea to me. Maybe could even become ? >> >> On Sat, Mar 28, 2020, 13:51 Tanmai Khanna >> wrote: >> >>> Hey guys,

[Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-28 Thread Tanmai Khanna
pros, cons, suggestions - to the idea, to the syntax, anything. Thanks and Regards, Tanmai Khanna -- *Khanna, Tanmai* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
this will allow each program to access a lot more information and open up possibilities that we haven't even thought of yet. At the very least, it will help us to eliminate trimming. Thanks and Regards, Tanmai Khanna On Sun, Mar 29, 2020 at 10:39 AM Hèctor Alòs i Font wrote: > Hi Tanmai,

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
and suggestions, I will make a robust proposal and it will be formalised before it is implemented. Thanks and Regards, Tanmai Khanna On Sun, Mar 29, 2020 at 3:52 PM Mikel L. Forcada wrote: > Folks: > > The elders in Apertium will not be surprised if I voiced my opposition to > changin

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
I apologise, it seems like the link got removed when the message sent. Here it is: http://wiki.apertium.org/wiki/User:Khannatanmai/GSoC2020Proposal_Trimming Thanks Tanmai On Sun, Mar 29, 2020 at 3:11 PM Tanmai Khanna wrote: > Hey guys, > Here's a draft proposal for this project. Any co

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
Hey guys, Here's a draft proposal http://wiki.apertium.org/wiki/User:Khannatanmai/GSoC2020Proposal_Trimming> for this project. Any comments will be appreciated :) Thanks, Tanmai On Sun, Mar 29, 2020 at 12:52 PM Tanmai Khanna wrote: > Hi Hèctor, > A fundamental motivation for this

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
imming and modifying the stream clear. It would be great to hear from Sergio, as these discussions are how we're going to make this project a success for Apertium. Thanks and Regards, Tanmai Khanna El 29/3/20 a les 13:31, Tanmai Khanna ha escrit: > > Mikel, > This is a p

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
in the pipe. Tanmai On Sun, Mar 29, 2020 at 8:53 PM Tanmai Khanna wrote: > Hi Mikel, > >> (0) No change should be made without proper regression testing. I think >> we all agree on that! >> > Definitely, and this is something I'll add in the proposal. > >> (1) I

Re: [Apertium-stuff] Modifying the apertium stream format to include arbitrary information

2020-03-29 Thread Tanmai Khanna
compatible. :) Tanmai On Sun, Mar 29, 2020 at 11:59 PM Tanmai Khanna wrote: > Instead of looking at this as modifying or extending the apertium stream > format, we could look at this as making tags more versatile by creating a > new kind of tags which have a feature:value pair. T

Re: [Apertium-stuff] PMC election: Proclamation of the candidates.

2020-03-25 Thread Tanmai Khanna
;. > Nobody has found anything to amend. > > The following people have indicated they want to run for PMC members: > > - Sushain K. Cherivirala > - Tino Didriksen > - Mikel L. Forcada > - Scoop Gracie (pseudonym) > - Xavi Ivars > - Tanmai Khanna > - Francis

Re: [Apertium-stuff] How do I get a list of lemmas for nouns

2020-04-23 Thread Tanmai Khanna
Hi, How about you try this: lt-expand apertium-swe.swe.dix | grep -E "[^<:>]+:[^<:>]+" | sed -E 's/[^<:>]+:([^<:>]+).*/\1/g' | sed -E 's/\p{No}//g' | uniq Just a small addition to Daniel's earlier command, to delete all superscripts before removing duplicates. Hopefully you don't need

Re: [Apertium-stuff] Bylaws Overhaul Proposal

2020-04-29 Thread Tanmai Khanna
If the main issue is an abuse of power and having adequate checks and balances, doesn't the assembly of committers hold the power to overturn PMC decisions? Which would include removals I guess. Tanmai On Wed, Apr 29, 2020 at 10:26 PM Samuel Sloniker wrote: > Okay. Maybe at least have a group

Re: [Apertium-stuff] Bylaws Overhaul Proposal

2020-04-30 Thread Tanmai Khanna
I have to ask, what kind of abuse of power do you have in mind? On Fri, May 1, 2020 at 8:15 AM Samuel Sloniker wrote: > That way, we don't have to complicate things now. > > On Thu, Apr 30, 2020 at 7:44 PM Samuel Sloniker > wrote: > >> Another possibility is to allow the Assembly to amend the

Re: [Apertium-stuff] Secondary Tag Prefixes

2020-05-08 Thread Tanmai Khanna
st the community to bombard this project with their skepticism, their doubts, their suggestions, their criticism, and I promise there will be thorough discussion and we will achieve an acceptable compromise on all fronts. What I would hate for this project is to be finished and then never reaching r

Re: [Apertium-stuff] Apertium PMC elections

2020-03-18 Thread Tanmai Khanna
t;>>> Oh, okay. There's a similar scenario with GCI; I want to be a student >>>>>> again this year. Could we keep a vacancy on the PMC from the start of GCI >>>>>> until it ended? >>>>>> >>>>>> On Tue, Mar 17, 20

[Apertium-stuff] Working around monodix trimming

2020-03-21 Thread Tanmai Khanna
any significant pros, cons, or suggestions to add for this project, you're requested to reply to this thread so that if I work on this project, I can do it fully informed. Thanks and Regards, Tanmai Khanna -- *Khanna, Tanmai* ___ Apertium-stuff

Re: [Apertium-stuff] Working around monodix trimming

2020-03-22 Thread Tanmai Khanna
t; dictionary and use an algorithm such as OSTIA (1) to learn morphological >> analyses for word endings. >> >> Mikel >> >> (1) Oncina, J., Garcia, P., Vidal, E., IEEE Trans Patt Recog Mach Intell >> 15:5 (1993)448-458. >> >> >> >> >

Re: [Apertium-stuff] Working around monodix trimming

2020-03-21 Thread Tanmai Khanna
dictionaries. A "guessing" dictionary would try to > assign a morphological analysis to an unknown word by looking at the > morphology of known words in the dictionary... > > This would be easy if one could, e.g. match suffixes to morphology in a > suffixing language. > &

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Hey, It solved them Franco-Japanese issue as well :D Can you check the diff once and see if there's any more issues Hèctor? (After updating apertium). *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 2:35 PM Tanmai Khanna wrote: > Hi Hèctor, > I'm dealing with the issues I s

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Hèctor, No worries I'll look into this. Can you send the input sentences? I want to see the transfer rules that are applying to the erroneous parts. They might need some changing. तन्मय खन्ना Tanmai Khanna From: Hèctor Alòs i Font Sent: Sunday, August 30, 2020

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
Also, I agree with Tino, if the punctuation is important for the context then it should probably be analysed as a token. *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 3:16 PM Tanmai Khanna wrote: > Hèctor what I mean is, if you don't want a space in the output of rules > yo

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
uld remove the . This is because if the input blank is an empty string then that isn't counted as a blank. Does that work? *तन्मय खन्ना * *Tan

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
le output. So it might not be possible to strictly control the position of the blanks in the output as was done earlier, but that was pretty much the intention of the change. *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apert

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
es will go away. I'm still evaluating some other issues. *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 1:21 PM Hèctor Alòs i Font wrote: > > > Missatge de Tanmai Khanna del dia dg., 30 d’ag. > 2020 a les 9:49: > >> My guess is, the transfer rule for Franco-Japan

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
g apertium-separable you can combine them into one LU. Finally, the space between l and ér shouldn't appear in the rule output and it is because of an issue that's still being fixed. But it'll be fine soon :) *तन्मय खन्ना * *Tanmai Khanna* On Thu, Sep 3, 2020 at 11:46 PM Hèctor Alòs i Font wr

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
Hèctor can you check the page on beta now? The hyphen and the superscript issues are solved. Of course, there's now a space between l and ér. If that's a big problem we can discuss other solutions. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Sep 3, 2020 at 8:09 PM Tino Didriksen wrote: > I h

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-04 Thread Tanmai Khanna
rules now, but that is by design. Hèctor, let me know if this solved your issue :) Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* On Fri, Sep 4, 2020 at 12:15 PM Hèctor Alòs i Font wrote: > Missatge de Tanmai Khanna del dia dv., 4 de > set. 2020 a les 9:22: > >> Hèctor,

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-04 Thread Tanmai Khanna
ble, we can discuss other possible solutions :) *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

[Apertium-stuff] Need help to decide language pairs for examples of markup handling

2020-09-09 Thread Tanmai Khanna
l be great as well. Any help will be sincerely appreciated :)) Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] Need help to decide language pairs for examples of markup handling

2020-09-12 Thread Tanmai Khanna
Thanks a lot for your suggestions guys! :)) *तन्मय खन्ना * *Tanmai Khanna* On Thu, Sep 10, 2020 at 7:57 AM Samuel Sloniker wrote: > > > On Wed, Sep 9, 2020, 02:34 Tanmai Khanna wrote: > >> (Accepting snazzier title suggestions). >> > Maybe

Re: [Apertium-stuff] Update about superblanks in transfer

2020-09-04 Thread Tanmai Khanna
, such as macros, or within .. blocks. Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 5:59 PM Tanmai Khanna wrote: > Are the changes being implemented going to alter the behavior of the >> punctuation marks that are not analyzed as tokens? >> > >

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
damentally, as of now *it's not possible to have markup on part of an LU.* It's possible though if you keep I & ér as separate LUs. Also Hèctor, is the space after hyphen issue still there? Looks fine to me. *तन्मय खन्ना * *Tanmai Khanna* On Wed, Sep 2, 2020 at 4:55 PM Tino Didriksen wr

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
Oh I see the hyphen thing. That should've been fixed after the latest commit. Will check it out. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Sep 3, 2020 at 12:34 PM Tanmai Khanna wrote: > Hey, > As of now, the analyser sees wordbound blanks as normal blanks, and so > when t

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-09-03 Thread Tanmai Khanna
vidual markups on the final MWE. If this is done then both "idag" and "i dag" will be recognised and the italics will apply on the entire word. If this isn't acceptable or too much of an inconvenience, then I can modify the analyser. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Sep 3, 20

[Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-26 Thread Tanmai Khanna
nd Regards, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-26 Thread Tanmai Khanna
a rule is matched, but that is a decision that can be taken in the future. For now, blanks in transfer rules work for any superblanks that still exist in the stream. Hope that answered your question :)) *तन्मय खन्ना * *Tanmai Khanna* On Wed, Aug 26, 2020 at 11:43 PM Kevin Brubeck Unhammer wrote

[Apertium-stuff] Update about superblanks in transfer

2020-08-29 Thread Tanmai Khanna
, and postchunk. If you have any questions, suggestions, comments, etc., I'll be happy to respond to them. Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net

Re: [Apertium-stuff] GSoC 2020 Code Collections - need info

2020-08-22 Thread Tanmai Khanna
My contributions are documented here <https://wiki.apertium.org/wiki/User:Khannatanmai/Wordbound_blanks#Features> . *तन्मय खन्ना * *Tanmai Khanna* On Sat, Aug 22, 2020 at 5:52 PM Tino Didriksen wrote: > As for previous years, I will run a code collection for GSoC changes. > &

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
ser denotes. If X > Y, then we can print the first Y blanks and then flush the remaining. After this the option will become useless. Does that sound good? *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 2:28 PM Francis Tyers wrote: > El 2020-08-27 09:54, Kevin Brubeck Unhammer

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
between their output LUs. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 2:31 PM Tanmai Khanna wrote: > So what I'll try to do, is after the blanks are collected, lets say X is > the number of source LUs in the pattern and Y is the number of output LUs. > If X = Y then we can

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
Again, remember these aren't wordbound blanks or block tags, just superblanks, like , or other tags that aren't hard breaks or wordbound. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 2:33 PM Tanmai Khanna wrote: > Or, if we want to give the user complete control of the bla

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-26 Thread Tanmai Khanna
I always thought that's the default behaviour. That if some blanks aren't explicitly printed in the transfer rules then they're flushed. I'll check it out, but it should be that. *तन्मय खन्ना * *Tanmai Khanna* On Thu, Aug 27, 2020 at 1:27 AM Kevin Brubeck Unhammer wrote: > Tanmai Kha

Re: [Apertium-stuff] We now have markup handling and reordering in Apertium!

2020-08-27 Thread Tanmai Khanna
* *Tanmai Khanna* On Thu, Aug 27, 2020 at 1:20 PM Kevin Brubeck Unhammer wrote: > Tanmai Khanna > čálii: > > > I always thought that's the default behaviour. That if some blanks aren't > > explicitly printed in the transfer rules then they're flushed. I'll

Re: [Apertium-stuff] Update about superblanks in transfer

2020-08-30 Thread Tanmai Khanna
her options. *तन्मय खन्ना * *Tanmai Khanna* On Sun, Aug 30, 2020 at 12:09 PM Tanmai Khanna wrote: > Hèctor, > No worries I'll look into this. Can you send the input sentences? I want > to see the transfer rules that are applying to the erroneous parts. They > might need some changing.

[Apertium-stuff] How useful is eliminating trimming for language developers?

2020-05-25 Thread Tanmai Khanna
ther possible solution that was discussed was to continue trimming multiwords since they present a unique disadvantage, but to eliminate trimming for words without spaces. *Any other possible solution can be discussed as well. Hopefully after an informed discussion we can come to an acceptable co

Re: [Apertium-stuff] How useful is eliminating trimming for language developers?

2020-05-25 Thread Tanmai Khanna
Here's a timing test for weighted dictionaries. On apertium-eng-kaz: 1. lt-trim analyser.bin bidix.bin analyser-found.bin Time: real 0m4.257s user 0m4.120s sys 0m0.131s 2. lt-trim analyser.bin bidix.bin analyser-found.bin lt-print -H analyser.bin > analyser.att lt-print -H

Re: [Apertium-stuff] How useful is eliminating trimming for language developers?

2020-05-26 Thread Tanmai Khanna
eighted dictionary takes ~8 seconds. I guess these facts are enough for us to at least provide our language developers with the option to disable trimming, and by propagating the surface form through secondary tags, it's what I hope to make a reality. Thanks for all your comments. :) Regards, Tan

[Apertium-stuff] Testing markup reordering in Apertium

2020-07-20 Thread Tanmai Khanna
ve any questions, suggestions, I'd be glad to respond to them on this thread. If you need help testing this on your language pair you can contact us on the IRC. Same if you find any bugs, or have any feature requests. Enjoy! *तन्मय खन्ना * *Tanmai

Re: [Apertium-stuff] Testing markup reordering in Apertium

2020-07-22 Thread Tanmai Khanna
Hey Xavi, Postgeneration has been modified to deal with wordbound blanks. It deals with wordbound blanks in one-one, one-many, many-one and many-many rules in postgeneration. (Pull Request <https://github.com/apertium/lttoolbox/pull/102>) Regards, *तन्मय खन्ना * *Tanmai Khanna* On Tue,

[Apertium-stuff] Anaphora Resolution 1.0.0

2020-06-21 Thread Tanmai Khanna
ages as well. Any clarifications can be asked here or on the IRC. I'd also appreciate it if you guys let me know about any feature requests you might have from a module like this. :) Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-

Re: [Apertium-stuff] Apertium's Wider Use & Secondary Tags

2020-06-14 Thread Tanmai Khanna
if one person in the PMC objects to something it shouldn’t happen, and I’m sure a group of experts and I can find something we all agree on as a design decision. Peace ✌ Tanmai Khanna Sent from my iPhone > On 14-Jun-2020, at 18:02, Samuel Sloniker wrote: > >  > If we d

Re: [Apertium-stuff] finding difficulty in pairing languages

2020-06-22 Thread Tanmai Khanna
/IRC Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* On Mon, Jun 22, 2020 at 8:48 AM annie rajan via Apertium-stuff < apertium-stuff@lists.sourceforge.net> wrote: > Hello, > I am Annie, from Goa, India. I did some work on morph analyses for Konkani > language and has p

Re: [Apertium-stuff] Apertium's Wider Use & Secondary Tags

2020-06-13 Thread Tanmai Khanna
ion is stagnating more than most projects I've seen in action. If people are still not convinced about any benefits that can come out of such a modification, then I guess I'll choose the best solution that provides no regression, keep it optional, and if it really is helpful then people can use it. :)

Re: [Apertium-stuff] Testing markup reordering in Apertium

2020-07-20 Thread Tanmai Khanna
the generation step. Since after generation the tokenisation of the input is lost (as the LUs are removed), tf-close adds a delimiter so that the reformatter knows the span of the format. Hope that answers your question. Thanks and Regards, *तन्मय खन्ना * *Tanmai Khanna* On Tue, Jul 21, 2020 at 4:57 AM

[Apertium-stuff] Starring Apertium repositories

2020-07-18 Thread Tanmai Khanna
software to star the repo and share it or something. These are just my personal views and I'm happy to be corrected :) Thanks, *तन्मय खन्ना * *Tanmai Khanna* ___ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net

Re: [Apertium-stuff] An easy tool to report bad translations and propose alternatives

2020-12-05 Thread Tanmai Khanna
the rules. I know a TM exists in Apertium already, but I'm not sure if it's being used on apertium.org, plus user given translations can definitely improve TMs for language pairs that don't have a lot of parallel corpora to learn TMs from. *तन्मय खन्ना * *Tanmai Khanna* On Sat, Dec 5, 2020 at 2:59 PM

Re: [Apertium-stuff] Cleaning Parallel Corpus

2021-04-29 Thread Tanmai Khanna
and spanish corpus. You'd lose some sentences to train on but that would be negligible and the remaining corpus would be aligned. Just a thought *तन्मय खन्ना * *Tanmai Khanna* On Thu, Apr 29, 2021 at 6:23 PM VIVEK VICKY wrote: > > On Thu, Apr 29, 2021 at 3:35 PM Kevin Brubeck Unhammer &

Re: [Apertium-stuff] Bitrotted releases redux

2021-03-31 Thread Tanmai Khanna
That's great! Do let me know when the main website is updated. Thanks, *तन्मय खन्ना * *Tanmai Khanna* On Wed, Mar 31, 2021 at 5:42 PM Tino Didriksen wrote: > This is less critical than I thought. I just re-tested all existing > packages, and they run. So I can update the mai

Re: [Apertium-stuff] Apertium PMC Election: Census & Candidates

2022-04-22 Thread Tanmai Khanna
end: > > - No change to the census > > - 4 candidates for the PMC: Francis Tyers, Tanmai Khanna, Tino Didriksen, > Xavi Ivars > > - 2 candidates for President: Francis, Tino > > - Auditor: Daniel Swanson > > -- Tino Didriksen > > > On Tue, 19 Apr 2022 at 13:20,

Re: [Apertium-stuff] Apertium PMC Election: Census & Candidates

2022-04-19 Thread Tanmai Khanna
Hi, I'd like to apply to be part of the PMC again. It's been great to see the management side of things with Apertium and would really love to see through our plan to create a formal organisation. Regards, Tanmai Khanna On Tue, Apr 19, 2022, 16:51 Tino Didriksen wrote: > G'day every

Re: [Apertium-stuff] Apertium PMC Election: Census & Candidates

2022-04-19 Thread Tanmai Khanna
Maybe in this new PMC we should strictly follow at least a once a month meeting. Even if there's nothing to discuss or no progress made, it helps to meet and keeps things moving. Just a thought On Tue, Apr 19, 2022, 22:59 Francis Tyers via Apertium-stuff < apertium-stuff@lists.sourceforge.net>

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-26 Thread Tanmai Khanna
ton, Kevin Brubeck > Unhammer, Mikel L. Forcada, Tanmai Khanna, Tino Didriksen, Xavi Ivars > > - President: Francis, Tino > > Given that exactly 7 for PMC would avoid the need for an election, I'm > happy to forego the whole thing and let Francis continue in the ro

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-27 Thread Tanmai Khanna
e that > can be achieved by unanimous consent. I say we take it and get on with the > business of forming the Foundation, which the (new) PMC can delegate the > legwork of to anyone. > > -- Tino Didriksen > > > On Tue, 26 Apr 2022 at 17:41, Tanmai Khanna > wrote: &g

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-27 Thread Tanmai Khanna
bit useless. > > Maybe if someone outside the PMC gave their opinion, voting would make > more sense. But so far, it's been only the ones in the PMC (+ Sushain + > Daniel), everyone agreeing. > > > -- > Xavi Ivars > < http://xavi.ivars.me > > > El dc.,

Re: [Apertium-stuff] strange behaviour of apertium

2022-07-15 Thread Tanmai Khanna
understand it, once you analyse the form, it goes through the pipeline, after transfer it becomes a unit enclosed by ^...$ which then goes through the generator. Is that relevant for your question? Tanmai Khanna On Sat, Jul 16, 2022, 08:21 Amba Kulkarni wrote: > > Dear members, > >

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-27 Thread Tanmai Khanna
My point isn't just about appearances though it's also about what's the more democratic thing to do. To at least give a proper platform for the non PMC people to say something. Moreover, I'm really confused about all this talk of "bureaucracy". It's one vote, which was supposed to happen anyway.

Re: [Apertium-stuff] Apertium PMC Election: Bypass election?

2022-04-27 Thread Tanmai Khanna
Yeah, but you're going to be elected to the PMC, you can't vote for no vote. Same as me, we shouldn't be deciding that we don't need a vote. Violates many principles :p On Wed, Apr 27, 2022, 16:28 Kevin Brubeck Unhammer wrote: > Xavi Ivars čálii: > > > But also, voting for just to confirm (or

Re: [Apertium-stuff] Capitalization Handling

2022-12-23 Thread Tanmai Khanna
Merry Christmas for wordbound blanks indeed! On Fri, Dec 23, 2022, 03:10 Daniel Swanson wrote: > Greetings Apertiumers! > > I have two updates to report: > > First, I have rewritten the postgenerator (again), this time as part > of apertium-separable (and so not breaking the old one, unlike

Re: [Apertium-stuff] Applying for GSOC 2023 projects

2023-02-23 Thread Tanmai Khanna
Telugu is a fairly decently resources language now, and has some well performing neural translators, like Hindi. We can still consider working with it in Apertium but first must figure out what our goal is, with it. If there's no particular goal then a low resource language would be preferable.

[Apertium-stuff] Stepping down from my position in the PMC

2023-05-29 Thread Tanmai Khanna
me along the way - I will forever be grateful. :) With this, I would request you to update this change on the wiki ( https://wiki.apertium.org/wiki/Project_Management_Committee), and remove me from the PMC emailing list. Regards, *तन्मय खन्ना * *Tanmai Khanna