Hi Tanmai,

I won't be a mentor, but I asked for anaphora resolution in Apertium, so,
if I am allowed, I'd like some clarification about the proposal (which, I
think, is great - congrats).

First of all, note that "The girl ate his apple" is not grammatically
incoherent. Maybe she ate an apple given by a male friend of her. Anaphora
resolution is complicated i.a. because language is often ambiguous.

1. I've been thinking about the example

La chica comió su manzana

Let's suppose that the antecedent of "su" is "la chica".
If the target language would be a Slavic language or Esperanto, the
selection will not be between "his", "her" or "its", but also a reflexive
possessive pronoun, e.g. in Russian Девушка съела своё яблоко, but
not Девушка съела её яблоко. If using the proposal in
http://wiki.apertium.org/wiki/Anaphora_resolution I'm not sure how could we
deal with it. We probably should need to have a referent in the verb too,
in order to be able to compare in the transfer rules whether the antecedent
of "su" is also the antecedent of "comió".

So, my point is: will the user be able to "configure" for which parts of
speech should the antecedent be tracked? E.g. for the Catalan-Spanish pair
I don't see any need to track the "antecedents" of verbs, but for e.g.
Spanish to English it seems necessary for dealing with zero pronouns.

(By the way, I am surprised that e.g. the subject of a verb can be tracked
by a language-independent tool without any configuration. I really doubt
this can be true.)

2. The examples in "Reflexive pronouns" and "Long distance agreement" seem
very difficult. I'd propose a few simpler agreements:
* The lady with the book is nice.
* The lady reading the book is nice.
* The lady who reads the book is nice.
"Nice" should be feminine in Spanish/Catalan (currently it happens only in
the first case)
* The singers that sing sing well.
Both "sing" should be p3pl in Spanish/Catalan, currently they are not ("Los
cantantes que canta canta bien").

3. Let's accept that we will deal only with the 3rd person. It is too
complicated to resolve:
* I'm tall
(gender?)
* You are tall
(gender? number?)

4. I cannot see why it should be useful to test the system with the
Spanish-English and Catalan-English pairs. As for the anaphora, if I am not
wrong, Catalan and Spanish are twins. One pair of the two seems enough.

5. One detail: the current translation of
La chica está aquí, lleva un vestido rojo.
is:
The girl is here, carries a red dress.

Best,
Hèctor

Missatge de Tanmai Khanna <khanna.tan...@gmail.com> del dia dv., 29 de març
2019 a les 15:48:

> Hi,
> I have submitted a draft for review for the project "Anaphora Resolution"
> for GSoC 2019. The project will also include a tool for resolution of
> agreement for adjectives in Spanish, Catalan and other languages that need
> it.
>
> You can find the proposal here:
> http://wiki.apertium.org/wiki/User:Khannatanmai
>
> If anyone has any comments, suggestions, criticism, ideas, I would
> really appreciate if you let me know as it'll help me make a stronger
> proposal and a better tool for Apertium during GSoC 2019.
>
> Thanks and Regards,
> Tanmai Khanna
> IRC: khannatanmai
>
> --
> *Khanna, Tanmai*
> _______________________________________________
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to