On 28 February 2014 04:28, Gaurav Agrawal <[email protected]> wrote:
> Hello All,
>
> I am Gaurav Agrawal, student of M.Tech in Computer Science and Engineering
> at IIIT, Hyderabad.
>
> I am very much interested in the machine learning and want to send my summer
> by contributing to the open source. So, GSoc is the best opportunities and
> Apertium is the best organization in machine learning for me.
>
> As I have the prior good knowledge of the XML and the Java and also the
> basic knowledge of the python and shell scripts, I found the project "Unify
> the metadix formats" interesting and suitable for me.
>
> Thanks to the #Unhammer #firespeaker #wei2912 for suggesting me the wiki
> pages for the basic understanding of the Apertium project and for the
> installation.
>
> Presently, I have been working on the Coding Challenge :)
>
> I have the few queries in the same:
>
> 1)
>
> For the entry:
> <e r="RL" lm="débil"><i>débil</i><par n="abdominal__adj"/></e>
>
> Output suggested:
>
> (débil:>:débil)[abdominal_adj]; # débil
>
> But as is it RL i.e. Right to Left. So, as per understading it should be :
>
> (débil:<:débil)[abdominal_adj]; # débil ?
>

You are correct.

> 2)
>
> Similarly for the conversion of :
> <e r="LR" lm="inapropiado"><i>inapropiad</i><par n="absolut/o__adj"/></e>
>
> Output Suggested:
> (inapropiad:<:inapropiad)[absolut/o_adj]; # inapropiado
>
> But as is it LR i.e. Right to Left. So, as per understading it should be :
>
> (inapropiad:>:inapropiad)[absolut/o_adj]; # inapropiado ?
>

You are correct.

> 3)
>
> The Entry:
>  <e lm="multa de tráfico"><i>multa</i><par
> n="abeja__n"/><p><l><b/>de<b/>tráfico</l><r><g><b/>de<b/>tráfico</g></r></p></e>
>
> should becomes:
>
> (multa:multa)[abeja__n](_de_tráfico)); # multa de tráfico
>
> We have both the left (l) and right(r) part in the pair (p) :
>
> <p><l><b/>de<b/>tráfico</l><r><g><b/>de<b/>tráfico</g></r></p></e>
>
> But in the conversion we only have the (_de_tráfico)) and not the
> (_de_tráfico:_de_tráfico)) is it because both the left and right part are
> equal ?
> If yes, we are doing this way only when there is Multiwords with inner
> inflection and we have the tag <g> ?
> How we will treat the case when the left and right part are different with
> the <g> tag.
>

I would assume that the output of
'<p><l><b/>de<b/>tráfico</l><r><g><b/>de<b/>tráfico</g></r></p></e>'
should be '(_de_tráfico:#_de_tráfico) -- i.e., that <p> is processed
as usual, and that <g> inserts the '#' symbol as in the text stream.
'(_de_tráfico)' is the output I would expect to see for
<i><b/>de<b/>tráfico</i>


-- 
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you

------------------------------------------------------------------------------
Subversion Kills Productivity. Get off Subversion & Make the Move to Perforce.
With Perforce, you get hassle-free workflows. Merge that actually works. 
Faster operations. Version large binaries.  Built-in WAN optimization and the
freedom to use Git, Perforce or both. Make the move to Perforce.
http://pubads.g.doubleclick.net/gampad/clk?id=122218951&iu=/4140/ostg.clktrk
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to