On 28 February 2014 04:28, Gaurav Agrawal <[email protected]> wrote:
> Hello All,
>
> I am Gaurav Agrawal, a student of M.Tech in Computer Science and Engineering
> at IIIT, Hyderabad.
>
> I am very much interested in machine learning and want to spend my summer
> contributing to open source. So, GSoC is the best opportunity and
> Apertium is the best organization in machine learning for me.
>
> As I have good prior knowledge of XML and Java, and also basic
> knowledge of Python and shell scripts, I found the project "Unify
> the metadix formats" interesting and suitable for me.
>
> Thanks to #Unhammer #firespeaker #wei2912 for suggesting the wiki
> pages for a basic understanding of the Apertium project and for the
> installation.
>
> Presently, I am working on the Coding Challenge :)
>
> I have a few queries about it:
>
> 1)
>
> For the entry:
> <e r="RL" lm="débil"><i>débil</i><par n="abdominal__adj"/></e>
>
> Output suggested:
>
> (débil:>:débil)[abdominal_adj]; # débil
>
> But as it is RL, i.e. Right to Left, as per my understanding it should be:
>
> (débil:<:débil)[abdominal_adj]; # débil ?
>
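A quick sketch of the direction mapping you describe, in Python. The function name, the marker table and the exact output layout are my assumptions, based only on the examples in this thread; it handles only simple entries with one <i> and one <par>, keeps the paradigm name verbatim, and is not the official converter:

```python
import xml.etree.ElementTree as ET

# Assumed mapping from the dix "r" attribute to the direction marker
# seen in the examples in this thread (hypothetical, not from a spec).
DIRECTION_MARKER = {"LR": ">", "RL": "<"}


def convert_simple_entry(xml_entry):
    """Convert one simple <e> entry (<i> stem plus <par>) to a one-line form."""
    entry = ET.fromstring(xml_entry)
    stem = entry.findtext("i", default="")
    paradigm = entry.find("par").get("n")  # kept verbatim, no renaming
    lemma = entry.get("lm", "")
    restriction = entry.get("r")

    if restriction is None:
        # No restriction: plain left:right pair.
        pair = f"({stem}:{stem})"
    else:
        # Restricted entry: put the direction marker between the two sides.
        pair = f"({stem}:{DIRECTION_MARKER[restriction]}:{stem})"

    return f"{pair}[{paradigm}]; # {lemma}"


print(convert_simple_entry(
    '<e r="RL" lm="débil"><i>débil</i><par n="abdominal__adj"/></e>'
))
```

An entry with r="LR" goes through the same path and gets the ">" marker instead.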
You are correct.

> 2)
>
> Similarly for the conversion of:
> <e r="LR" lm="inapropiado"><i>inapropiad</i><par n="absolut/o__adj"/></e>
>
> Output suggested:
> (inapropiad:<:inapropiad)[absolut/o_adj]; # inapropiado
>
> But as it is LR, i.e. Left to Right, as per my understanding it should be:
>
> (inapropiad:>:inapropiad)[absolut/o_adj]; # inapropiado ?
>

You are correct.

> 3)
>
> The entry:
> <e lm="multa de tráfico"><i>multa</i><par n="abeja__n"/><p><l><b/>de<b/>tráfico</l><r><g><b/>de<b/>tráfico</g></r></p></e>
>
> should become:
>
> (multa:multa)[abeja__n](_de_tráfico)); # multa de tráfico
>
> We have both the left (l) and right (r) parts in the pair (p):
>
> <p><l><b/>de<b/>tráfico</l><r><g><b/>de<b/>tráfico</g></r></p></e>
>
> But in the conversion we only have (_de_tráfico)) and not
> (_de_tráfico:_de_tráfico)). Is that because the left and right parts are
> equal?
> If yes, are we doing it this way only when there are multiwords with inner
> inflection and we have the <g> tag?
> How will we treat the case when the left and right parts are different with
> the <g> tag?
>

I would assume that the output of
'<p><l><b/>de<b/>tráfico</l><r><g><b/>de<b/>tráfico</g></r></p></e>'
should be '(_de_tráfico:#_de_tráfico)', i.e., that <p> is processed as
usual and that <g> inserts the '#' symbol as in the text stream.

'(_de_tráfico)' is the output I would expect to see for
<i><b/>de<b/>tráfico</i>.

-- 
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
