Jim found out while I was writing my message. Would this be a .t4x?

Mikel


On 12/20/2011 05:15 PM, Jimmy O'Regan wrote:
> On 20 December 2011 15:11, Mikel Forcada<[email protected]>  wrote:
>> Pim:
>>
>> I'm trying to fix an issue with the af-nl pair, namely that removing
>> "nie" at the end of a sentence leaves a space between the last word
>> and the period.
>>
>> Have you tried to see where this extra space is added? I assume this is a
>> problem with a .t1x rule or similar dealing with nie, which does not
>> properly take care of superblanks. The rule <rule comment="REGLA: nie"> is
> Blank handling can only be done when the blank is _inside_ the chunk.
> Which, usually, is what you want.
>
> This file:
> <?xml version="1.0"?>
> <transfer>
>    <section-def-cats>
>      <def-cat n="punct">
>        <cat-item lemma="," tags="cm"/>
>        <cat-item lemma="." tags="sent"/>
>        <cat-item lemma=";" tags="sent"/>
>        <cat-item lemma=":" tags="sent"/>
>      </def-cat>
>
>      <def-cat n="any">
>        <cat-item tags=""/>         <!-- unknown words -->
>        <cat-item tags="*"/>
>      </def-cat>
>
>    </section-def-cats>
>
>    <section-def-attrs>
>    </section-def-attrs>
>
>    <section-def-vars>
>    </section-def-vars>
>
>    <section-rules>
>      <rule>
>        <pattern>
>          <pattern-item n="any"/>
>          <pattern-item n="punct"/>
>        </pattern>
>        <action>
>          <choose>
>            <when>
>              <test>
>                <equal>
>                  <b pos="1"/>
>                  <lit v=" "/>
>                </equal>
>              </test>
>              <out>
>                <lu><clip pos="1" side="tl" part="whole"/></lu>
>                <lu><clip pos="2" side="tl" part="whole"/></lu>
>              </out>
>            </when>
>            <otherwise>
>              <out>
>                <lu><clip pos="1" side="tl" part="whole"/></lu>
>                <b pos="1"/>
>                <lu><clip pos="2" side="tl" part="whole"/></lu>
>              </out>
>            </otherwise>
>          </choose>
>        </action>
>      </rule>
>    </section-rules>
> </transfer>
>
> when saved as 'noblank.t1x' gives the correct output:
>
> $ echo '^test<n><sg>$ ^.<sent>$' |apertium-transfer -n noblank.t1x noblank.bin
> ^test<n><sg>$^.<sent>$
> $ echo '^test<n><sg>$[ ]^.<sent>$' |apertium-transfer -n noblank.t1x noblank.bin
> ^test<n><sg>$[ ]^.<sent>$
>
> I presume you want the same behaviour for ',' ':' and ';' as '.'
>
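
For readers without an Apertium install at hand, the decision made by the <choose> in the quoted rule can be sketched in Python. This is only an illustration of the blank test, not how apertium-transfer is implemented: the function name apply_noblank and the regular expression are hypothetical, it handles exactly one word followed by one punctuation mark, and it does not check that the second unit really belongs to the "punct" category.

```python
import re

# Hypothetical helper (not part of Apertium): matches exactly two lexical
# units in stream format, ^...$ BLANK ^...$, capturing the blank between them.
PAIR = re.compile(r'(\^[^$]*\$)(.*?)(\^[^$]*\$)', re.S)

def apply_noblank(stream: str) -> str:
    """Mimic the <when>/<otherwise> branches of the quoted rule."""
    m = PAIR.fullmatch(stream)
    if m is None:
        # Not a two-unit pattern; leave the input untouched.
        return stream
    word, blank, punct = m.groups()
    if blank == " ":
        # <when>: the blank is a plain space, so it is dropped.
        return word + punct
    # <otherwise>: anything else (e.g. a superblank) is kept as-is.
    return word + blank + punct

if __name__ == "__main__":
    print(apply_noblank('^test<n><sg>$ ^.<sent>$'))
    print(apply_noblank('^test<n><sg>$[ ]^.<sent>$'))
```

The two calls at the bottom use the same inputs as the quoted shell session: a plain space is removed, while a superblank such as [ ] is passed through unchanged.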


-- 
Mikel L. Forcada (http://www.dlsi.ua.es/~mlf/)
Departament de Llenguatges i Sistemes Informàtics
Universitat d'Alacant
E-03071 Alacant, Spain
Phone: +34 96 590 9776
Fax: +34 96 590 9326

