Hi !

I have a proposal about making new chunks in interchunk. At the moment
if we want to make a new chunk we have to put in <lit> tags with '{' and
'}' -- this is superugly. Here is an example:

          <chunk>
            <lit v="prep"/>
            <lit-tag v="PREP"/>
            <lit v="{^na"/>
            <lit-tag v="pr"/>
            <lit v="$}"/>
          </chunk>

output: ^prep<PREP>{^na<pr>$}$

It would be supernicer if it were like this:

          <chunk>
            <lit v="prep"/>
            <lit-tag v="PREP"/>
            <chcontent>
              <lu>
                <lit v="на"/>
                <lit-tag v="pr"/>
              </lu>
            </chcontent>
          </chunk>

output: ^prep<PREP>{^na<pr>$}$

What do people think ? It would mean changing the DTD and the transfer
code... But quite a simple change I think, and backwards compatible.

<chcontent> outputs        {
</chcontent> outputs       }
<lu> (in chunker) outputs  ^
</lu> (in chunker) outputs $

Fran

PS. People are really using the <lit v="{"/> hack at the moment:

$ cat trunk/apertium-[a-z][a-z]-[a-z][a-z]/*.t2x | grep '{' |wc -l
586

It appears in the following files:

/home/fran/source/apertium//trunk/apertium-br-fr/apertium-br-fr.br-fr.t2x
/home/fran/source/apertium//trunk/apertium-cy-en/apertium-cy-en.cy-en.t2x
/home/fran/source/apertium//trunk/apertium-en-ca/apertium-en-ca.ca-en.t2x
/home/fran/source/apertium//trunk/apertium-en-ca/apertium-en-ca.en-ca.t2x
/home/fran/source/apertium//trunk/apertium-en-es/apertium-en-es.en-es.t2x
/home/fran/source/apertium//trunk/apertium-en-es/apertium-en-es.es-en.t2x
/home/fran/source/apertium//trunk/apertium-en-gl/apertium-en-gl.en-gl.t2x
/home/fran/source/apertium//trunk/apertium-en-gl/apertium-en-gl.gl-en.t2x
/home/fran/source/apertium//trunk/apertium-eo-ca/apertium-eo-ca.ca-eo.t2x
/home/fran/source/apertium//trunk/apertium-eo-en/apertium-eo-en.en-eo.t2x
/home/fran/source/apertium//trunk/apertium-eo-en/apertium-eo-en.eo-en.t2x
/home/fran/source/apertium//trunk/apertium-eo-es/apertium-eo-es.es-eo.t2x
/home/fran/source/apertium//trunk/apertium-eo-fr/apertium-eo-fr.fr-eo.t2x
/home/fran/source/apertium//trunk/apertium-eu-es/apertium-eu-es.eu-es.t2x
/home/fran/source/apertium//trunk/apertium-fr-ca/apertium-fr-ca.fr-ca.t2x
/home/fran/source/apertium//trunk/apertium-fr-es/apertium-fr-es.es-fr.t2x
/home/fran/source/apertium//trunk/apertium-fr-es/apertium-fr-es.fr-es.t2x
/home/fran/source/apertium//trunk/apertium-is-en/apertium-is-en.is-en.t2x
/home/fran/source/apertium//trunk/apertium-nn-nb/apertium-nn-nb.nb-nn.t2x
/home/fran/source/apertium//trunk/apertium-oc-ca/apertium-oc-ca.ca-oc.t2x
/home/fran/source/apertium//trunk/apertium-oc-ca/apertium-oc-ca.oc-ca.t2x
/home/fran/source/apertium//trunk/apertium-oc-ca/[email protected]
/home/fran/source/apertium//trunk/apertium-oc-ca/ca-oc.t2x
/home/fran/source/apertium//trunk/apertium-oc-es/apertium-oc-es.es-oc.t2x
/home/fran/source/apertium//trunk/apertium-oc-es/[email protected]
/home/fran/source/apertium//trunk/apertium-oc-es/es-oc.t2x

So we should really have a way of expressing it properly.


------------------------------------------------------------------------------
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to