Now that the Java compound-word implementation has been ported to C++, we
can probably consider it the 'de facto' way of handling compounds in
lttoolbox -- it is _in use_ and there have been _no alternatives_.

So it is probably worth looking at how we are going to represent this
nicely in the .dix format. At the moment we use two 'special' symbols:

<sdef n="compound-only-L" c="for a form that can only appear on the L"/>
<sdef n="compound-R"    c="for a form that can only appear on the R, or
as a word on its own"/>

I propose making a new element <c> for compound, and having one
attribute "r" for restriction.

<s n="compound-only-L"/> would be replaced with <c r="L"/> and
<s n="compound-R"/> would be replaced with <c r="R"/>
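
If the proposal is accepted, existing dictionaries would need the two
symbols rewritten. A minimal sketch of such a migration follows, assuming
the replacement is purely textual; the function name and the mapping table
are hypothetical and not part of lttoolbox. The <sdef> declarations are
deliberately left alone and would have to be removed by hand.

```python
import re

# Hypothetical mapping from the current special symbols to the proposed
# restriction values; <c r="..."/> is only a proposal at this point.
SYMBOL_TO_RESTRICTION = {
    "compound-only-L": "L",
    "compound-R": "R",
}

# Matches <s n="compound-only-L"/> and <s n="compound-R"/>, also when the
# tag has been wrapped across a line break after "<s".
_SYMBOL_RE = re.compile(r'<s\s+n="(compound-only-L|compound-R)"\s*/>')


def migrate_compound_symbols(dix_text: str) -> str:
    """Rewrite <s n="compound-..."/> references as <c r="..."/>."""
    return _SYMBOL_RE.sub(
        lambda m: '<c r="%s"/>' % SYMBOL_TO_RESTRICTION[m.group(1)],
        dix_text,
    )
```

Because the pattern requires whitespace directly after "<s", an
<sdef n="compound-R" .../> line is not matched and stays as it is.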

Example:

These entries

  <e><p><l>plast</l><r>plast<s n="n"/><s n="m"/><s n="sg"/><s n="ind"/><c r="L"/></r></p></e>
  <e><p><l>plast</l><r>plast<s n="n"/><s n="m"/><s n="sg"/><s n="ind"/></r></p></e>
  <e><p><l>kortet</l><r>kort<s n="n"/><s n="nt"/><s n="sg"/><s n="def"/><c r="R"/></r></p></e>

would give

$ echo plastkortet|lt-proc -e nb-nn.automorf.bin # 'the plastic card'
^plastkortet/plast<n><m><sg><ind><cmp>+kort<n><nt><sg><def>$

$ echo kortet|lt-proc -e nb-nn.automorf.bin # 'the card'
^kortet/korte<vblex><pp>/korte<vblex><pret>/kort<n><nt><sg><def>/kort<n><nt><sg><def>/korte<adj><pp><nt><sg><ind>/korte<adj><pp><mf><sg><ind>$

$ echo plast|lt-proc -e nb-nn.automorf.bin # 'plastic'
^plast/plast<n><m><sg><ind>/plast<n><m><sg><ind>$

while these entries:

  <e><p><l>navne</l><r>navn<s n="n"/><s n="nt"/><s n="sg"/><s n="ind"/><c r="L"/></r></p></e>
  <e><p><l>navn</l><r>navn<s n="n"/><s n="nt"/><s n="sg"/><s n="ind"/></r></p></e>
  <e><p><l>kortet</l><r>kort<s n="n"/><s n="nt"/><s n="sg"/><s n="def"/><c r="R"/></r></p></e>

would give:

$ echo navnekortet|lt-proc -e nb-nn.automorf.bin # 'the name card'
^navnekortet/navn<n><nt><sg><ind><cmp>+kort<n><nt><sg><def>$

$ echo navne|lt-proc -e nb-nn.automorf.bin
^navne/*navne$

$ echo navn|lt-proc -e nb-nn.automorf.bin
^navn/navn<n><nt><sg><ind>/navn<n><nt><sg><ind>/navn<n><nt><pl><ind>$

$ echo kortet|lt-proc -e nb-nn.automorf.bin
^kortet/korte<vblex><pp>/korte<vblex><pret>/kort<n><nt><sg><def>/korte<adj><pp><nt><sg><ind>/korte<adj><pp><mf><sg><ind>$
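
The behaviour in the navne/navn/kortet example can be modelled with a small
toy, shown here only to make the intended meaning of the restrictions
concrete. It is not the lttoolbox algorithm: it assumes compounds have
exactly two parts, that compounding is only tried when the whole word has
no analysis of its own, and that the lexicon is just the three entries
above. The names ENTRIES and analyse are hypothetical.

```python
# Toy lexicon: (surface form, analysis, restriction).
# Restriction is "L" (left part of a compound only), "R" (right part of a
# compound, or a word on its own) or None (word on its own only).
ENTRIES = [
    ("navne", "navn<n><nt><sg><ind>", "L"),
    ("navn", "navn<n><nt><sg><ind>", None),
    ("kortet", "kort<n><nt><sg><def>", "R"),
]


def analyse(word, entries=ENTRIES):
    """Return the analyses of word; an empty list means 'unknown'."""
    # Entries restricted to "L" never stand as a word on their own.
    whole = [a for s, a, r in entries if s == word and r != "L"]
    if whole:
        return whole

    # Assumption: compounds are only tried for otherwise unknown words,
    # and only two-part splits are considered.
    compounds = []
    for i in range(1, len(word)):
        left, right = word[:i], word[i:]
        for ls, la, lr in entries:
            if ls != left or lr != "L":
                continue
            for rs, ra, rr in entries:
                if rs == right and rr == "R":
                    compounds.append(la + "<cmp>+" + ra)
    return compounds
```

With this toy lexicon, analyse("navnekortet") yields the single compound
analysis shown above, while analyse("navne") yields nothing, which
corresponds to the *navne output. The real analyser returns more readings
for navn and kortet because the full dictionary has more entries.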

The DTD would be updated to allow this, and a new release of lttoolbox
would be made. Any pairs using this would be made to depend on the new
release.

It would be nice to get this done before Christmas. Are there any
comments?

Fran


_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
