Now that the Java compound word implementation has been ported to C++, we can probably consider this the 'de facto' way we are going to do compounds in lttoolbox: it is _in use_ and there have been _no alternatives_.
So it is probably worth looking at how we are going to represent this nicely in the .dix format. At the moment we use two 'special' symbols:

  <sdef n="compound-only-L" c="for a form that can only appear on the L"/>
  <sdef n="compound-R" c="for a form that can only appear on the R, or as a word on its own"/>

I propose making a new element <c> for compound, with one attribute "r" for restriction. <s n="compound-only-L"/> would be replaced with <c r="L"/> and <s n="compound-R"/> would be replaced with <c r="R"/>.

Example: these entries

  <e><p><l>plast</l><r>plast<s n="n"/><s n="m"/><s n="sg"/><s n="ind"/><c r="L"/></r></p></e>
  <e><p><l>plast</l><r>plast<s n="n"/><s n="m"/><s n="sg"/><s n="ind"/></r></p></e>
  <e><p><l>kortet</l><r>kort<s n="n"/><s n="nt"/><s n="sg"/><s n="def"/><c r="R"/></r></p></e>

would give

  $ echo plastkortet|lt-proc -e nb-nn.automorf.bin   # 'the plastic card'
  ^plastkortet/plast<n><m><sg><ind><cmp>+kort<n><nt><sg><def>$

  $ echo kortet|lt-proc -e nb-nn.automorf.bin   # 'the card'
  ^kortet/korte<vblex><pp>/korte<vblex><pret>/kort<n><nt><sg><def>/kort<n><nt><sg><def>/korte<adj><pp><nt><sg><ind>/korte<adj><pp><mf><sg><ind>$

  $ echo plast|lt-proc -e nb-nn.automorf.bin   # 'plastic'
  ^plast/plast<n><m><sg><ind>/plast<n><m><sg><ind>$

while these entries:

  <e><p><l>navne</l><r>navn<s n="n"/><s n="nt"/><s n="sg"/><s n="ind"/><c r="L"/></r></p></e>
  <e><p><l>navn</l><r>navn<s n="n"/><s n="nt"/><s n="sg"/><s n="ind"/></r></p></e>
  <e><p><l>kortet</l><r>kort<s n="n"/><s n="nt"/><s n="sg"/><s n="def"/><c r="R"/></r></p></e>

would give:

  $ echo navnekortet|lt-proc -e nb-nn.automorf.bin   # 'the name card'
  ^navnekortet/navn<n><nt><sg><ind><cmp>+kort<n><nt><sg><def>$

  $ echo navne|lt-proc -e nb-nn.automorf.bin
  ^navne/*navne$

  $ echo navn|lt-proc -e nb-nn.automorf.bin
  ^navn/navn<n><nt><sg><ind>/navn<n><nt><sg><ind>/navn<n><nt><pl><ind>$

  $ echo kortet|lt-proc -e nb-nn.automorf.bin
  ^kortet/korte<vblex><pp>/korte<vblex><pret>/kort<n><nt><sg><def>/korte<adj><pp><nt><sg><ind>/korte<adj><pp><mf><sg><ind>$

The DTD would be updated to allow this, and a new release of lttoolbox would be made. Any pairs using this would be made to depend on the new release.

It would be nice to get this done before Christmas. Are there any comments?

Fran
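
P.S. For pairs that already use the two 'special' symbols, the switch to <c r="..."/> could probably be scripted. Below is a rough sketch in Python, purely as an illustration: it is not part of lttoolbox, the function name migrate_entry is made up, and it assumes the symbols are written as <s n="..."/> with double quotes. It leaves the <sdef> declarations alone, so those would still need removing by hand.

```python
import re

# Current 'special' symbol name -> proposed value of the "r" attribute.
RESTRICTIONS = {"compound-only-L": "L", "compound-R": "R"}

# Matches only <s n="compound-only-L"/> and <s n="compound-R"/>.
# The \s+ after "<s" means <sdef ...> declarations are never touched.
_COMPOUND_SYMBOL = re.compile(r'<s\s+n="(compound-only-L|compound-R)"\s*/>')


def migrate_entry(xml_text: str) -> str:
    """Rewrite the old compound symbols as the proposed <c r="..."/> element."""
    return _COMPOUND_SYMBOL.sub(
        lambda m: '<c r="%s"/>' % RESTRICTIONS[m.group(1)], xml_text
    )


if __name__ == "__main__":
    old = ('<e><p><l>kortet</l><r>kort<s n="n"/><s n="nt"/><s n="sg"/>'
           '<s n="def"/><s n="compound-R"/></r></p></e>')
    print(migrate_entry(old))
```

A plain textual substitution is used rather than an XML parser so that the rest of the dictionary (whitespace, comments, entry order) stays byte-for-byte the same and the resulting diff is easy to review.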
