El dt 25 de 03 de 2014 a les 12:17 +0000, en/na Jim O'Regan va escriure: > On 25 March 2014 11:05, Francis Tyers <[email protected]> wrote: > > Hello everyone, > > > > I think it is time to do a new release of apertium and lttoolbox. This > > release would be the 3.3 release. If you are wondering why, here are > > some new unreleased features: > > > > ==lttoolbox== > > > > * lt-print: a program to print out binary transducers in ATT format. > > * lt-comp: will now compile ATT transducers > > * lt-trim: a program to intersect a morphological analyser with a > > bilingual dictionary in lttoolbox format. > > * the pkg-config .pc file has also changed from lttoolbox-3.2 to > > lttoolbox and versions should rely only on version, not on the name. > > > > ==apertium== > > > > * Gang Chen's sliding window part-of-speech tagger was integrated and is > > optional next to the HMM tagger > > * the pkg-config .pc file has also changed from apertium-3.2 to apertium > > and versions should rely only on version, not on the name. > > * new deformatter for latex. > > * Unhammer's work on adding autotools support for separate language > > directories. > > > > I think this should be fairly convincing. Does anyone have any > > objections ? Also, feel free to bring up some new features that I have > > forgotten. > > > > I believe this would be the first version of lt-comp that supports alt > and v attributes. > > Also, I have a tiny feature that allows the user to specify a set of > characters to be ignored at runtime (motivated primarily by soft > hyphens, but I've left it general[1]). I sent the patch to Sergio to > review, but I'd really rather get it in now than wait n years until > the next release :) > > For the curious, I've attached the patch. > > Current behaviour is: > $ echo testing |lttoolbox/lt-proc > ~/Apertium/apertium-en-es/en-es.automorf.bin > ^test/test<n><sg>/test<vblex><inf>/test<vblex><pres>$^ing/*ing > > Using this as soft-hyphen.icx: > > <?xml version="1.0"?> > <ignored-chars> > <char value="­ "/> > </ignored-chars> > > echo testing |lttoolbox/lt-proc -i soft-hyphen.icx > ~/Apertium/apertium-en-es/en-es.automorf.bin > ^testing/test<vblex><ger>/test<vblex><pprs>/test<vblex><subs>/testing<n><sg>$
Could this just be included as default ? I mean, are there any cases in which we would not want to skip a soft-hyphen ? Fran ------------------------------------------------------------------------------ Learn Graph Databases - Download FREE O'Reilly Book "Graph Databases" is the definitive new guide to graph databases and their applications. Written by three acclaimed leaders in the field, this first edition is now available. Download your free book today! http://p.sf.net/sfu/13534_NeoTech _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
