El dc 08 de 02 de 2012 a les 10:21 -0800, en/na stevens35 va escriure: > Hi All, > > Thanks for the quick and really awesome responses. The java version of > Lttoolbox is /*exactly*/ what I was hoping to find (and not have to > write myself)! Looking through the source code for the processing main, > it doesn't look too hard to use the internal classes as a library and > feed it strings. > > On a related note, if i just want to do morphological analysis of > English, which language pair should I start with? Or is there an all > encompassing English morphological dictionary that someone maintains? > If not, how troublesome would it be to merge the existing different > English dictionary files?
To merge dictionaries, I think you can use dixtools. For English, I would recommend starting with: en-ca, en-es, is-en, mk-en these should have the most entries. http://wiki.apertium.org/wiki/Dixtools:_Merge_dictionaries Actually, here is a full list: $ for i in `ls *en*/*.en.*dix`; do echo -n "$i: "; cat $i | grep '<e lm' | sort -u | wc -l; done apertium-cy-en/apertium-cy-en.en.dix: 18141 apertium-en-ca/apertium-en-ca.en.metadix: 31268 apertium-en-es/apertium-en-es.en.metadix: 32286 apertium-en-gl/apertium-en-gl.en.metadix: 20044 apertium-eo-en/apertium-eo-en.en.dix: 25174 apertium-eu-en/apertium-eu-en.en.metadix: 23540 apertium-is-en/apertium-is-en.en.dix: 32021 apertium-mk-en/apertium-mk-en.en.dix: 33000 Regards, Fran ------------------------------------------------------------------------------ Keep Your Developer Skills Current with LearnDevNow! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-d2d _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
