On Wed, 08 Feb 2012 at 18:03 +0000, Jimmy O'Regan wrote:
> On 8 February 2012 17:38, stevens35 <[email protected]> wrote:
> > Hello,
> >
> > I've been searching around for a good morphological analyzer for a while
> > and came across Lttoolbox. The analyzer step does exactly what I want
> > for words in a language: it splits the word into its lexical base and
> > then adds morphological tags based on how the word was formed. Up
> > until now, I've just been using the Porter Stemmer to get the root word,
> > but it's always been displeasing because it throws away the rest of the
> > surface form.
> >
>
> That's to be expected, because that's what most people want from a
> stemmer. There are wrappers that maintain the surface form, and/or
> the offset in the original string. NLP2RDF
> (http://code.google.com/p/nlp2rdf/) is the only example I can think of
> off the top of my head, but I'm sure there are others.
>
> > However, most of the text processing code I work with is in Java, and if
> > possible, I'd like to keep everything within Java. Has anyone had any
> > experience linking to Lttoolbox from Java? Or does anyone know of any
> > Java versions of Lttoolbox that utilize the existing dictionaries, or a
> > similar tool for Java?
>
> We can go one better: there's an implementation of lttoolbox (in fact,
> of all of Apertium) in Java:
> http://apertium.svn.sourceforge.net/viewvc/apertium/trunk/lttoolbox-java/
>
> AFAIR, the interface closely follows the C++ version, so the input and
> output are in the form of files, where you (presumably) want to pass
> strings -- Jacob, our resident Java guru, had talked about adding a
> string-based interface, though I'm not sure if he has had the time.
> He's definitely the best person to talk to, in any case.

You can probably use the biltrans[1] method to pass a string to a
transducer and get the result back. I think this is what we did with voikko.

Fran

1. http://wiki.apertium.org/wiki/Lttoolbox_API
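
Whichever route is used to get an analysis back as a string, the result is in the Apertium stream format, e.g. ^cats/cat<n><pl>$, where the surface form comes first and each reading after a slash is a lemma followed by its tags. Below is a minimal sketch of pulling the surface form, lemma and tags out of such a string in Java. The class names Analysis and AnalysisParser are made up for illustration and are not part of lttoolbox-java. The sketch also assumes simple input: it does not handle escaped characters, and for multiword readings it only keeps the text before the first tag as the lemma.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** One reading of a surface form: a lemma plus its morphological tags. (Hypothetical helper type.) */
final class Analysis {
    final String surface;
    final String lemma;
    final List<String> tags;

    Analysis(String surface, String lemma, List<String> tags) {
        this.surface = surface;
        this.lemma = lemma;
        this.tags = tags;
    }

    @Override
    public String toString() {
        return surface + " -> " + lemma + " " + tags;
    }
}

/** Hypothetical parser for analyser output; not part of lttoolbox-java. */
final class AnalysisParser {
    // A lexical unit looks like ^surface/reading1/reading2$
    private static final Pattern UNIT = Pattern.compile("\\^([^/$^]*)/([^$]*)\\$");
    // A tag looks like <n> or <pl>
    private static final Pattern TAG = Pattern.compile("<([^<>]+)>");

    static List<Analysis> parse(String analyserOutput) {
        List<Analysis> result = new ArrayList<>();
        Matcher unit = UNIT.matcher(analyserOutput);
        while (unit.find()) {
            String surface = unit.group(1);
            for (String reading : unit.group(2).split("/")) {
                if (reading.startsWith("*")) {
                    continue; // unknown word: the analyser found no reading
                }
                // Everything before the first tag is treated as the lemma.
                int firstTag = reading.indexOf('<');
                String lemma = firstTag < 0 ? reading : reading.substring(0, firstTag);
                List<String> tags = new ArrayList<>();
                Matcher tag = TAG.matcher(reading);
                while (tag.find()) {
                    tags.add(tag.group(1));
                }
                result.add(new Analysis(surface, lemma, tags));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        String sample = "^cats/cat<n><pl>$ ^sleep/sleep<n><sg>/sleep<vblex><inf>$";
        for (Analysis a : parse(sample)) {
            System.out.println(a);
        }
    }
}
```

Unlike a stemmer, this keeps the surface form next to each lemma, and an ambiguous word simply yields one Analysis per reading.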
