On 8 February 2012 17:38, stevens35 <[email protected]> wrote: > Hello, > > I've been searching around for a good morphological analyzer for a while > and came across Lttoolbox. The analyzer step does exactly what I want > for words in a language, it splits the word into it's lexical base and > then adds in morphological tags based on how the word was formed. Up > until now, I've just been using the Porter Stemmer to get the root word, > but it's always been displeasing because it throws away the rest of the > surface form. >
That's to be expected, because that's what most people want from a stemmer. There are wrappers that maintain the surface form, and/or offset in the original string. NLP2RDF (http://code.google.com/p/nlp2rdf/) is the only example I can think of off the top of my head, but I'm sure there are others. > However, most of the text processing code I work with is in Java, and if > possible, I'd like to keep everything within Java. Had anyone had any > experience linking to Lttoolbox from Java? Or does anyone know of any > java versions of Lttoolbox that utilize the existing dictionaries, or a > similar tool for java? We can go one better: there's an implementation of lttoolbox (in fact, of all of Apertium) in Java: http://apertium.svn.sourceforge.net/viewvc/apertium/trunk/lttoolbox-java/ AFAIR, the interface closely follows the C++ version, so the input and output are in the form of files, where you (presumably) want to pass strings -- Jacob, our resident Java guru, had talked about adding a string based interface, though I'm not sure if he has had the time. He's definitely the best person to talk to, in any case. -- <Sefam> Are any of the mentors around? <jimregan> yes, they're the ones trolling you ------------------------------------------------------------------------------ Keep Your Developer Skills Current with LearnDevNow! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-d2d _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
