On 8 February 2012 17:38, stevens35 <[email protected]> wrote:
> Hello,
>
> I've been searching around for a good morphological analyzer for a while
> and came across Lttoolbox.  The analyzer step does exactly what I want
> for words in a language: it splits the word into its lexical base and
> then adds morphological tags based on how the word was formed.  Up
> until now, I've just been using the Porter Stemmer to get the root word,
> but it's always been displeasing because it throws away the rest of the
> surface form.
>

That's to be expected, because that's what most people want from a
stemmer. There are wrappers that maintain the surface form, and/or
offset in the original string. NLP2RDF
(http://code.google.com/p/nlp2rdf/) is the only example I can think of
off the top of my head, but I'm sure there are others.
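
To make the "maintain the surface form and/or offset" idea concrete, here
is a minimal sketch in Java. It is not how NLP2RDF works; the class name,
the Token type and the toyStem function are all made up for illustration,
and toyStem only stands in for a real stemmer such as Porter's.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.Function;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical wrapper: keeps each word's surface form and character
// offsets next to whatever stem the supplied stemmer returns.
final class SurfaceKeepingStemmer {

    /** One token: surface form, stem, and [start, end) span in the input. */
    static final class Token {
        final String surface;
        final String stem;
        final int start; // inclusive offset in the original string
        final int end;   // exclusive offset in the original string

        Token(String surface, String stem, int start, int end) {
            this.surface = surface;
            this.stem = stem;
            this.start = start;
            this.end = end;
        }

        @Override
        public String toString() {
            return surface + " -> " + stem + " [" + start + "," + end + ")";
        }
    }

    // A "word" here is simply a run of Unicode letters (an assumption).
    private static final Pattern WORD = Pattern.compile("\\p{L}+");

    /** Applies the stemmer to each lower-cased word, remembering where it came from. */
    static List<Token> analyze(String text, Function<String, String> stemmer) {
        List<Token> tokens = new ArrayList<>();
        Matcher m = WORD.matcher(text);
        while (m.find()) {
            String surface = m.group();
            String stem = stemmer.apply(surface.toLowerCase(Locale.ROOT));
            tokens.add(new Token(surface, stem, m.start(), m.end()));
        }
        return tokens;
    }

    /** Toy stand-in for a real stemmer: strips one trailing "s" from longer words. */
    static String toyStem(String word) {
        return word.length() > 3 && word.endsWith("s")
                ? word.substring(0, word.length() - 1)
                : word;
    }

    public static void main(String[] args) {
        for (Token t : analyze("Two cats sleep", SurfaceKeepingStemmer::toyStem)) {
            System.out.println(t);
        }
    }
}
```

Because the stemmer is passed in as a function, any real stemmer can be
swapped in without touching the offset bookkeeping.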

> However, most of the text processing code I work with is in Java, and if
> possible, I'd like to keep everything within Java.  Has anyone had any
> experience linking to Lttoolbox from Java?  Or does anyone know of any
> Java versions of Lttoolbox that utilize the existing dictionaries, or a
> similar tool for Java?

We can go one better: there's an implementation of lttoolbox (in fact,
of all of Apertium) in Java:
http://apertium.svn.sourceforge.net/viewvc/apertium/trunk/lttoolbox-java/

AFAIR, the interface closely follows the C++ version, so the input and
output are in the form of files, where you (presumably) want to pass
strings -- Jacob, our resident Java guru, has talked about adding a
string-based interface, though I'm not sure if he has had the time.
He's definitely the best person to talk to, in any case.
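
In the meantime, a thin adapter can give you string in, string out on top
of a stream-style interface. The sketch below is only an assumption about
the shape of such a wrapper: StreamAnalyzer is a hypothetical interface,
not the lttoolbox-java API, and upperCase is a toy stand-in for a real
analyzer so that the example runs on its own.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.io.UncheckedIOException;
import java.io.Writer;

// Hypothetical adapter: exposes a String -> String call on top of an
// analyzer that only knows how to read from and write to streams.
final class StringAnalyzerAdapter {

    /** Hypothetical stream-based interface; NOT taken from lttoolbox-java. */
    interface StreamAnalyzer {
        void analyze(Reader input, Writer output) throws IOException;
    }

    private final StreamAnalyzer delegate;

    StringAnalyzerAdapter(StreamAnalyzer delegate) {
        this.delegate = delegate;
    }

    /** Feeds the text through the delegate using in-memory streams. */
    String analyze(String text) {
        StringWriter out = new StringWriter();
        try (Reader in = new StringReader(text)) {
            delegate.analyze(in, out);
        } catch (IOException e) {
            // In-memory streams should not fail, but the delegate may throw.
            throw new UncheckedIOException(e);
        }
        return out.toString();
    }

    /** Toy stand-in for a real analyzer: copies the input in upper case. */
    static void upperCase(Reader in, Writer out) throws IOException {
        int c;
        while ((c = in.read()) != -1) {
            out.write(Character.toUpperCase(c));
        }
    }

    public static void main(String[] args) {
        StringAnalyzerAdapter adapter =
                new StringAnalyzerAdapter(StringAnalyzerAdapter::upperCase);
        System.out.println(adapter.analyze("cats"));
    }
}
```

If the real code only accepts file names rather than Reader/Writer
objects, the same idea would need temporary files instead of in-memory
streams.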


-- 
<Sefam> Are any of the mentors around?
<jimregan> yes, they're the ones trolling you

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff