Hi William,
in its current state, hunspell has some limitations which make this
solution imperfect. First, in some languages only all flags of the base
form of the word determine the part of speech information (two genders
can share the same affixes, for example). Hunspell is unable to
distinguish that at the moment. Second, there is no support for stemming
and hunstem in OOo (yet).
Nemeth Laszlo is planning to remove (1) but due to quite lengthy process
of adding any feature to OOo it's hard to say when (2) will be a thing
of the past. So all you can do is to write a hunspell dictionary parser,
and implement the functions in Perl, or try JMorph (Java hunspell
stemming port, available somewhere on Laci's university website, dig in
the archives of his group for the link). You are using Java interface
anyway in CoGrOO, so that should be a good temporary solution. Note that
this Java interface needs some fixing (crashes on some dictionaries) but
is almost a complete port.
Best,
Marcin
William Silva napisał(a):
Hi,
Have anyone tried to use Hunspell for morphological analysis? In our
grammar
checker development (CoGrOO), we are using a morphological dictionary we
wrote. But it is big, especially because we didn't care about redundant
data, this would be solved using the Hunspell affixes, and even better, the
grammar checker and spell checker would share the same dictionary.
Does anyone know if the OOo make any interface available to its
Hunspell, so
a grammar checker could use this interface to query the dictionaries?
Thanks!
William
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]