Hi Andriy,

On 2013-11-19 22:47, Andriy Rysin wrote:
> I am thinking of adding rules to Ukrainian that would check whether
> related words agree in case, gender, etc. There are several primary
> cases for this:
> 1) making sure an adjective and its noun have the same case, gender, etc.
> 2) making sure the noun's/adjective's gender or number matches that of the verb
> 3) making sure an adjective and/or noun is in the right case if it follows
> a preposition or a verb that requires some particular case
>
> I was thinking about 3) for a bit and it looks like it'll be too hard
> to implement in grammar.xml: we have 7 cases in Ukrainian, some
> prepositions allow several cases for the following
> nouns/adjectives, and some nouns are indeclinable (currently I just
> mark them as such in the dictionary instead of listing the same word
> 7 times, which may be the more correct way to go).
>
> So I was going to take a shot at doing this in Java (and I guess along
> the way I'll see if it makes sense to follow a similar pattern for 1 and
> 2). But before I start I wanted to double-check that there's no
> good/common/existing way of doing things like that.

You might look at unification. Basically, we are able to find 
non-agreeing words very easily if they are POS-tagged appropriately.

http://wiki.languagetool.org/using-unification
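
To make the idea concrete, here is a minimal sketch in plain Java of what
unification boils down to: each token carries a set of possible readings,
and the tokens agree if at least one reading is shared by all of them. This
is not LanguageTool's actual API; the class name, the unify method and the
"case:number:gender" tag strings are all made up for illustration.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class AgreementSketch {

    // Returns the readings shared by every token.
    // An empty result means the tokens cannot agree.
    // Assumes the list holds at least one token.
    static Set<String> unify(List<Set<String>> tokenReadings) {
        Set<String> shared = new HashSet<>(tokenReadings.get(0));
        for (Set<String> readings : tokenReadings) {
            shared.retainAll(readings);
        }
        return shared;
    }

    public static void main(String[] args) {
        // Hypothetical tags in "case:number:gender" form, not a real tagset.
        Set<String> adjective = Set.of("nom:sg:f");
        Set<String> ambiguousNoun = Set.of("nom:sg:f", "gen:pl:f");
        Set<String> dativeNoun = Set.of("dat:sg:f");

        // The ambiguous noun still agrees: one reading survives.
        System.out.println(unify(List.of(adjective, ambiguousNoun)));

        // No common reading, so a rule would flag this pair.
        System.out.println(unify(List.of(adjective, dativeNoun)).isEmpty());
    }
}
```

Note that an ambiguous word does not cause an alarm as long as one of its
readings is compatible, which is why the quality of the POS tagging matters
so much here.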

Note, however, that in Polish this would generate zillions of false 
alarms: a sentence may contain several noun phrases in different 
grammatical cases, and such a check would also match every phrase 
boundary (the last word of one phrase and the first word of the next). 
So you should mark up the phrases ("chunks") first. I want to add chunk 
marking to the disambiguator, but I don't have time to do that 
currently. It is definitely possible to mark chunks in Polish using 
unification (with a few additions).
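
A rough sketch of the phrase-boundary problem, again in plain Java and not
tied to LanguageTool's classes. The Token class, the chunk ids and the tags
are hypothetical, and the readings are deliberately simplified. The tokens
come from a Polish sentence meaning "I showed a new colleague the old
house": "nowemu koledze" is a dative phrase, "stary dom" an accusative one.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Set;

public class ChunkAgreementSketch {

    // Hypothetical token: surface form, possible readings, id of its chunk.
    static final class Token {
        final String word;
        final Set<String> readings;
        final int chunk;

        Token(String word, Set<String> readings, int chunk) {
            this.word = word;
            this.readings = readings;
            this.chunk = chunk;
        }
    }

    // Reports adjacent word pairs that share no reading.
    // With respectChunks set, pairs that straddle a chunk boundary are skipped.
    static List<String> disagreements(List<Token> tokens, boolean respectChunks) {
        List<String> alarms = new ArrayList<>();
        for (int i = 0; i + 1 < tokens.size(); i++) {
            Token left = tokens.get(i);
            Token right = tokens.get(i + 1);
            if (respectChunks && left.chunk != right.chunk) {
                continue;
            }
            if (Collections.disjoint(left.readings, right.readings)) {
                alarms.add(left.word + " " + right.word);
            }
        }
        return alarms;
    }

    public static void main(String[] args) {
        // Simplified readings in "case:number:gender" form (an assumption).
        List<Token> tokens = List.of(
            new Token("nowemu", Set.of("dat:sg:m"), 1),
            new Token("koledze", Set.of("dat:sg:m", "loc:sg:m"), 1),
            new Token("stary", Set.of("nom:sg:m", "acc:sg:m"), 2),
            new Token("dom", Set.of("nom:sg:m", "acc:sg:m"), 2));

        // Naive check: flags the boundary pair although the sentence is fine.
        System.out.println(disagreements(tokens, false));

        // Chunk-aware check: the boundary pair is never compared.
        System.out.println(disagreements(tokens, true));
    }
}
```

The naive pass reports "koledze stary", which is exactly the kind of false
alarm described above; once the chunk ids are available, the same check
stays silent on this sentence.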

Speaking of which, is there a real POS dictionary for Ukrainian in LT? I 
thought you had only a lemmatiser. Maybe we could integrate UGTagger after all?

Regards,
Marcin
