Simon Brouwer wrote:

Here's the link to the web crawler/collocation finder, GPL:

http://www.mimuw.edu.pl/polszczyzna/kolokacje/index-en.htm
Great, thanks for the link!

You're welcome ;) A word of warning: you need lots of heap space to have it running and a very, very large corpus to get significant results. I haven't tried it with larger corpora (like 30M words) so caveat emptor (it really eats up memory)! With smaller corpora, you would get too much accidental garbage like proper names etc.

Best,
Marcin

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to