On Tue, 03 Feb 2004 09:27:25 +0100
Andrzej Bialecki <[EMAIL PROTECTED]> wrote:

> 
> A question: what was your source for the representative hi-frequency 
> words in various languages? Was it your training corpus or some publication?

I use the data supplied with Gertjan van Noord:s TextCat distribution.

http://odur.let.rug.nl/~vannoord/TextCat/


-- 

karl

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to