On Tue, 03 Feb 2004 09:27:25 +0100 Andrzej Bialecki <[EMAIL PROTECTED]> wrote:
> > A question: what was your source for the representative hi-frequency > words in various languages? Was it your training corpus or some publication? I use the data supplied with Gertjan van Noord:s TextCat distribution. http://odur.let.rug.nl/~vannoord/TextCat/ -- karl --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]