Hi Guillaume,

> I just have one question :
> 
>> Also with the current algorithm implemented you must NEVER pass a
>> sub-string of different language to the guesser! It will just result in
>> getting some rather random guesses that usually are of no use at all.
>> Thus sticking to sentence boundaries is recommended because of this as well.
>> 
> 
> I don't understand what you mean here. Did you mean that I mustn't give the 
> guesser a multilingual string ?

Yes.
If you pass the guesser a text that mixes, e.g., French and Swedish,
don't be surprised if the language it returns is something completely
different...
Mixing languages confuses the (current) implementation, which relies on
counting 3-gram occurrences.
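
To illustrate the idea, here is a minimal Python sketch of a guesser that
counts 3-grams. It is not the actual implementation: the function names,
the scoring rule and the tiny training sentences are all made up for this
example.

```python
from collections import Counter

def trigrams(text):
    """Count overlapping 3-character sequences in lower-cased text."""
    t = " ".join(text.lower().split())  # normalise whitespace
    return Counter(t[i:i + 3] for i in range(len(t) - 2))

def score(sample, profile):
    """Count the sample's 3-gram occurrences that the profile also knows."""
    return sum(n for gram, n in sample.items() if gram in profile)

def guess(text, profiles):
    """Return the language whose profile shares the most 3-grams."""
    sample = trigrams(text)
    return max(profiles, key=lambda lang: score(sample, profiles[lang]))

# Toy profiles (assumption: a real guesser is trained on far more text
# and knows many more languages).
profiles = {
    "fr": trigrams("le chat est sur la table et il mange du pain"),
    "sv": trigrams("katten sitter på bordet och äter bröd"),
}

french = "le chat mange"
swedish = "katten äter bröd"
mixed = french + " " + swedish

print(guess(french, profiles), guess(swedish, profiles))

# The mixed string collects hits from both profiles at once.
for lang in profiles:
    print(lang, score(trigrams(mixed), profiles[lang]))
```

Each single-language string matches only its own profile, while the mixed
string scores against both. With many similar profiles loaded, such split
counts can let an unrelated language come out on top, so guessing one
sentence at a time is the safer approach.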

Thomas

