On Mon, Jun 22, 2015 at 05:09:45PM -0400, Charles Sprickman wrote: > Are there any other options for filtering based on language, or any known > patches/fixes for TextCat to make it a bit less aggressive when it runs > across gibberish that is probably not any particular language?
You could tinker with textcat_acceptable_score. Increasing it slightly (e.g. back to the old default of 1.05) seems to reduce those wild guesses. Regards, Marc