Hi Alexey,

On Mon, 22 Apr 2024, at 7:43 PM, [email protected] wrote:
> Hello all.
> Looks like, Squat search engine for non-ascii texts is heavily broken in 
> recent Cyrus. The logic of damage is completely unclear for me:
> - search for single cyrillic character returns nothing (while nearly all 
> messages definitely contain this character)
> - search for longer random seguence of cyrillic characters returns lots of 
> findings (while messages definitely cannot contain this non-existent word)
> - search for a real word returns both real and false findings.

I don't suppose you know of a version that it used to work correctly in?

> Can anyone confirm?
> Cyrus IMAP 3.6.1 Debian 12, default encoding UTF-8. Client programs 
> Thunderbird and Sogo.

3.6.4 came out a little while ago with a handful of fixes to some long-standing 
Squat bugs (among other things).  The Squat bugs weren't specifically about 
non-ascii texts, but they were about index corruption and bad search results, 
so it's possible there was some overlap that's been fixed.

Having a look, there doesn't seem to be a Debian package for 3.6.4 yet (but 
maybe I'm not looking correctly)

> Resolved by upgrade to Xapian search engine.

Good choice.  The Xapian engine is actively maintained, whereas Squat has been 
neglected for a while...

Cheers,

ellie
------------------------------------------
Cyrus: Info
Permalink: 
https://cyrus.topicbox.com/groups/info/Ta1f0b0500304ec85-M9e59ccba3d59aeb9552a682e
Delivery options: https://cyrus.topicbox.com/groups/info/subscription

Reply via email to