Your message dated Thu, 2 Aug 2018 05:03:13 +0100 with message-id <[email protected]> and subject line Re: www.debian.org: Website search box unhelpful for common names (e.g. Buster) in certain character sets has caused the Debian Bug report #905126, regarding www.debian.org: Website search box unhelpful for common names (e.g. Buster) in certain character sets to be marked as done.
This means that you claim that the problem has been dealt with. If this is not the case it is now your responsibility to reopen the Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this message is talking about, this may indicate a serious mail system misconfiguration somewhere. Please contact [email protected] immediately.)

-- 
905126: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=905126
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: www.debian.org
Severity: normal

A number of search languages end up with no results for contextually common search terms, for example "debian" or "buster".

To reproduce:
- use the search box for the term "buster" in English. There are a number of results including release information, news items and errata.
- set the language to Vietnamese, Chinese or similar and search again
- there are no results.

I assume that this is an issue with translations into non-Latin character sets without hint words nearby the translated word.

-- System Information:
Debian Release: 9.5
  APT prefers stable
  APT policy: (990, 'stable')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 4.16.0-0.bpo.2-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_GB.utf8, LC_CTYPE=en_GB.utf8 (charmap=UTF-8), LANGUAGE=en_GB:en (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
--- End Message ---
--- Begin Message ---
On Thu, Aug 02, 2018 at 01:04:25AM +0100, Olly Betts wrote:
> I'm not sure how the stemmer mapping file is generated, but I'll look
> into it today if I can. I think we should be able to just specify a
> default of "none" but I suspect this file is generated so I need to
> fix the script not just the current output.

It's generated by /srv/search.debian.org/bin/gen-stemmer.sh - I've fixed that to generate $set{stemmer,none} before anything else, run it by hand, and now I can search for "buster" in "chinese-china" and "vietnamese", and stemming is still enabled for "english".

This should fix all cases, but please reopen if not.

Cheers,
    Olly
--- End Message ---
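
The fix described in the message above (emit a default of "none" first, then let per-language entries override it) could be sketched roughly as follows. This is not the real gen-stemmer.sh, whose contents are not shown in this report: the function name gen_stemmer, the language list, and the use of a "lang" CGI parameter are all assumptions made for illustration. Only $set{stemmer,none} comes from the message itself; $if, $eq and $cgi are standard OmegaScript commands.

```shell
#!/bin/sh
# Hypothetical sketch, NOT the actual gen-stemmer.sh.
# Prints an OmegaScript fragment that selects a stemmer per site language.

gen_stemmer() {
    # Default first: any language without an entry below is searched
    # unstemmed, so terms like "buster" are not stemmed with the wrong rules.
    printf '%s\n' '$set{stemmer,none}'

    # Assumed "site-language:stemmer" pairs; the real list is unknown.
    for pair in english:english french:french german:german; do
        lang=${pair%%:*}       # text before the first colon
        stemmer=${pair#*:}     # text after the first colon
        # "lang" as the CGI parameter name is an assumption.
        printf '$if{$eq{$cgi{lang},%s},$set{stemmer,%s}}\n' "$lang" "$stemmer"
    done
}

gen_stemmer
```

Because the default line is printed before any override, a language such as "vietnamese" or "chinese-china" that has no entry in the list falls through to stemmer "none", while "english" still gets its own stemmer.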

