Your message dated Sat, 1 Aug 2009 10:13:37 +0100
with message-id <[email protected]>
and subject line Re: Bug#539441: xapian-omega: capitalization required
has caused the Debian Bug report #539441,
regarding xapian-omega: capitalization required
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)


-- 
539441: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=539441
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: xapian-omega
Version: 1.0.13-1
Severity: normal


Hello,

here's an example that my problem should illustrate.

The first with the correct spelling „trennen“:
/usr/lib/cgi-bin/omega/omega 'P=trennen' HITSPERPAGE=1 'DB=default' | grep 
'matches'  
 (no matches)

/usr/lib/cgi-bin/omega/omega 'P=Trennen' HITSPERPAGE=1 'DB=default' | grep 
'matches'
  1 of exactly 33 matches

/usr/lib/cgi-bin/omega/omega 'P="trennen"' HITSPERPAGE=1 'DB=default' | grep 
'matches'
  1 of exactly 33 matches

/usr/lib/cgi-bin/omega/omega 'P=TRENNEN' HITSPERPAGE=1 'DB=default' | grep 
'matches'
  1 of exactly 33 matches

To illustrate again with the English Word "love":

/usr/lib/cgi-bin/omega/omega 'P=love' HITSPERPAGE=1 'DB=default' | grep 
'matches'
 (no matches)

/usr/lib/cgi-bin/omega/omega 'P=Love' HITSPERPAGE=1 'DB=default' | grep 
'matches'
  1 of exactly 18 matches

The database I have created, whether with "stemmer=german" or "stemmer=none" 
does not difference:

omindex --stemmer=german --db /var/lib/xapian-omega/data/default --url 
http://localhost /var/www/

The same also applies to the version 1.0.7-3.


With best regards

-- System Information:
Debian Release: 5.0.2
  APT prefers stable
  APT policy: (2000, 'stable'), (600, 'testing'), (500, 
'testing-proposed-updates'), (500, 'proposed-updates')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.28-1-amd64 (SMP w/2 CPU cores)
Locale: LANG=de_DE.utf8, LC_CTYPE=de_DE.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages xapian-omega depends on:
ii  libc6                        2.7-18      GNU C Library: Shared libraries
ii  libgcc1                      1:4.3.2-1.1 GCC support library
ii  libstdc++6                   4.3.2-1.1   The GNU Standard C++ Library v3
ii  libxapian15                  1.0.13-3    Search engine library

Versions of packages xapian-omega recommends:
ii  apache2-mpm-prefork [htt 2.2.9-10+lenny4 Apache HTTP Server - traditional n
ii  lighttpd [httpd-cgi]     1.4.22-1        A fast webserver with minimal memo

Versions of packages xapian-omega suggests:
ii  antiword           0.37-3                Converts MS Word files to text and
ii  catdoc             0.94.2-1              MS-Word to TeX or plain text conve
ii  catdvi             0.14-10               DVI to plain text translator
ii  djvulibre-bin      3.5.20-8+lenny1       Utilities for the DjVu image forma
ii  ghostscript        8.62.dfsg.1-3.2lenny1 The GPL Ghostscript PostScript/PDF
ii  libwpd-tools       0.8.14-1              Tools from libwpd for converting W
ii  libwps-tools       0.1.2-1               Tools from libwps for converting W
ii  unrtf              0.19.3-1.1            RTF to other formats converter
ii  unzip              5.52-12               De-archiver for .zip files
ii  xpdf-utils         3.02-1.4+lenny1       Portable Document Format (PDF) sui

-- no debconf information



--- End Message ---
--- Begin Message ---
On Fri, Jul 31, 2009 at 11:01:13PM +0200, Alex Wilk wrote:
> The database I have created, whether with "stemmer=german" or
> "stemmer=none" does not difference:

You need to configure omega to use the same stemmer at search time as
you told omindex to use at index time, with e.g. $set{stemmer,german}
in your query template.

Capitalised query terms are searched for "as is", which is why they work
even with mismatched stemmer settings.

Cheers,
    Olly


--- End Message ---

Reply via email to