On Wed, 2002-03-13 at 18:58, Gilles Detillieux wrote:
> According to Stefan Wold:
> > On Wed, 2002-03-13 at 01:07, Gilles Detillieux wrote:
> > > According to Stefan Wold:
> > > > I'm running htdig 3.1.6 on Linux. When I use rundig to create the
> > > > database for a website, it indexes it correctly except that it doesn't take
> > > > ANY foreign chars (Swedish) at all; neither åäö nor ÅÄÖ can be found in the
> > > > db.wordlist. It seems to skip the whole word if it contains a Swedish
> > > > char. I have tried different locale settings before running rundig,
> > > > without any luck.
> > > >
> > > > Anyone had this kind of problem?
> > >
> > > Lots of people do! See http://www.htdig.org/FAQ.html#q5.8
> > > I added a couple paragraphs to it this morning.
> >
> > Well, I have done everything by the book, and the testlocale program shows
> > the correct chars. I have tried recompiling a few locales as well, without
> > any luck. After setting sv_SE.ISO-8859-1 as the locale I reran rundig,
> > but it still fails to index Swedish chars. I'm currently out of clues =)
>
> What Linux distribution are you running htdig on? Is it libc5 or glibc
> based? Does the /usr/share/locale/sv_SE directory contain LC_CTYPE and
> several other LC_* files?
>
> When you run testlocale, what output does it give for characters such as
> Å, Ä, Ö, å, ä and ö? Are they flagged as -a-un--gt---? and -al-n--gt---?
> Did you try testlocale using both LC_CTYPE and LC_ALL as the first argument
> to the setlocale() function?
>
> --
> Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
> Dept. Physiology, U. of Manitoba Phone: (204)789-3766
> Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
>
It's a modified Red Hat 7.1, so yes, I'm using glibc.
As I said, I have also tried to recompile the locale with
"localedef -c -f charmaps/ISO-8859-1 -i locales/sv_SE
/usr/share/locale/sv_SE"
The following can be found under /usr/share/locale/sv_SE
(the same files also exist under /usr/share/locale/sv):
totalt 232
-rw-r--r-- 2 root root 136 mar 13 16:04 LC_ADDRESS
-rw-r--r-- 2 root root 15911 mar 13 16:04 LC_COLLATE
-rw-r--r-- 2 root root 172916 mar 13 16:04 LC_CTYPE
-rw-r--r-- 2 root root 294 mar 13 16:04 LC_IDENTIFICATION
-rw-r--r-- 2 root root 32 mar 13 16:04 LC_MEASUREMENT
drwxr-xr-x 2 root root 4096 mar 15 11:48 LC_MESSAGES
-rw-r--r-- 2 root root 299 mar 13 16:04 LC_MONETARY
-rw-r--r-- 2 root root 71 mar 13 16:04 LC_NAME
-rw-r--r-- 2 root root 63 mar 13 16:04 LC_NUMERIC
-rw-r--r-- 2 root root 43 mar 13 16:04 LC_PAPER
-rw-r--r-- 2 root root 66 mar 13 16:04 LC_TELEPHONE
-rw-r--r-- 2 root root 2260 mar 13 16:04 LC_TIME
testlocale reports this for ÅÄÖ and åäö:
196 0xC4: Ä -a-un--gt---?
197 0xC5: Å -a-un--gt---?
214 0xD6: Ö -a-un--gt---?
228 0xE4: ä -al-n--gt---?
229 0xE5: å -al-n--gt---?
246 0xF6: ö -al-n--gt---?
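
As a cross-check on the rows above, here is a minimal sketch that decodes the same six byte values as ISO-8859-1 and reports how each character is classified. It is not the testlocale program from the FAQ: the helper names (describe, table) are made up for illustration, and Python's string methods use Unicode tables rather than the C library's locale data. It therefore only shows what a correct sv_SE.ISO-8859-1 locale would be expected to report (upper-case letters for the first three, lower-case letters for the last three), which seems to match the "u" and "l" in the flag strings.

```python
# Byte values copied from the testlocale rows in this message.
CODES = [0xC4, 0xC5, 0xD6, 0xE4, 0xE5, 0xF6]


def describe(code):
    """Decode one ISO-8859-1 byte and classify the resulting character.

    Hypothetical helper: classification comes from Python's Unicode
    tables, not from the C locale that htdig would consult.
    """
    ch = bytes([code]).decode("iso-8859-1")
    return (code, ch, ch.isalpha(), ch.isupper(), ch.islower(), ch.isalnum())


def table(codes=CODES):
    """Return one classification tuple per byte value."""
    return [describe(c) for c in codes]


if __name__ == "__main__":
    for code, ch, alpha, upper, lower, alnum in table():
        print(f"{code:3d} 0x{code:02X}: {ch} "
              f"alpha={alpha} upper={upper} lower={lower} alnum={alnum}")
```

If the characters printed here match the ones testlocale shows, the locale's LC_CTYPE data is classifying these bytes as expected, and the problem is more likely in how the locale reaches htdig.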
When using rundig I have tried with both LC_CTYPE and LC_ALL.
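
For anyone who wants to repeat the LC_CTYPE versus LC_ALL check quickly, the sketch below asks the C library whether it accepts a locale name for each category. It uses Python's standard locale module, which wraps setlocale(); the helper name can_set is made up for illustration, and sv_SE.ISO-8859-1 is the locale name from this thread. A True result only means the C library accepted the name. It does not prove that htdig ends up using that locale.

```python
import locale


def can_set(category, name):
    """Return True if the C library accepts `name` for `category`.

    Hypothetical helper: the previous setting is saved first and
    restored afterwards, so the probe leaves the process locale unchanged.
    """
    saved = locale.setlocale(category)  # no second argument: query only
    try:
        locale.setlocale(category, name)
        return True
    except locale.Error:
        # setlocale() rejected the name (locale missing or misspelled).
        return False
    finally:
        locale.setlocale(category, saved)


if __name__ == "__main__":
    name = "sv_SE.ISO-8859-1"  # locale name used in this thread
    for label, category in (("LC_CTYPE", locale.LC_CTYPE),
                            ("LC_ALL", locale.LC_ALL)):
        print(f"{label}: {name} accepted = {can_set(category, name)}")
```

If either category reports False, the locale is not installed under that exact name on the machine, and recompiling it or trying another spelling of the name would be the next thing to check.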
--
Med vänliga hälsningar / Sincerely
Stefan Wold Vxl: +46 8 56311000 Song Networks AB
Staff, Sysadmin UNIX Fax: +46 8 56311010 AGA / Dalénum, Hus 112, 3TR
Mob: +46 701 880093 181 70 Lidingö
Tel: +46 8 56311093 Sweden
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html