Hello All,

> > I have 100,000+ small HTML files that are mainly in the
> > english language. I
> > just noticed that we have some user names with umlauts. These
> > are seemingly
> > stored and searchable as the '?' character.

This appears to be a Solaris thing. I develop under Solaris 9 and then burn 
my application onto a multi-platform CD (Unix/Win/Mac). It is only when I 
run the application under Solaris that the umlauts appear as the '?' 
character. On all other platforms the characters are correctly displayed. 
All platforms are running Java 1.3.1.

Two questions :-

1) Has anyone any experience with such behaviour ? (Apologies for the 
non-lucene content)

2) How to search on text containing umlauts ? At the moment a search on   
"j�rgen" returns no hits, but a search on "rgen" will return posts by user 
J�rgen.

Thanks

IAP

_________________________________________________________________
Send and receive Hotmail on your mobile device: http://mobile.msn.com


--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to