there must be something seriously broken in the query parser code.

if a query starts with ø, æ or å then an exception occurs in the
query parser:

org.apache.lucene.queryParser.TokenMgrError: Lexical error at line 1, column 1.  Encountered: "\u00c3" (195), after : ""
        at org.apache.lucene.queryParser.QueryParserTokenManager.getNextToken(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.jj_ntk(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.Modifiers(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.Query(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
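
One possible explanation for the character reported in the lexical error, offered as an assumption rather than something confirmed in this thread: U+00C3 (decimal 195) is the first character you get when the UTF-8 bytes of ø, æ or å are decoded as ISO-8859-1. The sketch below reproduces that mis-decoding without involving Lucene at all. The class and method names are made up for illustration, and `java.nio.charset.StandardCharsets` requires Java 7 or later.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; not part of Lucene.
class MisdecodeDemo {

    // Encode as UTF-8, then (wrongly) decode the same bytes as ISO-8859-1.
    static String misdecode(String s) {
        byte[] utf8 = s.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // ø, æ, å written as escapes so the source file encoding does not matter
        for (String s : new String[] {"\u00f8", "\u00e6", "\u00e5"}) {
            String bad = misdecode(s);
            System.out.printf("U+%04X -> first char U+%04X (%d), length %d%n",
                    (int) s.charAt(0), (int) bad.charAt(0),
                    (int) bad.charAt(0), bad.length());
        }
    }
}
```

For all three inputs the first mis-decoded character is U+00C3 (195), the same value shown in the error message, and each single character turns into two.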

but if the query contains ø, æ or å then the character is wrongly
translated into the Swedish/German ä, regardless of which character
it was.
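
If the cause is a charset mismatch (an assumption, not something established here), with the query string arriving as UTF-8 bytes but being decoded as ISO-8859-1 before it reaches the query parser, then the damage would look the same for every one of these characters and would be reversible. A minimal sketch of that idea, with hypothetical class and method names and using `StandardCharsets` (Java 7 or later):

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper; names are illustrative only.
class QueryRepairDemo {

    // Simulates the suspected fault: UTF-8 bytes read as ISO-8859-1.
    static String garble(String s) {
        return new String(s.getBytes(StandardCharsets.UTF_8),
                StandardCharsets.ISO_8859_1);
    }

    // Reverses it: recover the original bytes, then decode them as UTF-8.
    static String repair(String garbled) {
        return new String(garbled.getBytes(StandardCharsets.ISO_8859_1),
                StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String query = "fj\u00f8s"; // "fjøs"
        String garbled = garble(query);
        // The garbled form is one character longer than the original.
        System.out.println(garbled.length() - query.length());
        // Repairing the garbled form gives back the original query.
        System.out.println(repair(garbled).equals(query));
    }
}
```

This only helps if the mis-decoding really is UTF-8 read as ISO-8859-1; it does not explain a character being dropped, so it should be treated as a diagnostic experiment rather than a fix.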

if someone could point me to where to start, I could try to find the problem,
because I guess it is an erroneous unicode translation...


regards, karl



>no it's even stranger than that, i have decoded the querystring, the problem
>is that it seems like something is changed on the way in. if i search for
>"fjøs" i get the Swedish "fjÄ", where ø is changed to Ä and 's' is removed.
>
>is the querystring translated somewhere?
>
>regards, karl øie
>  -----Original Message-----
>  From: David Bonilla [mailto:[EMAIL PROTECTED]]
>  Sent: 27. november 2001 10:43
>  To: Lucene Users List; [EMAIL PROTECTED]
>  Subject: Re: scandinavian characters.
>
>
>  Hi Karl !!!
>
>  I'm Spanish and I have had a lot of problems programming with our non-English
>characters. I use LUCENE with Spanish accents and it works fine...
>
>  Have you tried using java.net.URLEncoder and java.net.URLDecoder with
>your fields to index?
>
>  Best Regards from Spain !
>  __________________________
>  David Bonilla Fuertes
>  THE BIT BANG NETWORK
>  http://www.bit-bang.com
>  Profesor Waksman, 8, 6º B
>  28036 Madrid
>  SPAIN
>  Tel.: (+34) 914 577 747
>  Móvil: 656 62 83 92
>  Fax: (+34) 914 586 176
>  __________________________
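
As a rough illustration of the java.net.URLEncoder / java.net.URLDecoder suggestion quoted above, the sketch below encodes a query with an explicit charset and decodes it twice, once with the matching charset and once with a mismatched one. The class and method names are hypothetical, and the two-argument `encode`/`decode` overloads used here were only added in Java 1.4, so they may not have been available in the environment discussed in this thread.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLDecoder;
import java.net.URLEncoder;

// Hypothetical demo class; not taken from the thread.
class UrlCodecDemo {

    // URL-encode using UTF-8 for the non-ASCII characters.
    static String encodeUtf8(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e);
        }
    }

    // URL-decode using the named charset.
    static String decode(String s, String charsetName) {
        try {
            return URLDecoder.decode(s, charsetName);
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String query = "fj\u00f8s"; // "fjøs"
        String encoded = encodeUtf8(query);
        System.out.println(encoded);
        // Matching charsets: the original query comes back.
        System.out.println(decode(encoded, "UTF-8").equals(query));
        // Mismatched charsets: the query comes back garbled.
        System.out.println(decode(encoded, "ISO-8859-1").equals(query));
    }
}
```

The point of the comparison is that encoding and decoding only round-trip when both sides agree on the charset; whether such a mismatch is what happens on the way into the query parser is not established here.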




