When I search an index for a word that starts with an eight-bit
(non-ASCII) character, I get:


Query: åsnan
Exception in thread "main" org.apache.lucene.queryParser.TokenMgrError: Lexical error at line 1, column 1.  Encountered: "\u00e5" (229), after : ""
        at org.apache.lucene.queryParser.QueryParserTokenManager.getNextToken(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.jj_ntk(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.Modifiers(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.Query(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
        at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
        at Lucsearch.main(Lucsearch.java:36)



(My class "Lucsearch" is a slightly modified version of the demo
class "SearchFiles.java".)

It works fine when the problematic character is later in the word.
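
To illustrate the two cases, here is a minimal standalone sketch in plain
Java. It is not Lucene code: the class name QueryCharCheck, the helper
firstNonAscii, and the second example word are made up for illustration.
It only reports where the first character above 7-bit ASCII sits in each
query string and prints the code of the leading character of the failing
query, which should match the 229 shown in the exception.

```java
public class QueryCharCheck {

    // Returns the index of the first character above 7-bit ASCII,
    // or -1 if the string contains only ASCII characters.
    public static int firstNonAscii(String s) {
        for (int i = 0; i < s.length(); i++) {
            if (s.charAt(i) > 0x7F) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        // The query from the report; \u00e5 is written as an escape so the
        // result does not depend on the source file encoding.
        String failing = "\u00e5snan";
        // Hypothetical word with the same character in a later position.
        String working = "b\u00e5t";

        System.out.println("failing query: first non-ASCII index = "
                + firstNonAscii(failing)
                + ", code = " + (int) failing.charAt(0));
        System.out.println("working query: first non-ASCII index = "
                + firstNonAscii(working));
    }
}
```

An index of 0 corresponds to the case that fails at "line 1, column 1";
any later index corresponds to the case that parses without error.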

Any ideas?

/Stefan Bergstrand
