When I search in an index for a word that starts with an eight-bit
character I get:
Query: �snan
Exception in thread "main" org.apache.lucene.queryParser.TokenMgrError: Lexical error
at line 1, column 1. Encountered: "\u00e5" (229), after : ""
at org.apache.lucene.queryParser.QueryParserTokenManager.getNextToken(Unknown
Source)
at org.apache.lucene.queryParser.QueryParser.jj_ntk(Unknown Source)
at org.apache.lucene.queryParser.QueryParser.Modifiers(Unknown Source)
at org.apache.lucene.queryParser.QueryParser.Query(Unknown Source)
at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
at org.apache.lucene.queryParser.QueryParser.parse(Unknown Source)
at Lucsearch.main(Lucsearch.java:36)
(My class "Lucsearch", is a slightly modified version of the demo
class "SearchFiles.java".)
It works fine when the problematic character is later in the word.
Any ideas?
/Stefan Bergstrand
--
To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>