Thanks, Thomas.
Zend_Search_Lucene currently treats the character set of the current
locale as the encoding of input strings.
It transliterates input with "iconv('', 'ASCII//TRANSLIT', $stringValue);".
Because the source charset argument is empty, iconv takes the input
encoding from the current locale.
The locale can be set with "setlocale(LC_CTYPE, $myInputEncoding);"
The message "Detected an illegal character in input string" means that
the current locale doesn't match the actual encoding of the string.
Please check it.
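A minimal sketch of that check, assuming the input text is UTF-8. The helper name selectInputLocale() and the candidate locale names are only illustrations, not part of Zend_Search_Lucene; which locales are installed depends on the system, so the setlocale() return value has to be checked. The source file itself must be saved as UTF-8.

```php
<?php
// Hypothetical helper (not part of Zend_Search_Lucene): try candidate
// locale names until setlocale() accepts one. Returns the locale string
// that was set, or false if none of the candidates is installed.
function selectInputLocale(array $candidates)
{
    foreach ($candidates as $name) {
        $result = setlocale(LC_CTYPE, $name);
        if ($result !== false) {
            return $result;
        }
    }
    return false;
}

// Assumed locale names; adjust them to whatever the system provides.
$locale = selectInputLocale(array('en_US.UTF-8', 'en_US.utf8', 'C.UTF-8'));

if ($locale === false) {
    echo "No UTF-8 locale available; iconv may reject UTF-8 input.\n";
} else {
    // The same call quoted above: the empty source charset makes iconv
    // fall back to the charset of the current locale.
    $ascii = @iconv('', 'ASCII//TRANSLIT', 'Tiếng Việt');
    var_dump($ascii); // false here would mean the conversion still failed
}
```

If setlocale() returns false for every candidate, the locale is left unchanged, and that situation would produce the "illegal character" notice for non-ASCII input.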
The notice "Undefined property...
Zend_Search_Lucene_Search_QueryLexer::$_currentCharIndex..." is caused by
a bug in the code that builds the exception message. I have just fixed it.
With best regards,
Alexander Veremyev.
Thomas Weidner wrote:
Zend_Search is not part of the I18N core... so I copied this request to
the general mailing list
Greetings
Thomas
----- Original Message -----
From: "johnpham" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Saturday, January 06, 2007 4:00 AM
Subject: [fw-i18n] Zend_search shows illigal characters
I followed the tutorial at
http://framework.zend.com/manual/en/zend.search.charset.html#zend.search.charset.description
to create a search engine in a different language, specifically
Vietnamese.
However, when I ran a reindex action, it showed an error:
Notice: iconv() [function.iconv]: Detected an illegal character in input
string
whenever it converted a field that contained Vietnamese characters.
Somehow it still produced 5 files in my index directory:
1. _0 (a CSF file)
2. _1 (a CSF file)
3. deletable
4. index.Lock
5. segments
I assumed the indexing part "worked" and typed some Vietnamese text
into the search field in the view. It then showed an error:
Notice: Undefined property:
Zend_Search_Lucene_Search_QueryLexer::$_currentCharIndex in
QueryLexer.php
on line 477
Does anyone know why this happens and how to fix it? I'd very much
appreciate any help.
Thank you,
--
View this message in context:
http://www.nabble.com/Zend_search-shows-illigal-characters-tf2929424s16154.html#a8190192
Sent from the Zend I18N/Locale mailing list archive at Nabble.com.