Thanks, Thomas.

Zend_Search_Lucene uses current locale as an input string encoding.

It uses "iconv('', 'ASCII//TRANSLIT', $stringValue);" to transform input string now.
So iconv takes current locale as an input string encoding.
It can be set with "setlocale(LC_CTYPE, $myInputEncoding);"

The message "Detected an illegal character in input string" means, that current locale doesn't match real string encoding. Check it please.


Notice "Undefined property... Zend_Search_Lucene_Search_QueryLexer::$_currentCharIndex..." is a bug in exception message constructing. I just fixed it.


With best regards,
   Alexander Veremyev.


Thomas Weidner wrote:
Zend_Search is not part of the I18N core... so I copied this request to the general mailing list

Greetings
Thomas

----- Original Message ----- From: "johnpham" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Saturday, January 06, 2007 4:00 AM
Subject: [fw-i18n] Zend_search shows illigal characters



I follow
http://framework.zend.com/manual/en/zend.search.charset.html#zend.search.charset.description this tutorial to create a search engine in a different language, particular
in Vietnamese.

However, when I did an reIndex action, it showed an error:

Notice: iconv() [function.iconv]: Detected an illegal character in input
string

whenever it converts a field that has some vietnamese characters.
But it somehow still produced 5 files in my index directory:
1. _0 (a CSF file)
2. _1 (a CSF file)
3. deletable
4. index.Lock
5. segments

I assumed the indexing part "worked" and type some vietnamese to search in
the view. And it showed an error:

Notice: Undefined property:
Zend_Search_Lucene_Search_QueryLexer::$_currentCharIndex in QueryLexer.php
on line 477



Does anyone know why and how to fix it ? I'd very much appreciate.
Thank you,

--
View this message in context: http://www.nabble.com/Zend_search-shows-illigal-characters-tf2929424s16154.html#a8190192 Sent from the Zend I18N/Locale mailing list archive at Nabble.com.



Reply via email to