Re: [pylucene-dev] Document encoding?

Andi Vajda Wed, 07 Mar 2007 09:31:54 -0800


On Wed, 7 Mar 2007, Jarek Zgoda wrote:

It seems that I cann't properly store UTF-8 encoded documents using PyLucene(by "properly" I mean the documents are searchable and can be returned inform they have been stored). Should I use only unicode objects in mysearch/indexing machinery code, as PyLucene returns search result's fields asunicode objects?


PyLucene wraps Java Lucene by compiling it with gcj. Java only uses Unicode.

If you pass utf-8 strings to PyLucene APIs, they are converted to Unicodebefore being passed to the wrapped Java Lucene APIs because that's all theyunderstand.


Andi..
_______________________________________________
pylucene-dev mailing list
[email protected]
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev

Re: [pylucene-dev] Document encoding?

Reply via email to