Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-19 Thread Albe Laurenz
Rural Hunter wrote: Do you get the error if you try Chinese settings without nlpbamboo? How can I do this? The Chinese processing is provided by nlpbamboo. Er, sorry, forget what I said. I have no experience with Chinese. I guess you should try to ask the nlpbamboo people. Is there anything

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-18 Thread Rural Hunter
于 2012/4/17 18:06, Albe Laurenz 写道: Rural Hunter wrote: That's probably the problem - it seems to emit something that is not proper UTF-8 sometimes. Do you get the error if you try Chinese settings without nlpbamboo? How can I do this? The Chinese processing is provided by nlpbamboo. -- Sent

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-17 Thread Albe Laurenz
Rural Hunter wrote: My db is in utf-8, I have a row in my table say tmp_article and I wanted to generate ts_vector from the article content: select to_tsvector(content) from tmp_article; But I got this error: ERROR: invalid byte sequence for encoding UTF8: 0xf481 Do you use any nonstandard

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-16 Thread Albe Laurenz
Rural Hunter wrote: My db is in utf-8, I have a row in my table say tmp_article and I wanted to generate ts_vector from the article content: select to_tsvector(content) from tmp_article; But I got this error: ERROR: invalid byte sequence for encoding UTF8: 0xf481 I am wondering how this

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-16 Thread Rural Hunter
=SELECT convert_to(content, 'UTF8') FROM tmp_article; This works. My pg is at latest 9.1.3 on ubuntu 10.04 server. We have millions of data in the db but this is the only one we met the problem. The bad data is inserted in recent days and we upgraded to 9.1.3 right after it was released. 于

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-16 Thread Albe Laurenz
Please don't top post. Rural Hunter wrote: My db is in utf-8, I have a row in my table say tmp_article and I wanted to generate ts_vector from the article content: select to_tsvector(content) from tmp_article; But I got this error: ERROR: invalid byte sequence for encoding UTF8: 0xf481 I

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-16 Thread Rural Hunter
Hi Albe, Yes, I'm using a Chinese tsearch2 plugin called nlpbamboo and the default default_text_search_config is 'chinesecfg'. I tested select to_tsvector('english',content) from tmp_article and this works. So it's the problem of nlpbamboo? The result of 'CAST (content AS bytea)' is shown

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-14 Thread raghu ram
On Sat, Apr 14, 2012 at 9:31 AM, Rural Hunter ruralhun...@gmail.com wrote: doesn't work either. db=# show client_encoding; client_encoding - UTF8 (1 row) db=# set client_encoding='LATIN1'; SET db=# show client_encoding; client_encoding - LATIN1

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-14 Thread Rural Hunter
still doesn't work. db=# set client_encoding='SQL_ASCII' ; SET db=# select to_tsvector(content) from tmp_article; ERROR: invalid byte sequence for encoding "UTF8": 0xf481 2012/4/15 1:38, raghu ram : On Sat, Apr 14, 2012 at 9:31 AM, Rural

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-13 Thread raghu ram
2012/4/14 Rural Hunter ruralhun...@gmail.com My db is in utf-8, I have a row in my table say tmp_article and I wanted to generate ts_vector from the article content: select to_tsvector(content) from tmp_article; But I got this error: ERROR: invalid byte sequence for encoding UTF8: 0xf481

Re: [ADMIN] invalid byte sequence for encoding UTF8: 0xf481 - how could this happen?

2012-04-13 Thread Rural Hunter
doesn't work either. db=# show client_encoding; client_encoding - UTF8 (1 row) db=# set client_encoding='LATIN1'; SET db=# show client_encoding; client_encoding - LATIN1 (1 row) db=#