I'm loading twits into PostgreSQL and get a few hundred errors
about an illegal 0x00 sequence in UTF8, e.g. (each leading . is 10,000
gardenhose twits):

.org.postgresql.util.PSQLException: ERROR: invalid byte sequence for
encoding "UTF8": 0x00 [loving the weather here in sunny birmingham uk
at the moment but its hard to sleep in when imfeeling lazy lol]
com.tfitter.db.DBError: CANNOT PUT TWIT 2283513311
ROLLBACK uid=21490127 tid=2283513311
....org.postgresql.util.PSQLException: ERROR: invalid byte sequence
for encoding "UTF8": 0x00 F?9H^f????'??????%???p?{^]
com.tfitter.db.DBError: CANNOT PUT TWIT 2283842814
ROLLBACK uid=30029372 tid=2283842814
...org.postgresql.util.PSQLException: ERROR: invalid byte sequence for
encoding "UTF8": 0x00 [...@andycrofford  まだ脱ぐな。そろそろこのこと考えるのは最後にされると、5エ譛ォ遶ッ
縺ョ譁ケ縺ッ蜃コ鬘後&繧後腟蕭⒢㎢⒢⒢]

Does anybody know how to get rid of those 0x00s cleanly in Scala/Java?
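
To make the question concrete: PostgreSQL text columns cannot store the
NUL character (U+0000), so the naive fix would be to strip it from each
string before the insert. A minimal sketch of that idea follows; the
class and method names (NulStripper, stripNul) are made up for
illustration and are not part of my loader, and it assumes the twit text
is already a decoded Java String.

```java
public final class NulStripper {
    private NulStripper() {}

    /**
     * Returns the text with every U+0000 character removed.
     * A null input is returned unchanged.
     */
    public static String stripNul(String text) {
        // Fast path: most strings contain no NUL, so avoid allocating a copy.
        if (text == null || text.indexOf('\u0000') < 0) {
            return text;
        }
        // String.replace(CharSequence, CharSequence) replaces all occurrences.
        return text.replace("\u0000", "");
    }

    public static void main(String[] args) {
        // Hypothetical sample with an embedded NUL character.
        String dirty = "feeling lazy\u0000 lol";
        System.out.println(stripNul(dirty));
    }
}
```

This would be called on each text field just before it is bound to the
INSERT statement. It silently drops the NULs, which may or may not be
what one wants for the twits that are binary garbage to begin with.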
Cheers,
Alexy
