I'm loading twits into PostgreSQL, and get a few hundreds of errors for illegal sequence 0x00 in UTF8, e.g. (each leading . is 10,000 gardenhose twits):
.org.postgresql.util.PSQLException: ERROR: invalid byte sequence for encoding "UTF8": 0x00 [loving the weather here in sunny birmingham uk at the moment but its hard to sleep in when imfeeling lazy lol] com.tfitter.db.DBError: CANNOT PUT TWIT 2283513311 ROLLBACK uid=21490127 tid=2283513311 ....org.postgresql.util.PSQLException: ERROR: invalid byte sequence for encoding "UTF8": 0x00 F?9H^f????'??????%???p?{^] com.tfitter.db.DBError: CANNOT PUT TWIT 2283842814 ROLLBACK uid=30029372 tid=2283842814 ...org.postgresql.util.PSQLException: ERROR: invalid byte sequence for encoding "UTF8": 0x00 [...@andycrofford まだ脱ぐな。そろそろこのこと考えるのは最後にされると、5エ譛ォ遶ッ 縺ョ譁ケ縺ッ蜃コ鬘後&繧後腟蕭⒢㎢⒢⒢] Anybody knows how to get rid of those 0x00s cleanly in Scala/Java? Cheers, Alexy