Hi all,

I'm trying to figure out how to query latin1-encoded data in Hive.

I created a file with 256 characters, one per row, covering Unicode
code points 0-255 and encoded in latin1. I made a table out of it. But
when I do a "select *", Hive returns the rows for the upper half
(bytes 128-255) as '\xef\xbf\xbd', which is the replacement character
'\ufffd' encoded in UTF-8.
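
To illustrate the symptom outside Hive, here is a minimal Python sketch.
It assumes (my guess, not something I have confirmed in the Hive source)
that the bytes are being decoded as UTF-8 with invalid sequences replaced
by U+FFFD. The helper name roundtrip_as_utf8 is made up for this example.

```python
def roundtrip_as_utf8(raw: bytes) -> bytes:
    """Decode raw bytes as UTF-8, replacing invalid sequences with U+FFFD,
    then re-encode the result as UTF-8 (assumed to mimic what I see)."""
    return raw.decode("utf-8", errors="replace").encode("utf-8")

# A plain ASCII byte survives unchanged.
print(roundtrip_as_utf8(b"A"))

# A lone high byte such as 0xE9 (e-acute in latin1) is not valid UTF-8,
# so it comes back as the encoded replacement character.
print(roundtrip_as_utf8(b"\xe9"))

# Count how many single-byte rows (0-255) turn into the replacement character.
replaced = sum(
    1 for b in range(256)
    if roundtrip_as_utf8(bytes([b])) == b"\xef\xbf\xbd"
)
print(replaced)

# Decoding with the correct charset first keeps the character intact.
print(b"\xe9".decode("latin-1").encode("utf-8"))
```

In this sketch only the bytes below 128 survive, which matches what I see
in the query output.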

Does anyone know how to work with non-UTF-8 data in Hive?

Cheers,
-- 
bc Wong
Cloudera Software Engineer
