Hi all, I'm trying to figure out how to query latin1-encoded data in Hive.
I created a file containing 256 characters, with Unicode values 0-255, encoded in latin1, and made a table out of it. But when I do a "select *", Hive returns the upper-ASCII rows (values 128-255) as '\xef\xbf\xbd', which is the replacement character '\ufffd' encoded in UTF-8.

Does anyone know how to work with non-UTF8 data?

Cheers,
--
bc Wong
Cloudera Software Engineer
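
P.S. For anyone who wants to reproduce the symptom outside Hive, here is a minimal Python sketch. It assumes one character per row, and it assumes the text is being decoded as UTF-8 with invalid bytes replaced by '\ufffd'. That second assumption is consistent with the output described above but is not confirmed Hive behaviour. The variable names are made up for illustration.

```python
# Build the 256 code points 0-255 and encode them as latin1 (one byte each).
latin1_bytes = "".join(chr(i) for i in range(256)).encode("latin-1")

# Assumption: each row holds one byte, and the reader decodes it as UTF-8,
# substituting U+FFFD for anything that is not valid UTF-8.
decoded = [bytes([b]).decode("utf-8", errors="replace") for b in latin1_bytes]

# Bytes 0-127 are plain ASCII, which is identical in latin1 and UTF-8.
ascii_ok = all(decoded[i] == chr(i) for i in range(128))

# Bytes 128-255 are never valid as standalone UTF-8, so each one is replaced.
upper_replaced = all(decoded[i] == "\ufffd" for i in range(128, 256))

# Re-encoding the replacement character as UTF-8 gives the bytes seen in the output.
replacement_utf8 = "\ufffd".encode("utf-8")

print(ascii_ok, upper_replaced, replacement_utf8)
```

If that assumption holds, the original byte values are already lost by the time the rows come back, so the fix would have to happen at read time rather than on the query output.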
