>I've then created ORC and Parquet versions of this same table. The >behavior remains... select * works, any filter creates horribly >mangled results. > >To replace- throw this into a file: > >{"id":1,"order_id":8,"number":1,"broken":"#\n---\nstuff\nstuff2: >\"stuff3\"\nstuff4: '730'\nstuff5: []\n","last":null}
You're trying to fix the issue on the wrong side of the problem, I think. Try with set hive.default.serde=org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe ; set hive.query.result.fileformat=SequenceFile; Hopefully we'll have a newer & more compact format for results soon. Cheers, Gopal