>I've then created ORC and Parquet versions of this same table. The
>behavior remains... select * works, any filter creates horribly
>mangled results.
>
>To replace- throw this into a file:
>
>{"id":1,"order_id":8,"number":1,"broken":"#\n---\nstuff\nstuff2:
>\"stuff3\"\nstuff4: '730'\nstuff5: []\n","last":null}
You're trying to fix the issue on the wrong side of the problem, I think.
Try with
set
hive.default.serde=org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe
;
set hive.query.result.fileformat=SequenceFile;
Hopefully we'll have a newer & more compact format for results soon.
Cheers,
Gopal