Hi all,

In case you're interested: a while ago I wrote a Haskell parser and writer for ORC, which you can find here <https://github.com/HuwCampbell/orc-haskell>. I use it a fair bit in my day job, and it's come in quite handy for ad-hoc data generators and parsing tasks.
It's a "clean room" implementation, written almost entirely from the specification rather than by cribbing from the Java or C++ versions. It's also quite capable: it can read any schema for v0 and v1 files, with a few different compression codecs. It writes with v0-style RLEs. Lastly, it's pretty compact, at only ~6000 lines of sparsely formatted Haskell. I think it demonstrates how ORC works quite nicely.

Kind regards,
Huw