Hi all,

In case you're interested in this. A while ago I wrote up a Haskell parser
and writer for ORC, which one can find here
<https://github.com/HuwCampbell/orc-haskell>. I use it in the day job a
fair bit, and it's come in quite handy for ad-hoc data generators and
parsing tasks.

It's a "clean room" implementation, and was written almost entirely from
the specification instead of cribbing from the Java or C++ versions.

It's also quite capable, being able to read any schemas for v0 and v1 files
with a few different compression codecs. It writes with v0 style RLEs.

Lastly it's pretty compact, being only ~6000 lines of sparsely formatted
Haskell. I think it demonstrates how ORC works quite nicely.

Kind regards,
Huw

Reply via email to