+1
By the way, around the same topic, I'm working on Apache CarbonData support
(http://carbondata.apache.org/).
Regards
JB
On 04/01/2017 05:31 PM, Tibor Kiss wrote:
Hello,
Recently the Optimized Row Columnar (ORC) file format was spin off from Hive
and became a top level Apache Project: https://orc.apache.org/
It is similar to Parquet in a sense that it uses column major format but
ORC has
a more elaborate type system and stores basic statistics about each row.
I'd be interested extending Beam with ORC support if others find it helpful
too.
What do you think?
- Tibor
--
Jean-Baptiste Onofré
[email protected]
http://blog.nanthrax.net
Talend - http://www.talend.com