Thanks Tibor,
Ready to help! (I have also started work on ParquetIO.)
Regards
JB
On 04/03/2017 02:11 PM, Tibor Kiss wrote:
Thanks for your replies. I've created
https://issues.apache.org/jira/browse/BEAM-1861 to track this effort.
On Sun, Apr 2, 2017 at 7:40 AM, Jean-Baptiste Onofré <[email protected]>
wrote:
+1
By the way, around the same topic, I'm working on Apache CarbonData
support (http://carbondata.apache.org/).
Regards
JB
On 04/01/2017 05:31 PM, Tibor Kiss wrote:
Hello,
Recently the Optimized Row Columnar (ORC) file format was spun off from Hive
and became a top-level Apache project: https://orc.apache.org/
It is similar to Parquet in the sense that it uses a column-major format, but
ORC has a more elaborate type system and stores basic column statistics at the
file, stripe, and row-group levels.
I'd be interested in extending Beam with ORC support if others would find it
helpful too.
What do you think?
- Tibor
--
Jean-Baptiste Onofré
[email protected]
http://blog.nanthrax.net
Talend - http://www.talend.com