Hi Rishi,
Problem is that it is not possible to use AbstractFileOutputOperator to
write to a columnar storage data formats such as Parquet or ORC.
AbstractFileOutputOperator assumes row data formats. AFAIK, Malhar does
not have output operators that support columnar storage, so it will be
necessary to create a new output operators to write to ORC files.
Thank you,
Vlad
On 4/27/17 00:05, rishi wrote:
Vlad,
Thanks for the reply!
I have code which takes the input tuple and write to HDFS in ORC format. Now
my challenge is to incorporate same code in one of the operator which
extends AbstractFileOutputOperator.
I am attaching the code which is writing the ORC file and the operator in
which I am trying to incorporate the code.
Thanks ORC_Query_Apex.txt
<http://apache-apex-users-list.78494.x6.nabble.com/file/n1558/ORC_Query_Apex.txt>
Rishi
--
View this message in context:
http://apache-apex-users-list.78494.x6.nabble.com/How-to-write-data-in-ORC-format-to-hdfs-instead-of-text-format-tp1539p1558.html
Sent from the Apache Apex Users list mailing list archive at Nabble.com.