Hi Rishi,

The problem is that AbstractFileOutputOperator cannot be used to write columnar storage formats such as Parquet or ORC, because it assumes a row-oriented data format. AFAIK, Malhar does not have output operators that support columnar storage, so it will be necessary to create a new output operator to write ORC files.
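
To illustrate the mismatch: row-oriented output can write each tuple as it arrives, while a columnar writer has to collect tuples into per-column batches and flush a whole batch at once. Below is a minimal sketch of that buffering in plain Java. It uses only the standard library; the class name, the two-field (id, name) tuple layout, and the string that stands in for the file are all hypothetical, and this is neither the Malhar nor the ORC API.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration only; not part of Malhar or the ORC library. */
class ColumnarBatchSketch {
    private final int capacity;      // maximum rows per batch
    private final long[] ids;        // column 1 of the current batch
    private final String[] names;    // column 2 of the current batch
    private int size = 0;            // rows buffered so far
    // Stands in for the output file: one entry per flushed batch.
    private final List<String> flushed = new ArrayList<>();

    ColumnarBatchSketch(int capacity) {
        this.capacity = capacity;
        this.ids = new long[capacity];
        this.names = new String[capacity];
    }

    /** Called once per incoming tuple; splits the row across the columns. */
    void add(long id, String name) {
        ids[size] = id;
        names[size] = name;
        size++;
        if (size == capacity) {
            flush();
        }
    }

    /** Emits the buffered rows column by column; call before closing too. */
    void flush() {
        if (size == 0) {
            return;
        }
        StringBuilder idCol = new StringBuilder();
        StringBuilder nameCol = new StringBuilder();
        for (int i = 0; i < size; i++) {
            if (i > 0) {
                idCol.append(',');
                nameCol.append(',');
            }
            idCol.append(ids[i]);
            nameCol.append(names[i]);
        }
        flushed.add("ids=" + idCol + ";names=" + nameCol);
        size = 0;
    }

    List<String> flushedBatches() {
        return flushed;
    }

    public static void main(String[] args) {
        ColumnarBatchSketch buffer = new ColumnarBatchSketch(2);
        buffer.add(1, "a");
        buffer.add(2, "b");
        buffer.add(3, "c");
        buffer.flush(); // write the partial last batch
        System.out.println(buffer.flushedBatches());
    }
}
```

In a real operator, add() would correspond to the input port's per-tuple processing, and the final flush() would have to happen before the file is closed, which is the part a per-tuple byte append does not provide.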

Thank you,

Vlad

On 4/27/17 00:05, rishi wrote:
Vlad,

Thanks for the reply!

I have code that takes an input tuple and writes it to HDFS in ORC format. Now
my challenge is to incorporate the same code into an operator that
extends AbstractFileOutputOperator.

I am attaching the code that writes the ORC file and the operator into
which I am trying to incorporate it.

Thanks,
Rishi

ORC_Query_Apex.txt
<http://apache-apex-users-list.78494.x6.nabble.com/file/n1558/ORC_Query_Apex.txt>



--
View this message in context: 
http://apache-apex-users-list.78494.x6.nabble.com/How-to-write-data-in-ORC-format-to-hdfs-instead-of-text-format-tp1539p1558.html
Sent from the Apache Apex Users list mailing list archive at Nabble.com.
