[ 
https://issues.apache.org/jira/browse/BEAM-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16848869#comment-16848869
 ] 

Ismaël Mejía commented on BEAM-1861:
------------------------------------

It seems that supporting this in Beam may require upstream changes in ORC to 
make the API more generic (right now it seems to be tightly coupled with Hadoop 
FileSystem). For more details see ORC-508.


As a workaround Beam users on Hadoop supported file systems can use 
`org.apache.orc.mapreduce.OrcInputFormat` and 
`org.apache.orc.mapreduce.OrcOutputFormat`via HadoopFormatIO.

> ORC support
> -----------
>
>                 Key: BEAM-1861
>                 URL: https://issues.apache.org/jira/browse/BEAM-1861
>             Project: Beam
>          Issue Type: Wish
>          Components: io-ideas
>            Reporter: Tibor Kiss
>            Priority: Major
>
> Add support for Optimized Row Columnar (ORC) file format: 
> https://orc.apache.org/
> As ORC does not have Python bindings (yet) this effort focuses on Java based 
> implementation



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to