[ 
https://issues.apache.org/jira/browse/NIFI-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15412826#comment-15412826
 ] 

ASF GitHub Bot commented on NIFI-1663:
--------------------------------------

GitHub user mattyb149 reopened a pull request:

    https://github.com/apache/nifi/pull/727

    NIFI-1663: Add ConvertAvroToORC processor

    This PR is based on #706 which removed the ConvertAvroToORC processor using 
Hive 2.x and Apache ORC 1.x. This PR replaces that processor with one that uses 
Hive 1.2.1 (which includes hive-orc before it was split into its own Apache 
project).

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mattyb149/nifi old_orc

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/727.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #727
    
----
commit c550fdb1f3a6a140647eb85de273e6d20fc9bb4b
Author: Matt Burgess <[email protected]>
Date:   2016-07-27T03:25:11Z

    NIFI-1663: Add ConvertAvroToORC processor

commit db7118780fe5cd16cfe919bfce171a453089ee35
Author: Matt Burgess <[email protected]>
Date:   2016-08-04T13:59:21Z

    NIFI-1663: Updated NifiOrcUtils with review comments

commit 242e4dcc367c9f1b9f072dda965058c1646f25bf
Author: Matt Burgess <[email protected]>
Date:   2016-08-05T21:13:00Z

    NIFI-1663: Added support to ConvertAvroToORC for nested records, added unit 
tests

----


> Add support for ORC format
> --------------------------
>
>                 Key: NIFI-1663
>                 URL: https://issues.apache.org/jira/browse/NIFI-1663
>             Project: Apache NiFi
>          Issue Type: New Feature
>            Reporter: Matt Burgess
>            Assignee: Matt Burgess
>             Fix For: 1.0.0
>
>
> From the Hive/ORC wiki 
> (https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC): 
> The Optimized Row Columnar (ORC) file format provides a highly efficient way 
> to store Hive data ... Using ORC files improves performance when Hive is 
> reading, writing, and processing data.
> As users are interested in NiFi integrations with Hive (NIFI-981, NIFI-1193, 
> etc.), NiFi should be able to support ORC file format to enable users to 
> efficiently store flow files for use by Hive.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to