[jira] [Commented] (NIFI-981) Add support for Hive JDBC / ExecuteSQL

ASF GitHub Bot (JIRA) Mon, 02 May 2016 19:17:02 -0700

    [ 
https://issues.apache.org/jira/browse/NIFI-981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15267940#comment-15267940
 ]


ASF GitHub Bot commented on NIFI-981:
-------------------------------------

Github user bbende commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/384#discussion_r61832445
  
    --- Diff: 
nifi-nar-bundles/nifi-hive-bundle/nifi-hive-processors/src/main/java/org/apache/nifi/processors/hive/SelectHiveQL.java
 ---
    @@ -52,15 +54,27 @@
     @EventDriven
     @InputRequirement(Requirement.INPUT_ALLOWED)
     @Tags({"hive", "sql", "select", "jdbc", "query", "database"})
    -@CapabilityDescription("Execute provided HiveQL SELECT query against a 
Hive database connection. Query result will be converted to Avro format."
    +@CapabilityDescription("Execute provided HiveQL SELECT query against a 
Hive database connection. Query result will be converted to Avro or CSV format."
             + " Streaming is used so arbitrarily large result sets are 
supported. This processor can be scheduled to run on "
             + "a timer, or cron expression, using the standard scheduling 
methods, or it can be triggered by an incoming FlowFile. "
             + "If it is triggered by an incoming FlowFile, then attributes of 
that FlowFile will be available when evaluating the "
             + "select query. FlowFile attribute 'executehiveql.row.count' 
indicates how many rows were selected.")
    -public class ExecuteHiveQL extends AbstractHiveQLProcessor {
    +@WritesAttributes({
    +        @WritesAttribute(attribute = "mime.type", description = "Sets the 
MIME type for the outgoing flowfile to application/avro-binary for Avro or 
text/csv for CSV."),
    +        @WritesAttribute(attribute = "filename", description = "Adds .avro 
or .csv to the filename attribute depending on which output format is 
selected."),
    +        @WritesAttribute(attribute = "executehiveql.row.count", 
description = "Indicates how many rows were selected/returned by the query.")
    --- End diff --
    
    Nit-picking here, but given the rename of the processor, do we want this to 
be selecthiveql.row.count?


> Add support for Hive JDBC / ExecuteSQL
> --------------------------------------
>
>                 Key: NIFI-981
>                 URL: https://issues.apache.org/jira/browse/NIFI-981
>             Project: Apache NiFi
>          Issue Type: New Feature
>          Components: Extensions
>            Reporter: Joseph Witt
>            Assignee: Matt Burgess
>
> In this mailing list thread from September 2015 "NIFI DBCP connection pool 
> not working for hive" the main thrust of the converstation is to provide 
> proper support for delivering data to hive.  Hive's jdbc driver appears to 
> have dependencies on Hadoop libraries.  We need to be careful/thoughtful 
> about how to best support this so that different versions of Hadoop distros 
> can be supported (potentially in parallel on the same flow).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (NIFI-981) Add support for Hive JDBC / ExecuteSQL

Reply via email to