[ https://issues.apache.org/jira/browse/NIFI-4946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16391548#comment-16391548 ]

ASF GitHub Bot commented on NIFI-4946:
--------------------------------------

Github user zenfenan commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/2521#discussion_r173216342
  
    --- Diff: nifi-nar-bundles/nifi-spark-bundle/nifi-livy-processors/src/main/java/org/apache/nifi/processors/livy/ExecuteSparkInteractive.java ---
    @@ -83,6 +83,62 @@
                 .expressionLanguageSupported(true)
                 .build();
     
    +    public static final PropertyDescriptor IS_BATCH_JOB = new PropertyDescriptor.Builder()
    +            .name("exec-spark-iactive-is_batch_job")
    +            .displayName("Is Batch Job")
    +            .description("If true, the `Code` property is ignored and the incoming flow file is treated "
    +                    + "as a trigger event rather than as code for the Spark session. When the `Wait` state is "
    +                    + "self-routed, the Livy JSON response flow file from the previous Spark job is used to poll "
    +                    + "the job status for success or failure.")
    +            .required(true)
    +            .allowableValues("true", "false")
    +            .defaultValue("false")
    +            .build();
    +
    +    public static final PropertyDescriptor PY_FILES = new PropertyDescriptor.Builder()
    +            .name("exec-spark-iactive-pyfiles")
    +            .displayName("pyFiles")
    +            .description("Python files (*.py, *.zip) to be used in this batch session")
    +            .required(false)
    +            .addValidator(StandardValidators.createURLorFileValidator())
    +            .expressionLanguageSupported(false)
    +            .build();
    +
    +    public static final PropertyDescriptor JAR_FILES = new PropertyDescriptor.Builder()
    +            .name("exec-spark-iactive-jarfiles")
    +            .displayName("jars")
    +            .description("JAR files to be used in this batch session")
    +            .required(false)
    +            .addValidator(StandardValidators.NON_EMPTY_VALIDATOR)
    +            .expressionLanguageSupported(false)
    +            .build();
    +
    +    public static final PropertyDescriptor NAME = new PropertyDescriptor.Builder()
    --- End diff --
    
    Is this supposed to be the Spark app name? It looks like it is never used anywhere other than being added to the `PropertyDescriptor` list.
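    
    If the intent is to set the Spark application name, it presumably needs to be read in `onTrigger()` and added to the Livy `/batches` request body. A minimal sketch, assuming the body is assembled as a `JSONObject` (the `payload` variable here is hypothetical, not the PR's actual code):
    
        // Where the Livy /batches request body is built (illustrative only):
        String appName = context.getProperty(NAME).getValue();
        if (appName != null && !appName.isEmpty()) {
            // Livy's "name" field names the batch session / Spark application
            payload.put("name", appName);
        }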


> nifi-spark-bundle : Adding support for pyfiles, file, jars options
> ------------------------------------------------------------------
>
>                 Key: NIFI-4946
>                 URL: https://issues.apache.org/jira/browse/NIFI-4946
>             Project: Apache NiFi
>          Issue Type: New Feature
>          Components: Extensions
>    Affects Versions: 1.6.0
>         Environment: Ubuntu 16.04, IntelliJ
>            Reporter: Mageswaran
>            Priority: Major
>             Fix For: 1.6.0
>
>         Attachments: nifi-spark-options.png, nifi-spark.png
>
>
> Adding support for submitting PySpark-based Spark jobs (which are normally 
> structured as modules) over Livy on the existing "ExecuteSparkInteractive" 
> processor.
> This is done by reading file paths for the pyFiles and file options, plus an 
> option from the user indicating whether the processor should trigger a batch 
> job or not.
> [https://livy.incubator.apache.org/docs/latest/rest-api.html]
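>
> For reference, submitting a batch with these options means POSTing a payload 
> like the one below to Livy's `/batches` endpoint (the paths and name are 
> placeholders, not values from the PR):
> {code:json}
> {
>   "file": "/path/to/main.py",
>   "pyFiles": ["/path/to/modules.zip"],
>   "jars": ["/path/to/dependency.jar"],
>   "name": "my-spark-app"
> }
> {code}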
>  *Current Workflow Logic ([https://github.com/apache/nifi/pull/2521])* 
> (sketched in code after the list below)
>  * Check whether the processor has to handle code or submit a Spark job
>  * Read the incoming flow file
>  ** If batch == true
>  *** If the flow file matches a Livy `batches` JSON response (self-routed through the `wait` loop)
>  **** Wait for the Status Check Interval
>  **** Read the state
>  **** If the state is `running`, route it to `wait`; if it is `success` or `dead`, route it accordingly
>  *** Else
>  **** Ignore the flow file content
>  **** Trigger the Spark job over the Livy `batches` endpoint
>  **** Read the state of the submitted job
>  **** If the state is `running`, route it to `wait`; if it is `success` or `dead`, route it accordingly
>  ** Else
>  *** Existing logic to handle `Code`
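>
> A rough Java sketch of the batch-mode routing above (relationship names and 
> helpers such as pollBatchState() are illustrative only, not the PR's actual 
> code):
> {code:java}
> if (isBatchJob) {
>     if (isLivyBatchResponse(flowFile)) {
>         // Self-routed from `wait`: poll the previously submitted job
>         Thread.sleep(statusCheckIntervalMillis);
>         String state = pollBatchState(flowFile);   // GET /batches/{id}
>         routeByState(session, flowFile, state);
>     } else {
>         // The flow file is only a trigger; its content is ignored
>         String state = submitBatchJob(context);    // POST /batches
>         routeByState(session, flowFile, state);
>     }
> } else {
>     // Existing interactive path: submit `Code` to the Livy session
> }
> {code}
> Here routeByState() would transfer the flow file to `wait` while the state is 
> `running`, and to `success` or `failure` once it is `success` or `dead`.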
>  
> !nifi-spark-options.png!
> !nifi-spark.png!
>  
> Thanks.



