ronyang1985 opened a new issue #8485:
URL: https://github.com/apache/dolphinscheduler/issues/8485


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and 
found no similar feature requirement.
   
   
   ### Description
   
   Add emr task plugin to support submitting tasks to AWS EMR cluster
   
   Supports declaring emr cluster configuration and tasks (steps) to be execute 
in json mode
   
   The json sample is as follows
   
   ```json
   {
     "name": "SparkPi",
     "releaseLabel": "emr-5.34.0",
     "applications": [{"name": "Spark"}],
     "instances": {
       "instanceGroups": [
         {
           "name": "Primary node",
           "instanceRole": "MASTER",
           "instanceType": "m4.xlarge",
           "instanceCount": 1
         }
       ],
       "keepJobFlowAliveWhenNoSteps": false,
       "terminationProtected": false
     },
     "steps": [
       {
         "name": "calculate_pi",
         "actionOnFailure": "CONTINUE",
         "hadoopJarStep": {
           "jar": "command-runner.jar",
           "args": ["/usr/lib/spark/bin/run-example", "SparkPi", "15"]
         }
       }
     ],
     "jobFlowRole": "EMR_EC2_DefaultRole",
     "serviceRole": "EMR_DefaultRole"
   }
   ```
   
   ### Use case
   
   _No response_
   
   ### Related issues
   
   _No response_
   
   ### Are you willing to submit a PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to