[jira] [Commented] (AIRFLOW-2524) Airflow integration with AWS Sagemaker

2018-08-18 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-2524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16585028#comment-16585028
 ] 

ASF GitHub Bot commented on AIRFLOW-2524:
-

troychen728 opened a new pull request #3767: [AIRFLOW-2524]Add SageMaker Batch 
Inference
URL: https://github.com/apache/incubator-airflow/pull/3767
 
 
   Make sure you have checked _all_ steps below.
   
   ### JIRA
   - [X] My PR addresses the following [Airflow 
JIRA](https://issues.apache.org/jira/browse/AIRFLOW/) issues and references 
them in the PR title. 
   - https://issues.apache.org/jira/browse/AIRFLOW-2524
   
   ### Description
   - [X] Here are some details about my PR, including screenshots of any UI 
changes:
   - This PR allows user to start a Amazon SageMaker Transform job (Batch 
Inference) using the SageMakerCreateTransformJobOperator
   - User can also check the progress(state) of the transform job through 
the SageMakerTranformSensor
   
   
   ### Tests
   - [X] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
- tests/contrib/hooks/test_sagemaker_hook.py
- 
tests/contrib/operators/test_sagemaker_create_transform_job_operator.py
- tests/contrib/sensors/test_sagemaker_transform_sensor.py
 
   
   
   
   ### Commits
   - [X] My commits all reference JIRA issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
   1. Subject is separated from body by a blank line
   2. Subject is limited to 50 characters
   3. Subject does not end with a period
   4. Subject uses the imperative mood ("add", not "adding")
   5. Body wraps at 72 characters
   6. Body explains "what" and "why", not "how"
   
   
   ### Documentation
   - [X] In case of new functionality, my PR adds documentation that describes 
how to use it.
   - When adding new operators/hooks/sensors, the autoclass documentation 
generation needs to be added.
   
   
   ### Code Quality
   - [X] Passes `git diff upstream/master -u -- "*.py" | flake8 --diff`
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Airflow integration with AWS Sagemaker
> --
>
> Key: AIRFLOW-2524
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2524
> Project: Apache Airflow
>  Issue Type: Improvement
>  Components: aws, contrib
>Reporter: Rajeev Srinivasan
>Assignee: Yang Yu
>Priority: Major
>  Labels: AWS
> Fix For: 2.0.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Would it be possible to orchestrate an end to end  AWS  Sagemaker job using 
> Airflow.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] troychen728 opened a new pull request #3767: [AIRFLOW-2524]Add SageMaker Batch Inference

2018-08-18 Thread GitBox
troychen728 opened a new pull request #3767: [AIRFLOW-2524]Add SageMaker Batch 
Inference
URL: https://github.com/apache/incubator-airflow/pull/3767
 
 
   Make sure you have checked _all_ steps below.
   
   ### JIRA
   - [X] My PR addresses the following [Airflow 
JIRA](https://issues.apache.org/jira/browse/AIRFLOW/) issues and references 
them in the PR title. 
   - https://issues.apache.org/jira/browse/AIRFLOW-2524
   
   ### Description
   - [X] Here are some details about my PR, including screenshots of any UI 
changes:
   - This PR allows user to start a Amazon SageMaker Transform job (Batch 
Inference) using the SageMakerCreateTransformJobOperator
   - User can also check the progress(state) of the transform job through 
the SageMakerTranformSensor
   
   
   ### Tests
   - [X] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
- tests/contrib/hooks/test_sagemaker_hook.py
- 
tests/contrib/operators/test_sagemaker_create_transform_job_operator.py
- tests/contrib/sensors/test_sagemaker_transform_sensor.py
 
   
   
   
   ### Commits
   - [X] My commits all reference JIRA issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
   1. Subject is separated from body by a blank line
   2. Subject is limited to 50 characters
   3. Subject does not end with a period
   4. Subject uses the imperative mood ("add", not "adding")
   5. Body wraps at 72 characters
   6. Body explains "what" and "why", not "how"
   
   
   ### Documentation
   - [X] In case of new functionality, my PR adds documentation that describes 
how to use it.
   - When adding new operators/hooks/sensors, the autoclass documentation 
generation needs to be added.
   
   
   ### Code Quality
   - [X] Passes `git diff upstream/master -u -- "*.py" | flake8 --diff`
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] feng-tao commented on issue #3763: [AIRFLOW-2915] Add example DAG for GoogleCloudStorageToBigQueryOperator

2018-08-18 Thread GitBox
feng-tao commented on issue #3763: [AIRFLOW-2915] Add example DAG for 
GoogleCloudStorageToBigQueryOperator
URL: 
https://github.com/apache/incubator-airflow/pull/3763#issuecomment-414099828
 
 
   If I don't have gcp dependency, what happens when I set load_example_dag to 
true?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] dalejung commented on issue #3723: [AIRFLOW-2876] Update Tenacity to 4.12

2018-08-18 Thread GitBox
dalejung commented on issue #3723: [AIRFLOW-2876] Update Tenacity to 4.12
URL: 
https://github.com/apache/incubator-airflow/pull/3723#issuecomment-414085029
 
 
   @r39132 is the futures backport installed in your 3.6 install?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] codecov-io commented on issue #3766: [AIRFLOW-2917] Set AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed

2018-08-18 Thread GitBox
codecov-io commented on issue #3766: [AIRFLOW-2917] Set 
AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed
URL: 
https://github.com/apache/incubator-airflow/pull/3766#issuecomment-414063517
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3766?src=pr=h1)
 Report
   > Merging 
[#3766](https://codecov.io/gh/apache/incubator-airflow/pull/3766?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-airflow/commit/738d27edfeddb0dc8712cf2e2db72dedb4354821?src=pr=desc)
 will **not change** coverage.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-airflow/pull/3766/graphs/tree.svg?token=WdLKlKHOAU=pr=150=650)](https://codecov.io/gh/apache/incubator-airflow/pull/3766?src=pr=tree)
   
   ```diff
   @@   Coverage Diff   @@
   ##   master#3766   +/-   ##
   ===
 Coverage   77.67%   77.67%   
   ===
 Files 204  204   
 Lines   1584915849   
   ===
 Hits1231012310   
 Misses   3539 3539
   ```
   
   
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3766?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3766?src=pr=footer).
 Last update 
[738d27e...4446886](https://codecov.io/gh/apache/incubator-airflow/pull/3766?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (AIRFLOW-2917) Set AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed for k8s executor

2018-08-18 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-2917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584783#comment-16584783
 ] 

ASF GitHub Bot commented on AIRFLOW-2917:
-

ckljohn opened a new pull request #3766: [AIRFLOW-2917] Set 
AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed
URL: https://github.com/apache/incubator-airflow/pull/3766
 
 
   Make sure you have checked _all_ steps below.
   
   ### Jira
   
   - [x] My PR addresses the following [Airflow 
Jira](https://issues.apache.org/jira/browse/AIRFLOW/) issues and references 
them in the PR title. For example, "\[AIRFLOW-XXX\] My Airflow PR"
 - https://issues.apache.org/jira/browse/AIRFLOW-2917
 - In case you are fixing a typo in the documentation you can prepend your 
commit with \[AIRFLOW-XXX\], code changes always need a Jira issue.
   
   ### Description
   
   - [x] Here are some details about my PR, including screenshots of any UI 
changes:
   
   In Kubernetes executor, `AIRFLOW_CORE_SQL_ALCHEMY_CONN` is set as an 
environment variable even when it is specified in configmap or secrets.
   
   Only when `airflow_configmap` is not provided and 
`AIRFLOW__CORE__SQL_ALCHEMY_CONN` not in secrets, it is set as an env var.
   
   
   ### Tests
   
   - [ ] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
   
   ### Commits
   
   - [x] My commits all reference Jira issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
 1. Subject is separated from body by a blank line
 1. Subject is limited to 50 characters (not including Jira issue reference)
 1. Subject does not end with a period
 1. Subject uses the imperative mood ("add", not "adding")
 1. Body wraps at 72 characters
 1. Body explains "what" and "why", not "how"
   
   ### Documentation
   
   - [x] In case of new functionality, my PR adds documentation that describes 
how to use it.
 - When adding new operators/hooks/sensors, the autoclass documentation 
generation needs to be added.
   
   ### Code Quality
   
   - [x] Passes `git diff upstream/master -u -- "*.py" | flake8 --diff`
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Set AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed for k8s executor
> -
>
> Key: AIRFLOW-2917
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2917
> Project: Apache Airflow
>  Issue Type: Improvement
>  Components: executor
>Affects Versions: 1.10.0
>Reporter: John Cheng
>Priority: Minor
>
> In Kubernetes executor, `AIRFLOW__CORE__SQL_ALCHEMY_CONN` is set as an 
> environment variable even when it is specified in configmap or secrets.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] ckljohn opened a new pull request #3766: [AIRFLOW-2917] Set AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed

2018-08-18 Thread GitBox
ckljohn opened a new pull request #3766: [AIRFLOW-2917] Set 
AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed
URL: https://github.com/apache/incubator-airflow/pull/3766
 
 
   Make sure you have checked _all_ steps below.
   
   ### Jira
   
   - [x] My PR addresses the following [Airflow 
Jira](https://issues.apache.org/jira/browse/AIRFLOW/) issues and references 
them in the PR title. For example, "\[AIRFLOW-XXX\] My Airflow PR"
 - https://issues.apache.org/jira/browse/AIRFLOW-2917
 - In case you are fixing a typo in the documentation you can prepend your 
commit with \[AIRFLOW-XXX\], code changes always need a Jira issue.
   
   ### Description
   
   - [x] Here are some details about my PR, including screenshots of any UI 
changes:
   
   In Kubernetes executor, `AIRFLOW_CORE_SQL_ALCHEMY_CONN` is set as an 
environment variable even when it is specified in configmap or secrets.
   
   Only when `airflow_configmap` is not provided and 
`AIRFLOW__CORE__SQL_ALCHEMY_CONN` not in secrets, it is set as an env var.
   
   
   ### Tests
   
   - [ ] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
   
   ### Commits
   
   - [x] My commits all reference Jira issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
 1. Subject is separated from body by a blank line
 1. Subject is limited to 50 characters (not including Jira issue reference)
 1. Subject does not end with a period
 1. Subject uses the imperative mood ("add", not "adding")
 1. Body wraps at 72 characters
 1. Body explains "what" and "why", not "how"
   
   ### Documentation
   
   - [x] In case of new functionality, my PR adds documentation that describes 
how to use it.
 - When adding new operators/hooks/sensors, the autoclass documentation 
generation needs to be added.
   
   ### Code Quality
   
   - [x] Passes `git diff upstream/master -u -- "*.py" | flake8 --diff`
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Updated] (AIRFLOW-2917) Set AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed for k8s executor

2018-08-18 Thread John Cheng (JIRA)


 [ 
https://issues.apache.org/jira/browse/AIRFLOW-2917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Cheng updated AIRFLOW-2917:

Description: In Kubernetes executor, `AIRFLOW__CORE__SQL_ALCHEMY_CONN` is 
set as an environment variable even when it is specified in configmap or 
secrets.  (was: In Kubernetes executor, AIRFLOW__CORE__SQL_ALCHEMY_CONN is set 
as an environment variable even when it is specified in configmap or secrets.)

> Set AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed for k8s executor
> -
>
> Key: AIRFLOW-2917
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2917
> Project: Apache Airflow
>  Issue Type: Improvement
>  Components: executor
>Affects Versions: 1.10.0
>Reporter: John Cheng
>Priority: Minor
>
> In Kubernetes executor, `AIRFLOW__CORE__SQL_ALCHEMY_CONN` is set as an 
> environment variable even when it is specified in configmap or secrets.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (AIRFLOW-2917) Set AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed for k8s executor

2018-08-18 Thread John Cheng (JIRA)
John Cheng created AIRFLOW-2917:
---

 Summary: Set AIRFLOW__CORE__SQL_ALCHEMY_CONN only when needed for 
k8s executor
 Key: AIRFLOW-2917
 URL: https://issues.apache.org/jira/browse/AIRFLOW-2917
 Project: Apache Airflow
  Issue Type: Improvement
  Components: executor
Affects Versions: 1.10.0
Reporter: John Cheng


In Kubernetes executor, AIRFLOW__CORE__SQL_ALCHEMY_CONN is set as an 
environment variable even when it is specified in configmap or secrets.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] codecov-io commented on issue #3765: [AIRFLOW-2245] Add remote_host of SSH/SFTP operator as templated field

2018-08-18 Thread GitBox
codecov-io commented on issue #3765: [AIRFLOW-2245] Add remote_host of SSH/SFTP 
operator as templated field
URL: 
https://github.com/apache/incubator-airflow/pull/3765#issuecomment-414056215
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=h1)
 Report
   > Merging 
[#3765](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-airflow/commit/738d27edfeddb0dc8712cf2e2db72dedb4354821?src=pr=desc)
 will **not change** coverage.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-airflow/pull/3765/graphs/tree.svg?width=650=150=pr=WdLKlKHOAU)](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=tree)
   
   ```diff
   @@   Coverage Diff   @@
   ##   master#3765   +/-   ##
   ===
 Coverage   77.67%   77.67%   
   ===
 Files 204  204   
 Lines   1584915849   
   ===
 Hits1231012310   
 Misses   3539 3539
   ```
   
   
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=footer).
 Last update 
[738d27e...1d4f374](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[GitHub] codecov-io edited a comment on issue #3765: [AIRFLOW-2245] Add remote_host of SSH/SFTP operator as templated field

2018-08-18 Thread GitBox
codecov-io edited a comment on issue #3765: [AIRFLOW-2245] Add remote_host of 
SSH/SFTP operator as templated field
URL: 
https://github.com/apache/incubator-airflow/pull/3765#issuecomment-414056215
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=h1)
 Report
   > Merging 
[#3765](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-airflow/commit/738d27edfeddb0dc8712cf2e2db72dedb4354821?src=pr=desc)
 will **not change** coverage.
   > The diff coverage is `n/a`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-airflow/pull/3765/graphs/tree.svg?width=650=WdLKlKHOAU=150=pr)](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=tree)
   
   ```diff
   @@   Coverage Diff   @@
   ##   master#3765   +/-   ##
   ===
 Coverage   77.67%   77.67%   
   ===
 Files 204  204   
 Lines   1584915849   
   ===
 Hits1231012310   
 Misses   3539 3539
   ```
   
   
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=footer).
 Last update 
[738d27e...1d4f374](https://codecov.io/gh/apache/incubator-airflow/pull/3765?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (AIRFLOW-2245) Add remote_host of SSH/SFTP operator as templated field

2018-08-18 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584754#comment-16584754
 ] 

ASF GitHub Bot commented on AIRFLOW-2245:
-

ckljohn opened a new pull request #3765: [AIRFLOW-2245] Add remote_host of 
SSH/SFTP operator as templated field
URL: https://github.com/apache/incubator-airflow/pull/3765
 
 
   Make sure you have checked _all_ steps below.
   
   ### Jira
   
   - [x] My PR addresses the following [Airflow 
Jira](https://issues.apache.org/jira/browse/AIRFLOW/) issues and references 
them in the PR title. For example, "\[AIRFLOW-XXX\] My Airflow PR"
 - https://issues.apache.org/jira/browse/AIRFLOW-2245
 - In case you are fixing a typo in the documentation you can prepend your 
commit with \[AIRFLOW-XXX\], code changes always need a Jira issue.
   
   ### Description
   
   - [x] Here are some details about my PR, including screenshots of any UI 
changes:
   It added `remote_host` to the templated fields .`remote_host` can be 
obtained at the beginning of the DAG and pass to SSH/SFTP operator with xcom.
   
   ### Tests
   
   - [x] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
   My PR does not add tests because this is a small change, and there are no 
tests currently in-place to test the templated fields.
   
   ### Commits
   
   - [x] My commits all reference Jira issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
 1. Subject is separated from body by a blank line
 1. Subject is limited to 50 characters (not including Jira issue reference)
 1. Subject does not end with a period
 1. Subject uses the imperative mood ("add", not "adding")
 1. Body wraps at 72 characters
 1. Body explains "what" and "why", not "how"
   
   ### Documentation
   
   - [ ] In case of new functionality, my PR adds documentation that describes 
how to use it.
 - When adding new operators/hooks/sensors, the autoclass documentation 
generation needs to be added.
   
   ### Code Quality
   
   - [x] Passes `git diff upstream/master -u -- "*.py" | flake8 --diff`
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Add remote_host of SSH/SFTP operator as templated field
> ---
>
> Key: AIRFLOW-2245
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2245
> Project: Apache Airflow
>  Issue Type: Improvement
>  Components: operators
>Reporter: John Cheng
>Priority: Trivial
>
> The remote host can be obtained at the beginning of the DAG and pass to 
> SSH/SFTP operator with xcom.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (AIRFLOW-2245) Add remote_host of SSH/SFTP operator as templated field

2018-08-18 Thread John Cheng (JIRA)


 [ 
https://issues.apache.org/jira/browse/AIRFLOW-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Cheng updated AIRFLOW-2245:

Issue Type: Improvement  (was: Wish)

> Add remote_host of SSH/SFTP operator as templated field
> ---
>
> Key: AIRFLOW-2245
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2245
> Project: Apache Airflow
>  Issue Type: Improvement
>Reporter: John Cheng
>Priority: Trivial
>
> The remote host can be obtained at the beginning of the DAG and pass to 
> SSH/SFTP operator with xcom.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (AIRFLOW-2245) Add remote_host of SSH/SFTP operator as templated field

2018-08-18 Thread John Cheng (JIRA)


 [ 
https://issues.apache.org/jira/browse/AIRFLOW-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Cheng updated AIRFLOW-2245:

Priority: Trivial  (was: Minor)

> Add remote_host of SSH/SFTP operator as templated field
> ---
>
> Key: AIRFLOW-2245
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2245
> Project: Apache Airflow
>  Issue Type: Wish
>Reporter: John Cheng
>Priority: Trivial
>
> The remote host can be obtained at the beginning of the DAG and pass to 
> SSH/SFTP operator with xcom.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (AIRFLOW-2245) Add remote_host of SSH/SFTP operator as templated field

2018-08-18 Thread John Cheng (JIRA)


 [ 
https://issues.apache.org/jira/browse/AIRFLOW-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Cheng updated AIRFLOW-2245:

Description: The remote host can be obtained at the beginning of the DAG 
and pass to SSH/SFTP operator with xcom.  (was: Add Jinja support for 
remote_host of SSH operator and hook)
Summary: Add remote_host of SSH/SFTP operator as templated field  (was: 
Add Jinja support for remote_host of SSH operator and hook)

> Add remote_host of SSH/SFTP operator as templated field
> ---
>
> Key: AIRFLOW-2245
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2245
> Project: Apache Airflow
>  Issue Type: Wish
>Reporter: John Cheng
>Priority: Minor
>
> The remote host can be obtained at the beginning of the DAG and pass to 
> SSH/SFTP operator with xcom.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] codecov-io commented on issue #3764: [AIRFLOW-2916] Arg `verify` for AwsHook() & S3 sensors/operators

2018-08-18 Thread GitBox
codecov-io commented on issue #3764: [AIRFLOW-2916] Arg `verify` for AwsHook() 
& S3 sensors/operators
URL: 
https://github.com/apache/incubator-airflow/pull/3764#issuecomment-414046987
 
 
   # 
[Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3764?src=pr=h1)
 Report
   > Merging 
[#3764](https://codecov.io/gh/apache/incubator-airflow/pull/3764?src=pr=desc)
 into 
[master](https://codecov.io/gh/apache/incubator-airflow/commit/738d27edfeddb0dc8712cf2e2db72dedb4354821?src=pr=desc)
 will **decrease** coverage by `<.01%`.
   > The diff coverage is `71.42%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/incubator-airflow/pull/3764/graphs/tree.svg?src=pr=650=WdLKlKHOAU=150)](https://codecov.io/gh/apache/incubator-airflow/pull/3764?src=pr=tree)
   
   ```diff
   @@Coverage Diff @@
   ##   master#3764  +/-   ##
   ==
   - Coverage   77.67%   77.66%   -0.01% 
   ==
 Files 204  204  
 Lines   1584915856   +7 
   ==
   + Hits1231012315   +5 
   - Misses   3539 3541   +2
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/incubator-airflow/pull/3764?src=pr=tree) 
| Coverage Δ | |
   |---|---|---|
   | 
[airflow/sensors/s3\_prefix\_sensor.py](https://codecov.io/gh/apache/incubator-airflow/pull/3764/diff?src=pr=tree#diff-YWlyZmxvdy9zZW5zb3JzL3MzX3ByZWZpeF9zZW5zb3IucHk=)
 | `38.88% <0%> (-2.29%)` | :arrow_down: |
   | 
[airflow/sensors/s3\_key\_sensor.py](https://codecov.io/gh/apache/incubator-airflow/pull/3764/diff?src=pr=tree#diff-YWlyZmxvdy9zZW5zb3JzL3MzX2tleV9zZW5zb3IucHk=)
 | `30.3% <0%> (-0.95%)` | :arrow_down: |
   | 
[airflow/operators/redshift\_to\_s3\_operator.py](https://codecov.io/gh/apache/incubator-airflow/pull/3764/diff?src=pr=tree#diff-YWlyZmxvdy9vcGVyYXRvcnMvcmVkc2hpZnRfdG9fczNfb3BlcmF0b3IucHk=)
 | `95.55% <100%> (+0.1%)` | :arrow_up: |
   | 
[airflow/operators/s3\_to\_hive\_operator.py](https://codecov.io/gh/apache/incubator-airflow/pull/3764/diff?src=pr=tree#diff-YWlyZmxvdy9vcGVyYXRvcnMvczNfdG9faGl2ZV9vcGVyYXRvci5weQ==)
 | `94.01% <100%> (+0.05%)` | :arrow_up: |
   | 
[airflow/operators/s3\_file\_transform\_operator.py](https://codecov.io/gh/apache/incubator-airflow/pull/3764/diff?src=pr=tree#diff-YWlyZmxvdy9vcGVyYXRvcnMvczNfZmlsZV90cmFuc2Zvcm1fb3BlcmF0b3IucHk=)
 | `93.87% <100%> (+0.26%)` | :arrow_up: |
   | 
[airflow/operators/s3\_to\_redshift\_operator.py](https://codecov.io/gh/apache/incubator-airflow/pull/3764/diff?src=pr=tree#diff-YWlyZmxvdy9vcGVyYXRvcnMvczNfdG9fcmVkc2hpZnRfb3BlcmF0b3IucHk=)
 | `100% <100%> (ø)` | :arrow_up: |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3764?src=pr=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/incubator-airflow/pull/3764?src=pr=footer).
 Last update 
[738d27e...142e1ff](https://codecov.io/gh/apache/incubator-airflow/pull/3764?src=pr=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Commented] (AIRFLOW-2916) Add argument `verify` for AwsHook() and S3 related sensors/operators

2018-08-18 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/AIRFLOW-2916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16584722#comment-16584722
 ] 

ASF GitHub Bot commented on AIRFLOW-2916:
-

XD-DENG opened a new pull request #3764: [AIRFLOW-2916] Arg `verify` for 
AwsHook() & S3 sensors/operators
URL: https://github.com/apache/incubator-airflow/pull/3764
 
 
   Make sure you have checked _all_ steps below.
   
   ### Jira
   
   - [x] My PR addresses the following [Airflow 
Jira](https://issues.apache.org/jira/browse/AIRFLOW/) issues and references 
them in the PR title. For example, "\[AIRFLOW-XXX\] My Airflow PR"
 - https://issues.apache.org/jira/browse/AIRFLOW-2916
 - In case you are fixing a typo in the documentation you can prepend your 
commit with \[AIRFLOW-XXX\], code changes always need a Jira issue.
   
   ### Description
   
   - [x] Here are some details about my PR, including screenshots of any UI 
changes:
   
   ### Background
   The `AwsHook()` and `S3`-related operators/sensors are depending on package 
`boto3`.
   
   In `boto3`, when we initiate a `client` or a `resource`, argument `verify` 
is provided (https://boto3.readthedocs.io/en/latest/reference/core/session.html 
).
   
   ### Why do we need this
   It is useful when
   - users want to use a different CA cert bundle than the one used by 
`botocore`.
   - users want to have **'--no-verify-ssl'**. This is especially useful when 
we're using on-premises S3 or other implementations of object storage, like 
IBM's Cloud Object Storage.
   
   ### Backward Compatibility
   The default value here is always `None`, which is also the default value in 
`boto3`, so that backward compatibility is ensured too.
   
   ### Tests
   
   - [ ] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
   
   ### Commits
   
   - [ ] My commits all reference Jira issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
 1. Subject is separated from body by a blank line
 1. Subject is limited to 50 characters (not including Jira issue reference)
 1. Subject does not end with a period
 1. Subject uses the imperative mood ("add", not "adding")
 1. Body wraps at 72 characters
 1. Body explains "what" and "why", not "how"
   
   ### Documentation
   
   - [ ] In case of new functionality, my PR adds documentation that describes 
how to use it.
 - When adding new operators/hooks/sensors, the autoclass documentation 
generation needs to be added.
   
   ### Code Quality
   
   - [x] Passes `git diff upstream/master -u -- "*.py" | flake8 --diff`
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Add argument `verify` for AwsHook() and S3 related sensors/operators
> 
>
> Key: AIRFLOW-2916
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2916
> Project: Apache Airflow
>  Issue Type: Improvement
>  Components: hooks, operators
>Reporter: Xiaodong DENG
>Assignee: Xiaodong DENG
>Priority: Minor
>
> The AwsHook() and S3-related operators/sensors are depending on package boto3.
> In boto3, when we initiate a client or a resource, argument `verify` is 
> provided (https://boto3.readthedocs.io/en/latest/reference/core/session.html 
> ).
> It is useful when
>  # users want to use a different CA cert bundle than the one used by botocore.
>  # users want to have '--no-verify-ssl'. This is especially useful when we're 
> using on-premises S3 or other implementations of object storage, like IBM's 
> Cloud Object Storage.
> However, this feature is not provided in Airflow for S3 yet.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] XD-DENG opened a new pull request #3764: [AIRFLOW-2916] Arg `verify` for AwsHook() & S3 sensors/operators

2018-08-18 Thread GitBox
XD-DENG opened a new pull request #3764: [AIRFLOW-2916] Arg `verify` for 
AwsHook() & S3 sensors/operators
URL: https://github.com/apache/incubator-airflow/pull/3764
 
 
   Make sure you have checked _all_ steps below.
   
   ### Jira
   
   - [x] My PR addresses the following [Airflow 
Jira](https://issues.apache.org/jira/browse/AIRFLOW/) issues and references 
them in the PR title. For example, "\[AIRFLOW-XXX\] My Airflow PR"
 - https://issues.apache.org/jira/browse/AIRFLOW-2916
 - In case you are fixing a typo in the documentation you can prepend your 
commit with \[AIRFLOW-XXX\], code changes always need a Jira issue.
   
   ### Description
   
   - [x] Here are some details about my PR, including screenshots of any UI 
changes:
   
   ### Background
   The `AwsHook()` and `S3`-related operators/sensors are depending on package 
`boto3`.
   
   In `boto3`, when we initiate a `client` or a `resource`, argument `verify` 
is provided (https://boto3.readthedocs.io/en/latest/reference/core/session.html 
).
   
   ### Why do we need this
   It is useful when
   - users want to use a different CA cert bundle than the one used by 
`botocore`.
   - users want to have **'--no-verify-ssl'**. This is especially useful when 
we're using on-premises S3 or other implementations of object storage, like 
IBM's Cloud Object Storage.
   
   ### Backward Compatibility
   The default value here is always `None`, which is also the default value in 
`boto3`, so that backward compatibility is ensured too.
   
   ### Tests
   
   - [ ] My PR adds the following unit tests __OR__ does not need testing for 
this extremely good reason:
   
   ### Commits
   
   - [ ] My commits all reference Jira issues in their subject lines, and I 
have squashed multiple commits if they address the same issue. In addition, my 
commits follow the guidelines from "[How to write a good git commit 
message](http://chris.beams.io/posts/git-commit/)":
 1. Subject is separated from body by a blank line
 1. Subject is limited to 50 characters (not including Jira issue reference)
 1. Subject does not end with a period
 1. Subject uses the imperative mood ("add", not "adding")
 1. Body wraps at 72 characters
 1. Body explains "what" and "why", not "how"
   
   ### Documentation
   
   - [ ] In case of new functionality, my PR adds documentation that describes 
how to use it.
 - When adding new operators/hooks/sensors, the autoclass documentation 
generation needs to be added.
   
   ### Code Quality
   
   - [x] Passes `git diff upstream/master -u -- "*.py" | flake8 --diff`
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services


[jira] [Created] (AIRFLOW-2916) Add argument `verify` for AwsHook() and S3 related sensors/operators

2018-08-18 Thread Xiaodong DENG (JIRA)
Xiaodong DENG created AIRFLOW-2916:
--

 Summary: Add argument `verify` for AwsHook() and S3 related 
sensors/operators
 Key: AIRFLOW-2916
 URL: https://issues.apache.org/jira/browse/AIRFLOW-2916
 Project: Apache Airflow
  Issue Type: Improvement
  Components: hooks, operators
Reporter: Xiaodong DENG
Assignee: Xiaodong DENG


The AwsHook() and S3-related operators/sensors are depending on package boto3.

In boto3, when we initiate a client or a resource, argument `verify` is 
provided (https://boto3.readthedocs.io/en/latest/reference/core/session.html ).

It is useful when
 # users want to use a different CA cert bundle than the one used by botocore.
 # users want to have '--no-verify-ssl'. This is especially useful when we're 
using on-premises S3 or other implementations of object storage, like IBM's 
Cloud Object Storage.

However, this feature is not provided in Airflow for S3 yet.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)