[ https://issues.apache.org/jira/browse/AIRFLOW-1756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16219459#comment-16219459 ]
Colin Son commented on AIRFLOW-1756:
------------------------------------

[~ashb] Another issue I wanted to point out is that the unit tests for the S3TaskHandler are broken. You can find them here:

incubator-airflow/tests/utils/log/test_logging.py

This test module is not part of a Python package, so the tests don't run. The tests also fail because the S3TaskHandler object is not initialized correctly (it is not given the arguments that its __init__ needs). In addition, the method names the tests try to exercise are all misspelled. So the unit tests should be revisited as well.

Thanks!

> S3 Task Handler Cannot Read Logs With New S3Hook
> ------------------------------------------------
>
>                 Key: AIRFLOW-1756
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-1756
>             Project: Apache Airflow
>          Issue Type: Bug
>    Affects Versions: 1.9.0
>            Reporter: Colin Son
>             Fix For: 1.9.0
>
>
> With the changes to the S3Hook, the S3TaskHandler can no longer read the S3 task logs.
> In `s3_read` in S3TaskHandler.py:
> {code}
> s3_key = self.hook.get_key(remote_log_location)
> if s3_key:
>     return s3_key.get_contents_as_string().decode()
> {code}
> Since the s3_key object is now a dict, you cannot call
> `get_contents_as_string()` on it. You have to use the S3Hook's
> `read_key()` method to read the contents of the task logs now.
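
A minimal sketch of the change suggested in the description, not the actual patch. To keep it runnable without Airflow or AWS credentials, it uses `StubS3Hook`, a hypothetical stand-in that only mimics the two S3Hook methods discussed here (`get_key()` returning a dict, and `read_key()` returning the decoded contents). The free function `s3_read` and its return-empty-string-on-failure behaviour are likewise assumptions for illustration.

```python
class StubS3Hook:
    """Hypothetical stand-in for S3Hook, for illustration only."""

    def __init__(self, objects):
        # Maps a remote log location to the raw bytes stored there.
        self._objects = objects

    def get_key(self, key, bucket_name=None):
        # Mimics the behaviour reported in the issue: a plain dict,
        # which has no get_contents_as_string() method.
        return {"Body": self._objects[key]}

    def read_key(self, key, bucket_name=None):
        # Returns the object's contents as a decoded string.
        return self._objects[key].decode("utf-8")


def s3_read(hook, remote_log_location):
    """Read a remote task log through read_key() instead of get_key()."""
    try:
        return hook.read_key(remote_log_location)
    except Exception:
        # Assumption: treat any read failure as "no log available".
        return ""


hook = StubS3Hook({"s3://bucket/dag/task/1.log": b"task log line\n"})

# The old code path breaks because the returned dict lacks the boto2 method.
old_style_key = hook.get_key("s3://bucket/dag/task/1.log")
print(hasattr(old_style_key, "get_contents_as_string"))

# The suggested code path reads the contents directly.
print(s3_read(hook, "s3://bucket/dag/task/1.log"))
```

A unit test for the handler could follow the same shape: construct the handler with the arguments its __init__ needs, replace its hook with a stub or mock, and assert on the string that `s3_read` returns.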