[ 
https://issues.apache.org/jira/browse/BEAM-3072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16803038#comment-16803038
 ] 

niklas Hansson commented on BEAM-3072:
--------------------------------------

I have requested to become a contributor and plan to pick this task up 
afterwards. My initial plan is to replace the subprocess.check_call() call in 
processes.py with subprocess.check_output(), capture the error message from 
the subprocess, and include it in the raised error. Super happy for any 
feedback :) 
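
A minimal sketch of that plan, for discussion only. The helper name 
check_call_with_output and the choice of RuntimeError are assumptions made 
for illustration, not the existing Beam code; only subprocess.check_output(), 
subprocess.STDOUT and subprocess.CalledProcessError are standard library APIs. 

```python
import subprocess


def check_call_with_output(*args, **kwargs):
    """Hypothetical wrapper: run a command and surface its output on failure."""
    # Merge stderr into stdout so the error text is captured as well,
    # unless the caller explicitly chose a different stderr target.
    kwargs.setdefault("stderr", subprocess.STDOUT)
    try:
        return subprocess.check_output(*args, **kwargs)
    except subprocess.CalledProcessError as error:
        output = error.output
        if isinstance(output, bytes):
            output = output.decode("utf-8", errors="replace")
        # Include the command, exit status and captured output in the message
        # so users can see why the external process failed.
        raise RuntimeError(
            "Command {} failed with exit status {}.\n"
            "Output from the command:\n{}".format(
                error.cmd, error.returncode, output))
```

With something like this, a failing pip invocation during staging would raise 
an error that contains pip's own output instead of only the exit status. 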

> Improve error handling at staging time for DataflowRunner
> ---------------------------------------------------------
>
>                 Key: BEAM-3072
>                 URL: https://issues.apache.org/jira/browse/BEAM-3072
>             Project: Beam
>          Issue Type: Bug
>          Components: sdk-py-core
>            Reporter: Ahmet Altay
>            Priority: Minor
>              Labels: starter, triaged
>
> dependency.py calls out to external processes to collect dependencies:
> https://github.com/apache/beam/blob/de7cc05cc67d1aa6331cddc17c2e02ed0efbe37d/sdks/python/apache_beam/runners/dataflow/internal/dependency.py#L263
> If these calls fail, the error is not clear. It only reports which call 
> failed and does not show the actual error message, so it is not helpful 
> for users.
> As a general fix, processes.py should collect and report the output of 
> failed processes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
