[ https://issues.apache.org/jira/browse/BEAM-7934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Hutchison updated BEAM-7934: ---------------------------------- Description: Using the dataflow runner, log messages always show up in stackdriver with the step_id as the empty string, so filtering log messages for a step doesn't work. {code:java} resource: { labels: { job_id: "<job id>" job_name: "<job name>" project_id: "<project id>" region: "<region>" step_id: "" } type: "dataflow_step" }{code} Another user seems to have posted in the old github repo and appears to be seeing the same problem based on their output: [https://github.com/GoogleCloudPlatform/DataflowPythonSDK/issues/62] >From what I can tell is only affecting streaming pipelines was: Using the dataflow runner, log messages always show up in stackdriver with the step_id as the empty string, so filtering log messages for a step doesn't work. {code:java} resource: { labels: { job_id: "<job id>" job_name: "<job name>" project_id: "<project id>" region: "<region>" step_id: "" } type: "dataflow_step" }{code} Another user seems to have posted in the old github repo and appears to be seeing the same problem based on their output: [https://github.com/GoogleCloudPlatform/DataflowPythonSDK/issues/62] > Dataflow Python SDK logging: step_id is always empty string > ----------------------------------------------------------- > > Key: BEAM-7934 > URL: https://issues.apache.org/jira/browse/BEAM-7934 > Project: Beam > Issue Type: Bug > Components: runner-dataflow, sdk-py-core > Affects Versions: 2.13.0 > Reporter: James Hutchison > Assignee: Ahmet Altay > Priority: Major > > Using the dataflow runner, log messages always show up in stackdriver with > the step_id as the empty string, so filtering log messages for a step doesn't > work. > {code:java} > resource: { > labels: { > job_id: "<job id>" > job_name: "<job name>" > project_id: "<project id>" > region: "<region>" > step_id: "" > } > type: "dataflow_step" > }{code} > Another user seems to have posted in the old github repo and appears to be > seeing the same problem based on their output: > [https://github.com/GoogleCloudPlatform/DataflowPythonSDK/issues/62] > From what I can tell is only affecting streaming pipelines -- This message was sent by Atlassian JIRA (v7.6.14#76016)