[
https://issues.apache.org/jira/browse/BEAM-7934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
James Hutchison updated BEAM-7934:
----------------------------------
Description:
Using the dataflow runner, log messages always show up in stackdriver with the
step_id as the empty string, so filtering log messages for a step doesn't work.
{code:java}
resource: {
labels: {
job_id: "<job id>"
job_name: "<job name>"
project_id: "<project id>"
region: "<region>"
step_id: ""
}
type: "dataflow_step"
}{code}
Another user seems to have posted in the old github repo and appears to be
seeing the same problem based on their output:
[https://github.com/GoogleCloudPlatform/DataflowPythonSDK/issues/62]
>From what I can tell is only affecting streaming pipelines
was:
Using the dataflow runner, log messages always show up in stackdriver with the
step_id as the empty string, so filtering log messages for a step doesn't work.
{code:java}
resource: {
labels: {
job_id: "<job id>"
job_name: "<job name>"
project_id: "<project id>"
region: "<region>"
step_id: ""
}
type: "dataflow_step"
}{code}
Another user seems to have posted in the old github repo and appears to be
seeing the same problem based on their output:
[https://github.com/GoogleCloudPlatform/DataflowPythonSDK/issues/62]
> Dataflow Python SDK logging: step_id is always empty string
> -----------------------------------------------------------
>
> Key: BEAM-7934
> URL: https://issues.apache.org/jira/browse/BEAM-7934
> Project: Beam
> Issue Type: Bug
> Components: runner-dataflow, sdk-py-core
> Affects Versions: 2.13.0
> Reporter: James Hutchison
> Assignee: Ahmet Altay
> Priority: Major
>
> Using the dataflow runner, log messages always show up in stackdriver with
> the step_id as the empty string, so filtering log messages for a step doesn't
> work.
> {code:java}
> resource: {
> labels: {
> job_id: "<job id>"
> job_name: "<job name>"
> project_id: "<project id>"
> region: "<region>"
> step_id: ""
> }
> type: "dataflow_step"
> }{code}
> Another user seems to have posted in the old github repo and appears to be
> seeing the same problem based on their output:
> [https://github.com/GoogleCloudPlatform/DataflowPythonSDK/issues/62]
> From what I can tell is only affecting streaming pipelines
--
This message was sent by Atlassian JIRA
(v7.6.14#76016)