peay created BEAM-2326:
--------------------------

             Summary: Verbose INFO logging with stateful DoFns and Dataflow 
                 Key: BEAM-2326
                 URL: https://issues.apache.org/jira/browse/BEAM-2326
             Project: Beam
          Issue Type: Bug
          Components: runner-dataflow
    Affects Versions: 0.6.0
            Reporter: peay
            Assignee: Daniel Halperin


I am seeing a lot of INFO-level logging:

{code}
jsonPayload: {
  logger: "com.google.cloud.dataflow.worker.runners.worker.BatchModeUngroupingParDoFn"
  message: "Processing timers for key {} for stateful DoFn"
}
jsonPayload: {
  message: "Processing key KV{one of my keys} for stateful DoFn"
  logger: "com.google.cloud.dataflow.worker.runners.worker.BatchModeUngroupingParDoFn"
}
{code}

from one of my stateful DoFns. There is one such group of log entries for each 
key I process, which produces a very large volume of logs and possibly a 
significant slowdown.

Also, I am not sure whether the {{Processing timers}} log message is missing 
some string interpolation or whether the empty key is intentional.
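
To illustrate why the literal {{{}}} looks like a missing argument: with SLF4J-style placeholders, a {{{}}} that has no matching argument is emitted unchanged. The sketch below is a self-contained imitation of that behavior, not the Dataflow worker's code; {{PlaceholderSketch}} and its {{format}} helper are hypothetical names, and I am only assuming the worker logs through an SLF4J-like API.

```java
public class PlaceholderSketch {
    // Hypothetical helper imitating SLF4J-style "{}" substitution:
    // each "{}" is replaced by the next argument, left to right.
    // Placeholders without a matching argument are left untouched.
    public static String format(String pattern, Object... args) {
        StringBuilder out = new StringBuilder();
        int from = 0;
        for (Object arg : args) {
            int at = pattern.indexOf("{}", from);
            if (at < 0) {
                break; // more arguments than placeholders
            }
            out.append(pattern, from, at).append(arg);
            from = at + 2;
        }
        return out.append(pattern.substring(from)).toString();
    }

    public static void main(String[] args) {
        String pattern = "Processing timers for key {} for stateful DoFn";
        // No argument supplied: the placeholder survives, as in the log above.
        System.out.println(format(pattern));
        // Argument supplied: the key is interpolated.
        System.out.println(format(pattern, "KV{some-key}"));
    }
}
```

If the worker call site passes no key argument, this would be consistent with the message I am seeing, but I have not checked the worker source.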

At any rate, this seems more like something for {{DEBUG}} than {{INFO}} given 
the large volume.
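
As a rough illustration of the difference this would make, the sketch below counts how many per-key records reach a handler when the logger threshold stays at INFO. It uses {{java.util.logging}} as a stand-in (its {{FINE}} level plays the role of {{DEBUG}}); the logger name and the {{LevelSketch}} class are made up for the example, and the Dataflow worker's real logging setup is not shown here.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Handler;
import java.util.logging.Level;
import java.util.logging.LogRecord;
import java.util.logging.Logger;

public class LevelSketch {
    // Logs one message per key at the given level and returns how many
    // records were actually published, with the logger threshold at INFO.
    public static int publishedRecords(Level level, int keys) {
        Logger logger = Logger.getLogger("sketch.StatefulDoFnLogging"); // made-up name
        logger.setUseParentHandlers(false); // keep the console quiet
        logger.setLevel(Level.INFO);

        List<LogRecord> seen = new ArrayList<>();
        Handler collector = new Handler() {
            @Override public void publish(LogRecord record) { seen.add(record); }
            @Override public void flush() {}
            @Override public void close() {}
        };
        logger.addHandler(collector);
        try {
            for (int i = 0; i < keys; i++) {
                logger.log(level, "Processing key {0} for stateful DoFn", "key-" + i);
            }
        } finally {
            logger.removeHandler(collector);
        }
        return seen.size();
    }

    public static void main(String[] args) {
        System.out.println("INFO: " + publishedRecords(Level.INFO, 1000));
        System.out.println("FINE: " + publishedRecords(Level.FINE, 1000));
    }
}
```

With the message at INFO every key produces a record; at the lower level the records are dropped before reaching any handler.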




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
