[ https://issues.apache.org/jira/browse/BEAM-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16775906#comment-16775906 ]
Matt Casters commented on BEAM-6706:
------------------------------------
OK, so it looks like there was some more history in StackDriver, and I found a
dreaded StackOverflowError (SOE) in SLF4J:
{{D Debug: download complete }}
{{I Exception }}
{{I in thread "main" }}
{{I java.lang.StackOverflowError }}
{{I at java.util.concurrent.ConcurrentHashMap.get(ConcurrentHashMap.java:936) }}
{{I at org.slf4j.impl.Log4jLoggerFactory.getLogger(Log4jLoggerFactory.java:58) }}
{{I at org.slf4j.LoggerFactory.getLogger(LoggerFactory.java:277) }}
{{I at org.apache.log4j.Category.<init>(Category.java:57) }}
{{I at org.apache.log4j.Logger.<init>(Logger.java:37) }}
{{I at org.apache.log4j.Log4jLoggerFactory.getLogger(Log4jLoggerFactory.java:43) }}
{{I at org.apache.log4j.LogManager.getLogger(LogManager.java:45) }}
{{I at org.slf4j.impl.Log4jLoggerFactory.getLogger(Log4jLoggerFactory.java:66) }}
{{I at org.slf4j.LoggerFactory.getLogger(LoggerFactory.java:277) }}
{{I at org.apache.log4j.Category.<init>(Category.java:57) }}
{{I at org.apache.log4j.Logger.<init>(Logger.java:37) }}
{{I at org.apache.log4j.Log4jLoggerFactory.getLogger(Log4jLoggerFactory.java:43) }}
{{I at org.apache.log4j.LogManager.getLogger(LogManager.java:45) }}
{{I at org.slf4j.impl.Log4jLoggerFactory.getLogger(Log4jLoggerFactory.java:66)}}
{{...}}
This is kind of strange since I didn't change anything in my dependencies.
That in turn got me looking into what could possibly be giving SLF4J the
run-around.
In the end, the only extra dependency that got dragged in was:
{{flogger-system-backend-0.3.1.jar}}
I'm guessing that some code changed and there really was a need to use a fluent
logging style, and that to get it to work something else got configured
somewhere, causing the stack overflow.
I haven't figured out what exactly this change is, but I'll keep looking.
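One way to narrow this down: the trace alternates between two different Log4jLoggerFactory classes. As far as is known from the SLF4J jars, org.slf4j.impl.Log4jLoggerFactory ships in slf4j-log4j12 (SLF4J routed to log4j), while org.apache.log4j.Log4jLoggerFactory ships in log4j-over-slf4j (log4j routed to SLF4J), and having both on one classpath makes them delegate to each other endlessly. Whether that is what happened here, and which dependency pulled in the second jar, is not established. The following is only a minimal sketch, assuming it is run with the same classpath as the failing job; the class name Slf4jBridgeCheck and its helper methods are made up for illustration.

```java
import java.net.URL;

class Slf4jBridgeCheck {
    // Both class names are copied from the stack trace above.
    // Assumption: the first ships in slf4j-log4j12, the second in log4j-over-slf4j.
    static final String BINDING = "org.slf4j.impl.Log4jLoggerFactory";
    static final String BRIDGE = "org.apache.log4j.Log4jLoggerFactory";

    // Returns the classpath location of a class file, or null if it is absent.
    // The class is looked up as a resource, so it is never loaded or initialized.
    static String locate(String className) {
        URL url = ClassLoader.getSystemResource(className.replace('.', '/') + ".class");
        return url == null ? null : url.toString();
    }

    // The mutual delegation seen in the trace needs both classes to be visible.
    static boolean canLoop(String bindingLocation, String bridgeLocation) {
        return bindingLocation != null && bridgeLocation != null;
    }

    public static void main(String[] args) {
        String binding = locate(BINDING);
        String bridge = locate(BRIDGE);
        System.out.println(BINDING + " -> " + binding);
        System.out.println(BRIDGE + " -> " + bridge);
        System.out.println(canLoop(binding, bridge)
                ? "Both classes are on the classpath; the printed jars are candidates to exclude."
                : "At most one of the two classes is on the classpath.");
    }
}
```

The printed URLs name the jars that contain each class, which should make it easier to see which dependency brought them in.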
> User reports trouble downloading 2.10.0 Dataflow worker image
> -------------------------------------------------------------
>
> Key: BEAM-6706
> URL: https://issues.apache.org/jira/browse/BEAM-6706
> Project: Beam
> Issue Type: Bug
> Components: runner-dataflow
> Reporter: Kenneth Knowles
> Assignee: Matt Casters
> Priority: Major
>
> Dataflow, however, is throwing all sorts of errors. For example:
> * Handler for GET
> /v1.27/images/gcr.io/cloud-dataflow/v1beta3/beam-java-batch:beam-2.10.0/json
> returned error: No such image:
> gcr.io/cloud-dataflow/v1beta3/beam-java-batch:beam-2.10.0
> * while reading 'google-dockercfg' metadata: http status code: 404 while
> fetching url
> http://metadata.google.internal./computeMetadata/v1/instance/attributes/google-dockercfg
> * Error syncing pod...
> The job gets stuck after starting a worker and after an hour or so it gives
> up with a failure. 2.9.0 runs fine.