attilapiros commented on pull request #31923: URL: https://github.com/apache/spark/pull/31923#issuecomment-806654897
@Ngone51 I checked the PR you mentioned. My findings based on [the last failure](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/41014/consoleFull). Here the first error is: ``` - Launcher client dependencies - SPARK-33615: Launcher client archives *** FAILED *** io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: GET at: https://192.168.39.147:8443/api/v1/namespaces/09e9c94160d543c1a338f364722d49a6/pods/spark-test-app-c80c78f512574b36b2608f7d92c24503/log?pretty=false. Message: container "spark-kubernetes-driver" in pod "spark-test-app-c80c78f512574b36b2608f7d92c24503" is waiting to start: trying and failing to pull image. Received status: Status(apiVersion=v1, code=400, details=null, kind=Status, message=container "spark-kubernetes-driver" in pod "spark-test-app-c80c78f512574b36b2608f7d92c24503" is waiting to start: trying and failing to pull image, metadata=ListMeta(_continue=null, remainingItemCount=null, resourceVersion=null, selfLink=null, additionalProperties={}), reason=BadRequest, status=Failure, additionalProperties={}). at io.fabric8.kubernetes.client.dsl.base.OperationSupport.requestFailure(OperationSupport.java:570) at io.fabric8.kubernetes.client.dsl.base.OperationSupport.assertResponseCode(OperationSupport.java:509) at io.fabric8.kubernetes.client.dsl.internal.core.v1.PodOperationsImpl.doGetLog(PodOperationsImpl.java:189) at io.fabric8.kubernetes.client.dsl.internal.core.v1.PodOperationsImpl.getLog(PodOperationsImpl.java:198) at io.fabric8.kubernetes.client.dsl.internal.core.v1.PodOperationsImpl.getLog(PodOperationsImpl.java:85) at org.apache.spark.deploy.k8s.integrationtest.KubernetesSuite.$anonfun$logForFailedTest$3(KubernetesSuite.scala:89) at org.apache.spark.internal.Logging.logInfo(Logging.scala:57) at org.apache.spark.internal.Logging.logInfo$(Logging.scala:56) at org.apache.spark.SparkFunSuite.logInfo(SparkFunSuite.scala:61) at org.apache.spark.deploy.k8s.integrationtest.KubernetesSuite.$anonfun$logForFailedTest$2(KubernetesSuite.scala:86) ... ``` Checking the last successful and first failed ones we can see just a very few differences in the code: the failed one uses a `--archives` and the successful uses `--files`. I do not think this difference could lead to an error such "is waiting to start: trying and failing to pull image." -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
