[
https://issues.apache.org/jira/browse/MESOS-6810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15756414#comment-15756414
]
Yu Yang commented on MESOS-6810:
--------------------------------
{quote}
* Trying 52.45.221.131...
* Connected to registry-1.docker.io (52.45.221.131) port 443 (#0)
* found 173 certificates in /etc/ssl/certs/ca-certificates.crt
* found 697 certificates in /etc/ssl/certs
* ALPN, offering http/1.1
* SSL connection using TLS1.2 / ECDHE_RSA_AES_128_GCM_SHA256
* server certificate verification OK
* server certificate status verification SKIPPED
* common name: *.docker.io (matched)
* server certificate expiration date OK
* server certificate activation date OK
* certificate public key: RSA
* certificate version: #3
* subject: OU=GT98568428,OU=See www.rapidssl.com/resources/cps
(c)15,OU=Domain Control Validated - RapidSSL(R),CN=*.docker.io
* start date: Thu, 19 Mar 2015 17:34:32 GMT
* expire date: Sat, 21 Apr 2018 01:51:52 GMT
* issuer: C=US,O=GeoTrust Inc.,CN=RapidSSL SHA256 CA - G3
* compression: NULL
* ALPN, server did not agree to a protocol
> GET /v2/nvidia/cuda/manifests/latest HTTP/1.1
> Host: registry-1.docker.io
> User-Agent: curl/7.47.0
> Accept: */*
>
< HTTP/1.1 401 Unauthorized
< Content-Type: application/json; charset=utf-8
< Docker-Distribution-Api-Version: registry/2.0
< Www-Authenticate: Bearer
realm="https://auth.docker.io/token",service="registry.docker.io",scope="repository:nvidia/cuda:pull"
< Date: Sat, 17 Dec 2016 05:50:45 GMT
< Content-Length: 143
< Strict-Transport-Security: max-age=31536000
<
{"errors":[{"code":"UNAUTHORIZED","message":"authentication
required","detail":[{"Type":"repository","Name":"nvidia/cuda","Action":"pull"}]}]}
* Connection #0 to host registry-1.docker.io left intact
{quote}
> Tasks getting stuck in STAGING state when using unified containerizer
> ---------------------------------------------------------------------
>
> Key: MESOS-6810
> URL: https://issues.apache.org/jira/browse/MESOS-6810
> Project: Mesos
> Issue Type: Bug
> Components: containerization, docker
> Affects Versions: 1.0.0, 1.0.1, 1.1.0
> Environment: *OS*: ubuntu16.04 64bit
> *mesos*: 1.1.0, one master and one agent on same machine
> *Agent flag*: {{sudo ./bin/mesos-agent.sh --master=192.168.1.192:5050
> --work_dir=/tmp/mesos_slave --image_providers=docker
> --isolation=docker/runtime,filesystem/linux,cgroups/devices,gpu/nvidia
> --containerizers=mesos,docker --executor_environment_variables="{}"}}
> Reporter: Yu Yang
>
> when submit tasks using container settings like:
> {code}
> {
> "container": {
> "mesos": {
> "image": {
> "docker": {
> "name": "nvidia/cuda"
> },
> "type": "DOCKER"
> }
> },
> "type": "MESOS"
> },
> }
> {code}
> then task will get stuck in STAGING state, and finally it will fail with
> message {{Failed to launch container: Collect failed: Failed to perform
> 'curl': curl: (56) GnuTLS recv error (-54): Error in pull function}}
> this is the related log on
> agent
> {quote}
> I1217 13:05:35.406365 20780 slave.cpp:1539] Got assigned task
> 'mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591' for framework
> 02083c57-b2d9-4054-babe-90e962816813-0001
> I1217 13:05:35.406749 20780 slave.cpp:1701] Launching task
> 'mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591' for framework
> 02083c57-b2d9-4054-babe-90e962816813-0001
> I1217 13:05:35.406970 20780 paths.cpp:536] Trying to chown
> '/tmp/mesos_slave/slaves/02083c57-b2d9-4054-babe-90e962816813-S0/frameworks/02083c57-b2d9-4054-babe-90e962816813-0001/executors/mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591/runs/8be3b5cd-afa3-4189-aa2a-f09d73529f8c'
> to user 'root'
> I1217 13:05:35.409272 20780 slave.cpp:6179] Launching executor
> 'mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591' of framework
> 02083c57-b2d9-4054-babe-90e962816813-0001 with resources cpus(*):0.1;
> mem(*):32 in work directory
> '/tmp/mesos_slave/slaves/02083c57-b2d9-4054-babe-90e962816813-S0/frameworks/02083c57-b2d9-4054-babe-90e962816813-0001/executors/mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591/runs/8be3b5cd-afa3-4189-aa2a-f09d73529f8c'
> I1217 13:05:35.409958 20780 slave.cpp:1987] Queued task
> 'mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591' for executor
> 'mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591' of framework
> 02083c57-b2d9-4054-babe-90e962816813-0001
> I1217 13:05:35.410163 20779 docker.cpp:1000] Skipping non-docker container
> I1217 13:05:35.410636 20776 containerizer.cpp:938] Starting container
> 8be3b5cd-afa3-4189-aa2a-f09d73529f8c for executor
> 'mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591' of framework
> 02083c57-b2d9-4054-babe-90e962816813-0001
> I1217 13:05:44.459362 20778 slave.cpp:4992] Terminating executor
> ''cuda_mesos_nvidia_tf.72e9b9cf-8220-49bd-86fe-1667ee5e7a02' of framework
> 02083c57-b2d9-4054-babe-90e962816813-0001' because it did not register within
> 1mins
> I1217 13:05:53.586819 20780 slave.cpp:5044] Current disk usage 63.59%. Max
> allowed age: 1.848503351525151days
> I1217 13:06:35.410905 20777 slave.cpp:4992] Terminating executor
> ''mesos_containerizer_test.2a845a72-7b54-4a95-b6fa-6aeda8c6b591' of framework
> 02083c57-b2d9-4054-babe-90e962816813-0001' because it did not register within
> 1mins
> I1217 13:06:35.411175 20780 containerizer.cpp:1950] Destroying container
> 8be3b5cd-afa3-4189-aa2a-f09d73529f8c in PROVISIONING state
> {quote}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)