Hi Nick,

Thanks for the reply. It looks like we could be hitting something similar: we are running the Ceph Jewel packages on the Glance node.

Thanks for the links to the bug reports.

Regards,


On 31/01/17 11:35, Nick Jones wrote:
Hi Grant.

Could be unrelated but I'm throwing it out there anyway as we had similar 'weirdness' with Glance recently... If you're running the Ceph Jewel or Hammer packages on your Glance node then you might be running into this bug, in which image.stat() calls in librbdpy fail intermittently:

http://tracker.ceph.com/issues/17310

There's a corresponding bug report on Launchpad for the Ubuntu Ceph package that has some more detail:

https://bugs.launchpad.net/ubuntu/+source/ceph/+bug/1625489
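
One rough way to check whether a given node shows the intermittent image.stat() failure is to call it many times against an existing image and count the errors. The following is only a sketch: the function names are made up for illustration, the config path /etc/ceph/ceph.conf is an assumption, and it assumes a failed call surfaces as a Python exception.

```python
def count_failures(call, attempts=100):
    """Invoke `call` repeatedly and collect (attempt index, error) pairs."""
    failures = []
    for i in range(attempts):
        try:
            call()
        except Exception as exc:  # record the failure and keep probing
            failures.append((i, repr(exc)))
    return failures


def probe_image_stat(pool, image_name, attempts=100,
                     conffile="/etc/ceph/ceph.conf"):
    """Open an RBD image read-only and call stat() on it `attempts` times.

    `conffile` is an assumed default; adjust it to the local setup.
    """
    # Imported lazily so count_failures() works without the Ceph bindings.
    import rados
    import rbd

    cluster = rados.Rados(conffile=conffile)
    cluster.connect()
    try:
        ioctx = cluster.open_ioctx(pool)
        try:
            image = rbd.Image(ioctx, image_name, read_only=True)
            try:
                return count_failures(image.stat, attempts)
            finally:
                image.close()
        finally:
            ioctx.close()
    finally:
        cluster.shutdown()
```

Run on the Glance node as, for example, probe_image_stat("images", "<image id>"); an empty list means no stat() call failed during the probe, which does not by itself rule the bug out.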

Cheers.

--

-Nick

On 31 January 2017 at 10:49, Grant Morley <[email protected] <mailto:[email protected]>> wrote:

    Hi All,

    Not sure if anyone has come across this yet. Yesterday we started
    to try to upload some images to our stack and they were failing
    with the following error when trying via command line:

    Error finding address for
    http://10.6.0.3:9292/v2/images/e90a4626-4781-4b53-8914-85ff2129f777/file:
    [Errno 32] Broken pipe

    In the Glance logs at the same time we see this:

    2017-01-31 10:22:45.005 2096 INFO eventlet.wsgi.server
    [req-1837bce5-a90c-44d5-bf65-c1e3a534c793
    4c91f07132454a97b21fff35402b7825 4a6213a64312482896130efc3047195c
    - - -] 10.6.1.223,10.6.0.39 - - [31/Jan/2017 10:22:45] "HEAD
    /v1/images/e90a4626-4781-4b53-8914-85ff2129f777 HTTP/1.1" 200 689
    0.046623
    2017-01-31 10:22:47.595 2099 INFO eventlet.wsgi.server [-]
    10.6.0.40 - - [31/Jan/2017 10:22:47] "OPTIONS /versions HTTP/1.0"
    200 785 0.001113
    2017-01-31 10:22:47.720 2095 INFO eventlet.wsgi.server
    [req-873dd7df-f8a4-4443-8795-b81dcd54f412
    4c91f07132454a97b21fff35402b7825 4a6213a64312482896130efc3047195c
    - - -] 10.6.1.223,10.6.0.39 - - [31/Jan/2017 10:22:47] "HEAD
    /v1/images/e90a4626-4781-4b53-8914-85ff2129f777 HTTP/1.1" 200 689
    0.032188

    Interestingly, when you delete the failed image you get this in
    the glance logs:

    2017-01-31 10:23:16.188 2099 INFO eventlet.wsgi.server
    [req-c2191fe1-64de-4252-b4a2-e84643dbed4c
    4c91f07132454a97b21fff35402b7825 4a6213a64312482896130efc3047195c
    - - -] 10.6.2.190,10.6.0.39 - - [31/Jan/2017 10:23:16] "DELETE
    /v2/images/e90a4626-4781-4b53-8914-85ff2129f777 HTTP/1.1" 204 208 0.08
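
To see which API version each request in log excerpts like the ones above went through, the request lines can be tallied by HTTP method and version prefix. This is a minimal sketch; tally_api_versions is a hypothetical helper, and the regular expression assumes the eventlet.wsgi.server request format shown in these logs, including entries wrapped across lines.

```python
import re
from collections import Counter

# Matches the quoted request line, e.g. "HEAD /v1/images/<id> HTTP/1.1".
# \s+ also spans the line breaks introduced by wrapped log entries.
REQUEST_RE = re.compile(
    r'"(?P<method>[A-Z]+)\s+/(?P<version>v\d+)/\S+\s+HTTP/[\d.]+"'
)


def tally_api_versions(log_text):
    """Count requests per (HTTP method, API version) pair."""
    return Counter(
        (m.group("method"), m.group("version"))
        for m in REQUEST_RE.finditer(log_text)
    )
```

Requests without a version prefix, such as "OPTIONS /versions", are deliberately not counted.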


    It seems to be using V1 to try to upload and V2 to remove?
    However, the image doesn't actually get deleted, because it
    remains in ceph:

    rbd -p images ls | grep e90a4626-4781-4b53-8914-85ff2129f777
    e90a4626-4781-4b53-8914-85ff2129f777
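
The same check can be scripted, for example to confirm after each delete that the image has really gone from the pool. A minimal sketch, assuming the rbd CLI is on the PATH with working credentials; image_in_listing and image_in_pool are hypothetical names. Unlike the grep above, it matches whole image names only.

```python
import subprocess


def image_in_listing(listing, image_id):
    """Return True if image_id appears as a full line in `rbd ls` output."""
    return image_id in {line.strip() for line in listing.splitlines()}


def image_in_pool(pool, image_id):
    """Run `rbd -p <pool> ls` and check whether image_id is listed."""
    result = subprocess.run(
        ["rbd", "-p", pool, "ls"],
        check=True,           # raise if the rbd command itself fails
        capture_output=True,
        text=True,
    )
    return image_in_listing(result.stdout, image_id)
```

For the case above this would be image_in_pool("images", "e90a4626-4781-4b53-8914-85ff2129f777").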

    It is almost as if the image uploads and is then forgotten about.

    Our compute nodes are also ceph backed and they are working
    absolutely fine; it is just Glance with ceph that has all of a
    sudden stopped working. Just wondered if anyone had any ideas?

    Regards,


-- Grant Morley
    Cloud Lead
    Absolute DevOps Ltd
    Units H, J & K, Gateway 1000, Whittle Way, Stevenage, Herts, SG1 2FP
    www.absolutedevops.io <http://www.absolutedevops.io/>
    [email protected] <mailto:[email protected]> 0845 874 0580

    _______________________________________________
    OpenStack-operators mailing list
    [email protected]
    <mailto:[email protected]>
    http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
    <http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators>



DataCentred Limited registered in England and Wales no. 05611763

--
Grant Morley
Cloud Lead
Absolute DevOps Ltd
Units H, J & K, Gateway 1000, Whittle Way, Stevenage, Herts, SG1 2FP
www.absolutedevops.io <http://www.absolutedevops.io/> [email protected] <mailto:[email protected]> 0845 874 0580