[Yahoo-eng-team] [Bug 1692090] Re: _dn_to_id ignores user_id_attribute

2018-06-08 Thread Launchpad Bug Tracker
[Expired for OpenStack Identity (keystone) because there has been no
activity for 60 days.]

** Changed in: keystone
   Status: Incomplete => Expired

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Identity (keystone).
https://bugs.launchpad.net/bugs/1692090

Title:
  _dn_to_id ignores user_id_attribute

Status in OpenStack Identity (keystone):
  Expired

Bug description:
  _dn_to_id is not affected when user_id_attribute is changed in keystone.conf.
  
https://github.com/openstack/keystone/blob/c3ca06ff47cced16ea9de3d6ef1a6c583bb3cf38/keystone/identity/backends/ldap/common.py#L1280

  Considering the following LDAP directory:

  ...
  # userid, Users, openstack.org
  dn: cn=userid,ou=Users,dc=openstack,dc=org
  objectClass: inetOrgPerson
  userPassword:: e1NTSEF9Rit1bTlOS2FKdWM2bWFhWUtmRGQ5dmlBdEd6NEFydHY=
  sn: 25cbd5b54da849128b89c3f7ab6e5bff
  cn: userid

  # test-group, UserGroups, openstack.org
  dn: cn=test-group,ou=UserGroups,dc=openstack,dc=org
  objectClass: groupOfNames
  cn: test-group
  ou: f44a7fbb9e174ba5823474c759d43643
  member: cn=userid,ou=Users,dc=openstack,dc=org
  ...

  keystone.conf:

  ...
  user_id_attribute = sn
  user_name_attribute = cn
  ...

  This results in users unable to found in groups.
  e.g. `openstack user list --domain default --group test-group`
  Expected: User ID and Name are returned
  +--+-+
  | ID   | Name|
  +--+-+
  | 25cbd5b54da849128b89c3f7ab6e5bff | userid  |
  +--+-+

  Actual: Nothing is returned

To manage notifications about this bug go to:
https://bugs.launchpad.net/keystone/+bug/1692090/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1693704] Re: Unable to list federated projects with unscoped token

2018-06-08 Thread Launchpad Bug Tracker
[Expired for OpenStack Identity (keystone) because there has been no
activity for 60 days.]

** Changed in: keystone
   Status: Incomplete => Expired

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Identity (keystone).
https://bugs.launchpad.net/bugs/1693704

Title:
  Unable to list federated projects with unscoped token

Status in OpenStack Identity (keystone):
  Expired

Bug description:
  When I got the federated user project list, the error is as bellow:

  2017-05-26 15:12:54.685 12742 ERROR keystone.common.wsgi   File 
"/usr/lib/python2.7/site-packages/keystone/common/wsgi.py", line 228, in 
__call__
  2017-05-26 15:12:54.685 12742 ERROR keystone.common.wsgi result = 
method(req, **params)
  2017-05-26 15:12:54.685 12742 ERROR keystone.common.wsgi   File 
"/usr/lib/python2.7/site-packages/keystone/common/controller.py", line 164, in 
inner
  2017-05-26 15:12:54.685 12742 ERROR keystone.common.wsgi return f(self, 
request, *args, **kwargs)
  2017-05-26 15:12:54.685 12742 ERROR keystone.common.wsgi   File 
"/usr/lib/python2.7/site-packages/keystone/federation/controllers.py", line 
480, in list_projects_for_user
  2017-05-26 15:12:54.685 12742 ERROR keystone.common.wsgi 
request.auth_context['group_ids'])
  2017-05-26 15:12:54.685 12742 ERROR keystone.common.wsgi KeyError: 'group_ids'
  2017-05-26 15:12:54.685 12742 ERROR keystone.common.wsgi

  and I have got the token scoped in domain.

  My mapping rule is as bellow:

  [
  {
  "local": [
  {
     "user": {
  "name": "{0}",
  "domain": {
  "name": "{1}"
  },
  "type": "local"
  }
  }
  ],
  "remote": [
  {
  "type": "openstack_user"
  },
  {
  "type": "openstack_user_domain"
  }
  ]
  }
  ]

  The error is that token is an unscoped token which is got from the API
  “/v3/OS-FEDERATION/identity_providers/keystone-
  idp/protocols/saml2/auth” and then the federated user want to get the
  projects from /v3/OS-FEDERATION/projects. But error occurs.

To manage notifications about this bug go to:
https://bugs.launchpad.net/keystone/+bug/1693704/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1775947] [NEW] tempest.api.compute.servers.test_device_tagging.TaggedAttachmentsTest failing

2018-06-08 Thread Slawek Kaplonski
Public bug reported:

Since few days I see that
tempest.api.compute.servers.test_device_tagging.TaggedAttachmentsTest.test_tagged_attachment
in neutron-tempest-dvr job.

Example of failure: http://logs.openstack.org/90/572690/2/check/neutron-
tempest-dvr/45ec391/logs/testr_results.html.gz

It happened at least 3 times on 8.06:
http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22exit%20status%3A%20137%2C%20stderr%3A%20Killed%5C%22%20AND%20build_name%3A%5C
%22neutron-tempest-dvr%5C%22

** Affects: neutron
 Importance: Undecided
 Status: New


** Tags: gate-failure

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1775947

Title:
  tempest.api.compute.servers.test_device_tagging.TaggedAttachmentsTest
  failing

Status in neutron:
  New

Bug description:
  Since few days I see that
  
tempest.api.compute.servers.test_device_tagging.TaggedAttachmentsTest.test_tagged_attachment
  in neutron-tempest-dvr job.

  Example of failure: http://logs.openstack.org/90/572690/2/check
  /neutron-tempest-dvr/45ec391/logs/testr_results.html.gz

  It happened at least 3 times on 8.06:
  
http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22exit%20status%3A%20137%2C%20stderr%3A%20Killed%5C%22%20AND%20build_name%3A%5C
  %22neutron-tempest-dvr%5C%22

To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1775947/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1775934] [NEW] Cannot run "openstack server list" with instance stuck in scheduling state

2018-06-08 Thread Jim Rollenhagen
Public bug reported:

Seeing this in Ocata at 125dd1f30fdaf50182256c56808a5199856383c7.

Running `openstack server list --project
9c28d07207a54c78848fd7b4f85779d5` results in a 500 error:

RESP BODY: {"computeFault": {"message": "Unexpected API Error. Please
report this at http://bugs.launchpad.net/nova/ and attach the Nova API
log if possible.\n", "code":
500}}

Traceback in nova-api:
http://paste.openstack.org/show/6YrSmjMSo0lIxyFjbPIz/

some data on the instance:
http://paste.openstack.org/show/6PSa35HvdxZCQnVQ2sQU/

Looks like lazy-loading the flavor is failing because it's looking in
the wrong database.

** Affects: nova
 Importance: Undecided
 Status: New

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1775934

Title:
  Cannot run "openstack server list" with instance stuck in scheduling
  state

Status in OpenStack Compute (nova):
  New

Bug description:
  Seeing this in Ocata at 125dd1f30fdaf50182256c56808a5199856383c7.

  Running `openstack server list --project
  9c28d07207a54c78848fd7b4f85779d5` results in a 500 error:

  RESP BODY: {"computeFault": {"message": "Unexpected API Error. Please
  report this at http://bugs.launchpad.net/nova/ and attach the Nova API
  log if possible.\n", "code":
  500}}

  Traceback in nova-api:
  http://paste.openstack.org/show/6YrSmjMSo0lIxyFjbPIz/

  some data on the instance:
  http://paste.openstack.org/show/6PSa35HvdxZCQnVQ2sQU/

  Looks like lazy-loading the flavor is failing because it's looking in
  the wrong database.

To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1775934/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1775782] Re: glance-image-import.conf not parsed when running under wsgi

2018-06-08 Thread Erno Kuvaja
Taken into consideration that we don't currently support nor encourage
anyone running under wsgi mode either mod_wsgi nor uwsgi , I'd say the
priority for this bug is very low, specially as you mentioned in your
comment the fix needs to happen in Devstack rather than in Glance.

** Changed in: glance
   Importance: Undecided => Low

** Changed in: glance
   Status: New => Opinion

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to Glance.
https://bugs.launchpad.net/bugs/1775782

Title:
  glance-image-import.conf not parsed when running under wsgi

Status in Glance:
  Opinion

Bug description:
  When development environment is set to run under wsgi mode (by setting
  WSGI_MODE=mod_wsgi in local.conf) glance-image-import.conf file is not
  parsed.

  It is working properly if glance is running under uwsgi.

  Steps to reproduce:
  NOTE: Here I am trying to use plugin "inject_image_metadata" to inject 
metadata properties to the image.

  1. Add glance-image-import.conf as mentioned at /etc/glance
 [image_import_opts]
 image_import_plugins = ["inject_image_metadata"]

 [inject_metadata_properties]
 inject = "property1":"value"
 ignore_user_roles = demo

  2. Restart glance api service using "devstack@g-api.service"
  3. Source using admin credentials (as we are ignoring demo role in 
glance-image-import.conf)
 $ source devstack/openrc admin admin
  3. Create image using import api
 $ glance image-create-via-import --container-format bare --disk-format 
qcow2 --name ceph_image_default --file temp.qcow2

  Expected Output:
  Properties mentioned in glance-image-import.conf should be injected to the 
image

  Actual Output:
  Properties are not injected as plugin is not loaded.

To manage notifications about this bug go to:
https://bugs.launchpad.net/glance/+bug/1775782/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1774402] Re: Glance scrubber SELinux denials

2018-06-08 Thread Erno Kuvaja
This is a bug in RDO packaging rather than bug in Glance. Please file
the bug in RDO [0] and you have much more luck to get it fixed.

[0] https://bugzilla.redhat.com/enter_bug.cgi?product=RDO

** Changed in: glance
   Status: New => Invalid

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to Glance.
https://bugs.launchpad.net/bugs/1774402

Title:
  Glance scrubber SELinux denials

Status in Glance:
  Invalid

Bug description:
  Glance scrubber on RHEL7 from RDO with SELinux enabled get denied
  connecting to cinder & swift

  type=AVC msg=audit(1527765224.059:149655): avc:  denied  { name_connect } for 
 pid=1283 comm="glance-scrubber" dest=8776 
scontext=system_u:system_r:glance_scrubber_t:s0 tcontext=
  system_u:object_r:unreserved_port_t:s0 tclass=tcp_socket
  type=AVC msg=audit(1527765228.066:149656): avc:  denied  { name_connect } for 
 pid=1283 comm="glance-scrubber" dest=8776 
scontext=system_u:system_r:glance_scrubber_t:s0 tcontext=
  system_u:object_r:unreserved_port_t:s0 tclass=tcp_socket
  type=AVC msg=audit(1527765228.690:149657): avc:  denied  { name_connect } for 
 pid=1283 comm="glance-scrubber" dest=8080 
scontext=system_u:system_r:glance_scrubber_t:s0 tcontext=
  system_u:object_r:http_cache_port_t:s0 tclass=tcp_socket

  Enabling the nis_enabled seboolean allows connections to cinder,

  swift looks to need

  allow glance_scrubber_t http_cache_port_t:tcp_socket name_connect;

To manage notifications about this bug go to:
https://bugs.launchpad.net/glance/+bug/1774402/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1775879] [NEW] Create a domain, projects, users, and roles in keystone

2018-06-08 Thread andywilku
Public bug reported:

When I try to create new project I have get error as shown in the
attachment. I had done everything before in the instruction without any
problems. What can it be?

This bug tracker is for errors with the documentation, use the following
as a template and remove or add fields as you see fit. Convert [ ] into
[x] to check boxes:

- [ ] This doc is inaccurate in this way: __
- [ ] This is a doc addition request.
- [ ] I have a fix to the document that I can paste below including example: 
input and output. 

If you have a troubleshooting or support issue, use the following
resources:

 - Ask OpenStack: http://ask.openstack.org
 - The mailing list: http://lists.openstack.org
 - IRC: 'openstack' channel on Freenode

---
Release: 12.0.1.dev18 on 2018-04-21 04:02
SHA: 22af1d9f35c86e9c5bca288c2996be5c19e3cd61
Source: 
https://git.openstack.org/cgit/openstack/keystone/tree/doc/source/install/keystone-users-ubuntu.rst
URL: https://docs.openstack.org/keystone/pike/install/keystone-users-ubuntu.html

** Affects: keystone
 Importance: Undecided
 Status: New


** Tags: doc

** Attachment added: "Error report"
   
https://bugs.launchpad.net/bugs/1775879/+attachment/5150316/+files/IMG_20180608_174438.jpg

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Identity (keystone).
https://bugs.launchpad.net/bugs/1775879

Title:
  Create a domain, projects, users, and roles in keystone

Status in OpenStack Identity (keystone):
  New

Bug description:
  When I try to create new project I have get error as shown in the
  attachment. I had done everything before in the instruction without
  any problems. What can it be?

  This bug tracker is for errors with the documentation, use the
  following as a template and remove or add fields as you see fit.
  Convert [ ] into [x] to check boxes:

  - [ ] This doc is inaccurate in this way: __
  - [ ] This is a doc addition request.
  - [ ] I have a fix to the document that I can paste below including example: 
input and output. 

  If you have a troubleshooting or support issue, use the following
  resources:

   - Ask OpenStack: http://ask.openstack.org
   - The mailing list: http://lists.openstack.org
   - IRC: 'openstack' channel on Freenode

  ---
  Release: 12.0.1.dev18 on 2018-04-21 04:02
  SHA: 22af1d9f35c86e9c5bca288c2996be5c19e3cd61
  Source: 
https://git.openstack.org/cgit/openstack/keystone/tree/doc/source/install/keystone-users-ubuntu.rst
  URL: 
https://docs.openstack.org/keystone/pike/install/keystone-users-ubuntu.html

To manage notifications about this bug go to:
https://bugs.launchpad.net/keystone/+bug/1775879/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1775863] [NEW] Cannot save a RequestSpec object if nothing changed

2018-06-08 Thread Balazs Gibizer
Public bug reported:

If a RequestSpec.save() is called on an object that has no changed field
the code blows up with UnboundLocalError:

  File "nova/tests/unit/objects/test_request_spec.py", line 622, in 
test_save_does_not_persist_requested_resources
req_obj.save()
  File 
"/mnt/ssd/ebalgib/nova/py27/.tox/py27/local/lib/python2.7/site-packages/oslo_versionedobjects/base.py",
 line 226, in wrapper
return fn(self, *args, **kwargs)
  File "nova/objects/request_spec.py", line 588, in save
updates = self._get_update_primitives()
  File "nova/objects/request_spec.py", line 561, in _get_update_primitives
return db_updates
UnboundLocalError: local variable 'db_updates' referenced before assignment


It is because the db_updates local is only initialized if there is something to 
update but returned regardless
https://github.com/openstack/nova/blob/master/nova/objects/request_spec.py#L518
https://github.com/openstack/nova/blob/master/nova/objects/request_spec.py#L532

** Affects: nova
 Importance: Undecided
 Status: New

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1775863

Title:
  Cannot save a RequestSpec object if nothing changed

Status in OpenStack Compute (nova):
  New

Bug description:
  If a RequestSpec.save() is called on an object that has no changed
  field the code blows up with UnboundLocalError:

File "nova/tests/unit/objects/test_request_spec.py", line 622, in 
test_save_does_not_persist_requested_resources
  req_obj.save()
File 
"/mnt/ssd/ebalgib/nova/py27/.tox/py27/local/lib/python2.7/site-packages/oslo_versionedobjects/base.py",
 line 226, in wrapper
  return fn(self, *args, **kwargs)
File "nova/objects/request_spec.py", line 588, in save
  updates = self._get_update_primitives()
File "nova/objects/request_spec.py", line 561, in _get_update_primitives
  return db_updates
  UnboundLocalError: local variable 'db_updates' referenced before 
assignment

  
  It is because the db_updates local is only initialized if there is something 
to update but returned regardless
  
https://github.com/openstack/nova/blob/master/nova/objects/request_spec.py#L518
  
https://github.com/openstack/nova/blob/master/nova/objects/request_spec.py#L532

To manage notifications about this bug go to:
https://bugs.launchpad.net/nova/+bug/1775863/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1775758] Re: Deprecated auth_url entries in Neutron Queen's install guide

2018-06-08 Thread Brian Haley
Please look at the source url that you linked before filing bugs as this
has already been fixed.

** Changed in: neutron
   Status: New => Invalid

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1775758

Title:
  Deprecated auth_url entries in Neutron Queen's install guide

Status in neutron:
  Invalid

Bug description:

  This bug tracker is for errors with the documentation, use the
  following as a template and remove or add fields as you see fit.
  Convert [ ] into [x] to check boxes:

  - [x] This doc is inaccurate in this way:

  The Neutron installation guides use old auth_uri / auth_url values
  pointing at two different keystone endpoints for authentication of the
  network service and the compute service. This occurs within the
  controller node and compute node parts of the installation guide.
  Following the current guide then fails at the Verify step with error
  "Failed to retrieve extensions list from Network API"

  - [x] I have a fix to the document that I can paste below including
  example: input and output.

  input 1 (neutron.conf): 
  auth_uri = http://controller:5000
  auth_url = http://controller:35357 

  input 2 (nova.conf):
  auth_url = http://controller:35357

  output:
  auth_url = http://controller:5000

  I changed the values to the above output and the Verify step
  succeeded.


  ---
  Release: 12.0.3.dev24 on 2018-06-07 22:47
  SHA: 2206636feca043a9ab958010a00641f92957e8a5
  Source: 
https://git.openstack.org/cgit/openstack/neutron/tree/doc/source/install/controller-install-ubuntu.rst
  URL: 
https://docs.openstack.org/neutron/queens/install/controller-install-ubuntu.html

To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1775758/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp


[Yahoo-eng-team] [Bug 1773449] Re: VM rbd backed block devices inconsistent after unexpected host outage

2018-06-08 Thread James Page
OK figured this one out - the cephx keys are missing a permission which
allows them to see blacklisted clients - as a result they can't deal
with a hard crash:

  mon 'allow command "osd blacklist"'

This is a charm issue after all.

As a workaround you can manually update the existing client keys for
nova-compute using:

  sudo ceph auth caps client.nova-compute mon 'allow r, allow command
"osd blacklist"' osd 'allow rwx'

from any mon unit.



** Changed in: nova
   Status: New => Invalid

** Changed in: ceph (Ubuntu)
   Status: New => Invalid

** Changed in: nova (Ubuntu)
   Status: New => Invalid

** Changed in: cloud-archive
   Status: New => Invalid

** Changed in: qemu (Ubuntu)
   Status: New => Invalid

** Also affects: charm-ceph-mon
   Importance: Undecided
   Status: New

** Also affects: charms.ceph
   Importance: Undecided
   Status: New

** Changed in: charms.ceph
   Status: New => Triaged

** Changed in: charm-ceph-mon
   Status: New => Triaged

** Changed in: charms.ceph
   Importance: Undecided => High

** Changed in: charm-ceph-mon
   Importance: Undecided => High

** Changed in: charm-ceph-mon
Milestone: None => 18.08

** Changed in: cloud-archive
 Assignee: Sean Feole (sfeole) => (unassigned)

** Changed in: nova (Ubuntu)
 Assignee: Sean Feole (sfeole) => (unassigned)

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1773449

Title:
  VM rbd backed block devices inconsistent after unexpected host outage

Status in OpenStack ceph-mon charm:
  Triaged
Status in charms.ceph:
  Triaged
Status in Ubuntu Cloud Archive:
  Invalid
Status in OpenStack Compute (nova):
  Invalid
Status in ceph package in Ubuntu:
  Invalid
Status in nova package in Ubuntu:
  Invalid
Status in qemu package in Ubuntu:
  Invalid

Bug description:
  Reboot host that contains VMs with volumes and all VMs fail to boot.
  Happens with Queens on Bionic and Xenial

  [0.00] Initializing cgroup subsys cpuset

  [0.00] Initializing cgroup subsys cpu

  [0.00] Initializing cgroup subsys cpuacct

  [0.00] Linux version 4.4.0-124-generic
  (buildd@lcy01-amd64-028) (gcc version 5.4.0 20160609 (Ubuntu
  5.4.0-6ubuntu1~16.04.9) ) #148-Ubuntu SMP Wed May 2 13:00:18 UTC 2018
  (Ubuntu 4.4.0-124.148-generic 4.4.117)

  [0.00] Command line:
  BOOT_IMAGE=/boot/vmlinuz-4.4.0-124-generic
  root=UUID=bca2de6e-f774-4203-ae05-e8deeb05f64a ro console=tty1
  console=ttyS0

  [0.00] KERNEL supported cpus:

  [0.00]   Intel GenuineIntel

  [0.00]   AMD AuthenticAMD

  [0.00]   Centaur CentaurHauls

  [0.00] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256

  [0.00] x86/fpu: Supporting XSAVE feature 0x01: 'x87 floating
  point registers'

  [0.00] x86/fpu: Supporting XSAVE feature 0x02: 'SSE registers'

  [0.00] x86/fpu: Supporting XSAVE feature 0x04: 'AVX registers'

  [0.00] x86/fpu: Enabled xstate features 0x7, context size is
  832 bytes, using 'standard' format.

  [0.00] x86/fpu: Using 'eager' FPU context switches.

  [0.00] e820: BIOS-provided physical RAM map:

  [0.00] BIOS-e820: [mem 0x-0x0009fbff]
  usable

  [0.00] BIOS-e820: [mem 0x0009fc00-0x0009]
  reserved

  [0.00] BIOS-e820: [mem 0x000f-0x000f]
  reserved

  [0.00] BIOS-e820: [mem 0x0010-0x7ffdbfff]
  usable

  [0.00] BIOS-e820: [mem 0x7ffdc000-0x7fff]
  reserved

  [0.00] BIOS-e820: [mem 0xfeffc000-0xfeff]
  reserved

  [0.00] BIOS-e820: [mem 0xfffc-0x]
  reserved

  [0.00] NX (Execute Disable) protection: active

  [0.00] SMBIOS 2.8 present.

  [0.00] Hypervisor detected: KVM

  [0.00] e820: last_pfn = 0x7ffdc max_arch_pfn = 0x4

  [0.00] x86/PAT: Configuration [0-7]: WB  WC  UC- UC  WB  WC
  UC- WT

  [0.00] found SMP MP-table at [mem 0x000f6a20-0x000f6a2f]
  mapped at [880f6a20]

  [0.00] Scanning 1 areas for low memory corruption

  [0.00] Using GB pages for direct mapping

  [0.00] RAMDISK: [mem 0x361f4000-0x370f1fff]

  [0.00] ACPI: Early table checksum verification disabled

  [0.00] ACPI: RSDP 0x000F6780 14 (v00 BOCHS )

  [0.00] ACPI: RSDT 0x7FFE1649 2C (v01 BOCHS
  BXPCRSDT 0001 BXPC 0001)

  [0.00] ACPI: FACP 0x7FFE14CD 74 (v01 BOCHS
  BXPCFACP 0001 BXPC 0001)

  [0.00] ACPI: DSDT 0x7FFE0040 00148D (v01 BOCHS
  BXPCDSDT 0001 BXPC 0001)

  [0.00] ACPI: FACS 0x7FFE 40

  [0.00] ACPI: APIC 0x7FFE15C1 88 (v01 BOCHS
  BXPCAPIC 0

[Yahoo-eng-team] [Bug 1775797] [NEW] The mac table size of neutron bridges (br-tun, br-int, br-*) is too small by default and eventually makes openvswitch explode

2018-06-08 Thread Slawek Kaplonski
Public bug reported:

Description of problem:

the CPU utilization of ovs-vswitchd is high without DPDK enabled

 PID USER  PR  NIVIRTRESSHR S  %CPU %MEM TIME+ COMMAND
1512 root  10 -10 4352840 793864  12008 R  1101  0.3  15810:26 ovs-vswitchd

at the same time we were observing failures to send packets (ICMP) over
VXLAN tunnel, we think this might be related to high CPU usage.


--- Reproducer and analysis on ovs side done by Jiri Benc:

Reproducer:

Create an ovs bridge:

--
ovs-vsctl add-br ovs0
ip l s ovs0 up
--

Save this to a file named "reproducer.py":

--
#!/usr/bin/python
from scapy.all import *

data = [(str(RandMAC()), str(RandIP())) for i in
range(int(sys.argv[1]))]

s = conf.L2socket(iface="ovs0")
while True:
for mac, ip in data:
p = Ether(src=mac, dst=mac)/IP(src=ip, dst=ip)
s.send(p)
--

Run the reproducer:

./reproducer.py 5000



The problem is how flow revalidation works in ovs. There are several 
'revalidator' threads launched. They should normally sleep (modulo waking every 
0.5 second just to do nothing) and they wake if anything of interest happens 
(udpif_revalidator => poll_block). On every wake up, each revalidator thread 
checks whether flow revalidation is needed and if it is, it does the 
revalidation.

The revalidation is very costly with high number of flows. I also
suspect there's a lot of contention between the revalidator threads.

The flow revalidation is triggered by many things. What is of interest
for us is that any eviction of a MAC learning table entry triggers
revalidation.

The reproducer script repeatedly sends the same 5000 packets, all of
them with a different MAC address. This causes constant overflows of the
MAC learning table and constant revalidation. The revalidator threads
are being immediately woken up and are busy looping the revalidation.

Which is exactly the pattern from the customers' data: there are 16000+
flows and the packet capture shows that the packets are repeating every
second.

A quick fix is to increase the MAC learning table size:

ovs-vsctl set bridge  other-config:mac-table-size=5

This should lower the CPU usage down substantially; allow a few seconds
for things to settle down.

** Affects: neutron
 Importance: Medium
 Assignee: Slawek Kaplonski (slaweq)
 Status: Confirmed


** Tags: ovs

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1775797

Title:
  The mac table size of neutron bridges (br-tun, br-int, br-*) is too
  small by default and eventually makes openvswitch explode

Status in neutron:
  Confirmed

Bug description:
  Description of problem:

  the CPU utilization of ovs-vswitchd is high without DPDK enabled

   PID USER  PR  NIVIRTRESSHR S  %CPU %MEM TIME+ COMMAND
  1512 root  10 -10 4352840 793864  12008 R  1101  0.3  15810:26 
ovs-vswitchd

  at the same time we were observing failures to send packets (ICMP)
  over VXLAN tunnel, we think this might be related to high CPU usage.

  
  --- Reproducer and analysis on ovs side done by Jiri Benc:

  Reproducer:

  Create an ovs bridge:

  --
  ovs-vsctl add-br ovs0
  ip l s ovs0 up
  --

  Save this to a file named "reproducer.py":

  --
  #!/usr/bin/python
  from scapy.all import *

  data = [(str(RandMAC()), str(RandIP())) for i in
  range(int(sys.argv[1]))]

  s = conf.L2socket(iface="ovs0")
  while True:
  for mac, ip in data:
  p = Ether(src=mac, dst=mac)/IP(src=ip, dst=ip)
  s.send(p)
  --

  Run the reproducer:

  ./reproducer.py 5000

  
  
  The problem is how flow revalidation works in ovs. There are several 
'revalidator' threads launched. They should normally sleep (modulo waking every 
0.5 second just to do nothing) and they wake if anything of interest happens 
(udpif_revalidator => poll_block). On every wake up, each revalidator thread 
checks whether flow revalidation is needed and if it is, it does the 
revalidation.

  The revalidation is very costly with high number of flows. I also
  suspect there's a lot of contention between the revalidator threads.

  The flow revalidation is triggered by many things. What is of interest
  for us is that any eviction of a MAC learning table entry triggers
  revalidation.

  The reproducer script repeatedly sends the same 5000 packets, all of
  them with a different MAC address. This causes constant overflows of
  the MAC learning table and constant revalidation. The revalidator
  threads are being immediately woken up and are busy looping the
  revalidation.

  Which is exactly the pattern from the customers' data: there are
  16000+ flows and the packet capture shows that the packets are
  repeating every second.

  A quick fix is to increase the MAC learning table size:

  ovs-vsctl set bridge  other-config:mac-table-size=5

  This should lower the CPU usage down s

[Yahoo-eng-team] [Bug 1773449] Re: VMs do not survive host reboot

2018-06-08 Thread James Page
Raising bug tasks for ceph and qemu as I think this is where the issue
lies; nova generates the same libvirt xml disk stanzas for versions that
work and versions that have this issue.

** Also affects: ceph (Ubuntu)
   Importance: Undecided
   Status: New

** Also affects: qemu (Ubuntu)
   Importance: Undecided
   Status: New

** Changed in: ceph (Ubuntu)
   Importance: Undecided => High

** Changed in: qemu (Ubuntu)
   Importance: Undecided => High

** Summary changed:

- VMs do not survive host reboot
+ VM rbd backed block devices inconsistent after unexpected host outage

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to OpenStack Compute (nova).
https://bugs.launchpad.net/bugs/1773449

Title:
  VM rbd backed block devices inconsistent after unexpected host outage

Status in Ubuntu Cloud Archive:
  New
Status in OpenStack Compute (nova):
  New
Status in ceph package in Ubuntu:
  New
Status in nova package in Ubuntu:
  New
Status in qemu package in Ubuntu:
  New

Bug description:
  Reboot host that contains VMs with volumes and all VMs fail to boot.
  Happens with Queens on Bionic and Xenial

  [0.00] Initializing cgroup subsys cpuset

  [0.00] Initializing cgroup subsys cpu

  [0.00] Initializing cgroup subsys cpuacct

  [0.00] Linux version 4.4.0-124-generic
  (buildd@lcy01-amd64-028) (gcc version 5.4.0 20160609 (Ubuntu
  5.4.0-6ubuntu1~16.04.9) ) #148-Ubuntu SMP Wed May 2 13:00:18 UTC 2018
  (Ubuntu 4.4.0-124.148-generic 4.4.117)

  [0.00] Command line:
  BOOT_IMAGE=/boot/vmlinuz-4.4.0-124-generic
  root=UUID=bca2de6e-f774-4203-ae05-e8deeb05f64a ro console=tty1
  console=ttyS0

  [0.00] KERNEL supported cpus:

  [0.00]   Intel GenuineIntel

  [0.00]   AMD AuthenticAMD

  [0.00]   Centaur CentaurHauls

  [0.00] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256

  [0.00] x86/fpu: Supporting XSAVE feature 0x01: 'x87 floating
  point registers'

  [0.00] x86/fpu: Supporting XSAVE feature 0x02: 'SSE registers'

  [0.00] x86/fpu: Supporting XSAVE feature 0x04: 'AVX registers'

  [0.00] x86/fpu: Enabled xstate features 0x7, context size is
  832 bytes, using 'standard' format.

  [0.00] x86/fpu: Using 'eager' FPU context switches.

  [0.00] e820: BIOS-provided physical RAM map:

  [0.00] BIOS-e820: [mem 0x-0x0009fbff]
  usable

  [0.00] BIOS-e820: [mem 0x0009fc00-0x0009]
  reserved

  [0.00] BIOS-e820: [mem 0x000f-0x000f]
  reserved

  [0.00] BIOS-e820: [mem 0x0010-0x7ffdbfff]
  usable

  [0.00] BIOS-e820: [mem 0x7ffdc000-0x7fff]
  reserved

  [0.00] BIOS-e820: [mem 0xfeffc000-0xfeff]
  reserved

  [0.00] BIOS-e820: [mem 0xfffc-0x]
  reserved

  [0.00] NX (Execute Disable) protection: active

  [0.00] SMBIOS 2.8 present.

  [0.00] Hypervisor detected: KVM

  [0.00] e820: last_pfn = 0x7ffdc max_arch_pfn = 0x4

  [0.00] x86/PAT: Configuration [0-7]: WB  WC  UC- UC  WB  WC
  UC- WT

  [0.00] found SMP MP-table at [mem 0x000f6a20-0x000f6a2f]
  mapped at [880f6a20]

  [0.00] Scanning 1 areas for low memory corruption

  [0.00] Using GB pages for direct mapping

  [0.00] RAMDISK: [mem 0x361f4000-0x370f1fff]

  [0.00] ACPI: Early table checksum verification disabled

  [0.00] ACPI: RSDP 0x000F6780 14 (v00 BOCHS )

  [0.00] ACPI: RSDT 0x7FFE1649 2C (v01 BOCHS
  BXPCRSDT 0001 BXPC 0001)

  [0.00] ACPI: FACP 0x7FFE14CD 74 (v01 BOCHS
  BXPCFACP 0001 BXPC 0001)

  [0.00] ACPI: DSDT 0x7FFE0040 00148D (v01 BOCHS
  BXPCDSDT 0001 BXPC 0001)

  [0.00] ACPI: FACS 0x7FFE 40

  [0.00] ACPI: APIC 0x7FFE15C1 88 (v01 BOCHS
  BXPCAPIC 0001 BXPC 0001)

  [0.00] No NUMA configuration found

  [0.00] Faking a node at [mem
  0x-0x7ffdbfff]

  [0.00] NODE_DATA(0) allocated [mem 0x7ffd7000-0x7ffdbfff]

  [0.00] kvm-clock: Using msrs 4b564d01 and 4b564d00

  [0.00] kvm-clock: cpu 0, msr 0:7ffcf001, primary cpu clock

  [0.00] kvm-clock: using sched offset of 17590935813 cycles

  [0.00] clocksource: kvm-clock: mask: 0x
  max_cycles: 0x1cd42e4dffb, max_idle_ns: 881590591483 ns

  [0.00] Zone ranges:

  [0.00]   DMA  [mem 0x1000-0x00ff]

  [0.00]   DMA32[mem 0x0100-0x7ffdbfff]

  [0.00]   Normal   empty

  [0.00]   Device   empty

  [0.00] Movable zone start for each node

  [0.00] Early

[Yahoo-eng-team] [Bug 1775782] [NEW] glance-image-import.conf not parsed when running under wsgi

2018-06-08 Thread Abhishek Kekane
Public bug reported:

When development environment is set to run under wsgi mode (by setting
WSGI_MODE=mod_wsgi in local.conf) glance-image-import.conf file is not
parsed.

It is working properly if glance is running under uwsgi.

Steps to reproduce:
NOTE: Here I am trying to use plugin "inject_image_metadata" to inject metadata 
properties to the image.

1. Add glance-image-import.conf as mentioned at /etc/glance
   [image_import_opts]
   image_import_plugins = ["inject_image_metadata"]

   [inject_metadata_properties]
   inject = "property1":"value"
   ignore_user_roles = demo

2. Restart glance api service using "devstack@g-api.service"
3. Source using admin credentials (as we are ignoring demo role in 
glance-image-import.conf)
   $ source devstack/openrc admin admin
3. Create image using import api
   $ glance image-create-via-import --container-format bare --disk-format qcow2 
--name ceph_image_default --file temp.qcow2

Expected Output:
Properties mentioned in glance-image-import.conf should be injected to the image

Actual Output:
Properties are not injected as plugin is not loaded.

** Affects: glance
 Importance: Undecided
 Status: New

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to Glance.
https://bugs.launchpad.net/bugs/1775782

Title:
  glance-image-import.conf not parsed when running under wsgi

Status in Glance:
  New

Bug description:
  When development environment is set to run under wsgi mode (by setting
  WSGI_MODE=mod_wsgi in local.conf) glance-image-import.conf file is not
  parsed.

  It is working properly if glance is running under uwsgi.

  Steps to reproduce:
  NOTE: Here I am trying to use plugin "inject_image_metadata" to inject 
metadata properties to the image.

  1. Add glance-image-import.conf as mentioned at /etc/glance
 [image_import_opts]
 image_import_plugins = ["inject_image_metadata"]

 [inject_metadata_properties]
 inject = "property1":"value"
 ignore_user_roles = demo

  2. Restart glance api service using "devstack@g-api.service"
  3. Source using admin credentials (as we are ignoring demo role in 
glance-image-import.conf)
 $ source devstack/openrc admin admin
  3. Create image using import api
 $ glance image-create-via-import --container-format bare --disk-format 
qcow2 --name ceph_image_default --file temp.qcow2

  Expected Output:
  Properties mentioned in glance-image-import.conf should be injected to the 
image

  Actual Output:
  Properties are not injected as plugin is not loaded.

To manage notifications about this bug go to:
https://bugs.launchpad.net/glance/+bug/1775782/+subscriptions

-- 
Mailing list: https://launchpad.net/~yahoo-eng-team
Post to : yahoo-eng-team@lists.launchpad.net
Unsubscribe : https://launchpad.net/~yahoo-eng-team
More help   : https://help.launchpad.net/ListHelp