[Yahoo-eng-team] [Bug 2051108] Re: Support for the "bring your own keys" approach for Cinder

2024-01-30 Thread Dan Smith
** Also affects: cinder Importance: Undecided Status: New -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/2051108 Title: Support for the "bring your own keys"

[Yahoo-eng-team] [Bug 2039463] [NEW] live migration jobs failing missing lxml

2023-10-16 Thread Dan Smith
Public bug reported: Our jobs that run the evacuate post hook are failing due to not being able to run the ansible virt module because of a missing lxml library: 2023-10-16 14:38:57.818847 | TASK [run-evacuate-hook : Register running domains on subnode] 2023-10-16 14:38:58.598524 | controller

[Yahoo-eng-team] [Bug 2038840] [NEW] CPU state management fails if cpu0 is in dedicated set

2023-10-09 Thread Dan Smith
Public bug reported: If an operator configures cpu0 in the dedicated set and enables state management, nova-compute will fail on startup with this obscure error: Oct 06 20:08:43.195137 np0035436890 nova-compute[104711]: ERROR oslo_service.service nova.exception.FileNotFound: File

[Yahoo-eng-team] [Bug 2033393] Re: Nova does not update libvirts instance name after server rename

2023-09-05 Thread Dan Smith
The instance name in the XML is not the instance name according to nova. It is generated based on a template by the compute driver and is not otherwise mutable. So this is operating as designed. ** Changed in: nova Status: New => Invalid -- You received this bug notification because you

[Yahoo-eng-team] [Bug 2018612] [NEW] Guest kernel crashes with GPF on volume attach

2023-05-05 Thread Dan Smith
Public bug reported: This isn't really a bug in nova, but it's something that we're hitting in CI quite a bit, so I'm filing here to record the details and so I can recheck against it. The actual bug is either in the guest (cirros 0.5.2) kernel, QEMU, or something similar. In tests where we

[Yahoo-eng-team] [Bug 2012530] [NEW] nova-scheduler will crash at startup if placement is not up

2023-03-22 Thread Dan Smith
Public bug reported: This is the same problem as https://bugs.launchpad.net/nova/+bug/1846820 but for scheduler. Because we initialize our placement client during manager init, we will crash (and loop) on startup if keystone or placement are down. Example trace: Mar 22 15:54:39 jammy

[Yahoo-eng-team] [Bug 1958883] [NEW] Service version check breaks FFU

2022-01-24 Thread Dan Smith
Public bug reported: As reported on the mailing list: http://lists.openstack.org/pipermail/openstack-discuss/2022-January/026603.html The service version check at startup can prevent FFUs from being possible without hacking the database. As implemented here:

[Yahoo-eng-team] [Bug 1940460] [NEW] ORM fixes broke opportunistic testing on py36

2021-08-18 Thread Dan Smith
Public bug reported: The patch 9e002a77f2131d3594a2a4029a147beaf37f5b17 which is aimed at fixing things in advance of SQLAlchemy 2.0 seems to have broken our opportunistic testing of DB migrations on py36 only. This manifests as a total lockup of one worker during functional tests, which fails to

[Yahoo-eng-team] [Bug 1933360] [NEW] Test (and enforcement?) for os_hidden mutability on queued images is wrong

2021-06-23 Thread Dan Smith
Public bug reported: The test glance.tests.unit.v2.test_images_resource.TestImagesController.test_update_queued_image_with_hidden seems to be looking to confirm that queued images cannot be marked as hidden. However, if that was the case, it should be checking for BadRequest (or similar) and not

[Yahoo-eng-team] [Bug 1932337] [NEW] Cinder store migration will fail if first GET'er is not the owner

2021-06-17 Thread Dan Smith
Public bug reported: During an upgrade to Xena, cinder-backed image locations are migrated to include the store name in the URL field. This is lazily done on the first GET of the image. The problem is that the first user to GET an image after the migration may not be an admin or the owner of the

[Yahoo-eng-team] [Bug 1922928] [NEW] Image tasks API excludes in-progress tasks

2021-04-07 Thread Dan Smith
Public bug reported: The glance /images/$uuid/tasks API is excluding in-progress tasks, leading to test failures like this one: Traceback (most recent call last): File "/opt/stack/tempest/tempest/api/image/v2/test_images.py", line 111, in test_image_glance_direct_import

[Yahoo-eng-team] [Bug 1921399] [NEW] check_instance_shared_storage RPC call is broken

2021-03-25 Thread Dan Smith
Public bug reported: We broke check_instance_shared_storage() in this change: https://review.opendev.org/c/openstack/nova/+/761452/13..15/nova/compute/rpcapi.py Where we re-ordered the rpcapi client signature without adjusting the caller. This leads to this failure: Mar 25 13:46:28.041587

[Yahoo-eng-team] [Bug 1913625] Re: Glance will leak staging data

2021-02-22 Thread Dan Smith
** Changed in: glance Status: Invalid => Confirmed -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to Glance. https://bugs.launchpad.net/bugs/1913625 Title: Glance will leak staging data Status in Glance: Confirmed Bug

[Yahoo-eng-team] [Bug 1915543] [NEW] Glance returns 403 instead of 404 when images are not found

2021-02-12 Thread Dan Smith
Public bug reported: Glance is translating "Not Found" errors from the DB layer into "Not Authorized" errors in policy, which it should not be doing. In general, we should always return 404 when something either does not exist, or when permissions do not allow you to know if that thing exists.
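The rule described above (answer 404 both when a thing does not exist and when the caller is not allowed to know whether it exists) can be sketched roughly as follows. This is a minimal illustration, not Glance's code: the `IMAGES` table, the `lookup_status` helper, and the owner/visibility check are all hypothetical names made up for the example.

```python
from http import HTTPStatus

# Hypothetical in-memory image table; not Glance's data model.
IMAGES = {"img-1": {"owner": "alice", "visibility": "private"}}


def lookup_status(image_id, user):
    """Return the HTTP status a lookup should produce for this user."""
    image = IMAGES.get(image_id)
    if image is None:
        # The image does not exist at all.
        return HTTPStatus.NOT_FOUND
    if image["visibility"] != "public" and image["owner"] != user:
        # The image exists, but this user may not know that, so the
        # answer is indistinguishable from "does not exist".
        return HTTPStatus.NOT_FOUND
    return HTTPStatus.OK
```

With this shape, a caller who lacks access cannot tell a hidden image from a missing one, which is the behavior the report argues for.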

[Yahoo-eng-team] [Bug 1914826] [NEW] web-download with invalid url does not report error

2021-02-05 Thread Dan Smith
Public bug reported: In my testing, if I provide a URL to web-download that yields an error from urlopen(), I never see the store listed in the os_glance_failed_import list, and the store remains in os_glance_importing_to_stores. The image status does not change, which means there's no way for

[Yahoo-eng-team] [Bug 1914665] [NEW] Cinder Multistore job hits Cinder Quota error

2021-02-04 Thread Dan Smith
Public bug reported: Noticed during a cinder multistore test run, we hit a quota not found error. It looks like we don't handle this well, which causes nova to see a 503: Proxy Error. I dunno if there's anything better we can do than raise a 5xx, but we should probably explain in the error what

[Yahoo-eng-team] [Bug 1914664] [NEW] QEMU monitor read failure in ServerStableDeviceRescueTest

2021-02-04 Thread Dan Smith
Public bug reported: Seeing this failure in the gate: https://zuul.opendev.org/t/openstack/build/7c71502b04fe47039b87f76fbe04fe56/log/controller/logs/screen-n-cpu.txt#33096 Feb 04 20:54:32.857198 ubuntu-focal-limestone-regionone-0022873642 nova-compute[90163]: ERROR nova.virt.libvirt.driver

[Yahoo-eng-team] [Bug 1913625] [NEW] Glance will leak staging data

2021-01-28 Thread Dan Smith
Public bug reported: In various situations, glance will leak (potentially very large) temporary files in the staging store. One example is doing a web-download import, where glance initially downloads the image to its staging store. If the worker doing that activity crashes, loses power, etc,

[Yahoo-eng-team] [Bug 1912001] [NEW] glance allows reserved properties during create()

2021-01-15 Thread Dan Smith
Public bug reported: Certain image properties are reserved for internal glance usage, such as os_glance_import_task. Changing these properties is disallowed during PATCH. However, glance does not enforce that they are not present in an image POST. It should. This command: openstack --debug

[Yahoo-eng-team] [Bug 1897907] [NEW] DELETE fails on StaleDataError when updating image_properties

2020-09-30 Thread Dan Smith
Public bug reported: During the MultiStoresImportTest module in tempest, when we go to clean up images during tearDown, we occasionally get a 500 from the delete, which yields this from the test: ft1.1: tearDownClass

[Yahoo-eng-team] [Bug 1891352] [NEW] Failed import of one store will remain in progress forever if all_stores_must_succeed=True

2020-08-12 Thread Dan Smith
Public bug reported: If import is called with all_stores_must_succeed=True and a store fails during set_image_data(), the store will remain in os_glance_importing_stores forever, never going into the os_glance_failed_import list. This means a polling client will never notice that the import

[Yahoo-eng-team] [Bug 1891190] [NEW] test_reload() functional test causes hang and jobs TIMED_OUT

2020-08-11 Thread Dan Smith
Public bug reported: The glance.tests.functional.test_reload.TestReload.test_reload() test has been causing spurious deadlocks in functional test jobs, resulting in TIMED_OUT job statuses due to the global timeout expiring. This can be reproduced locally with lots of exposure, but Zuul runs

[Yahoo-eng-team] [Bug 1888713] [NEW] Async tasks, image import not supported in pure-WSGI mode

2020-07-23 Thread Dan Smith
Public bug reported: The wsgi_app.py file in the tree allows operators to run Glance API as a proper WSGI app. This has been the default devstack deployment for some time and multiple real clouds in the wild deploy like this. However, an attempt to start an import will be met with an image state

[Yahoo-eng-team] [Bug 1885003] [NEW] Interrupted copy-to-store may corrupt a subsequent operation

2020-06-24 Thread Dan Smith
Public bug reported: This is a hypothetical (but very possible) scenario that will result in a corrupted image stored by glance. I don't have code to reproduce it, but discussion seems to indicate that it is possible. Scenario: 1. Upload image to glance to one store, everything is good 2. Start

[Yahoo-eng-team] [Bug 1884596] [NEW] image import copy-to-store will start multiple importing threads due to race condition

2020-06-22 Thread Dan Smith
Public bug reported: I'm filing this bug a little prematurely because Abhi and I didn't get a chance to fully discuss it. However, looking at the code and the behavior I'm seeing due to another bug (1884587), I feel rather confident. Especially in a situation where glance is running on multiple

[Yahoo-eng-team] [Bug 1884587] [NEW] image import copy-to-store API should reflect proper authorization

2020-06-22 Thread Dan Smith
Public bug reported: In testing the image import copy-to-store mechanism from Nova, I hit an issue that seems clearly to be a bug. Scenario: A user boots an instance from an image they have permission to see. Nova uses their credentials to start an image import copy-to-store operation, which

[Yahoo-eng-team] [Bug 1858877] [NEW] Silent wasted storage with multiple RBD backends

2020-01-08 Thread Dan Smith
Public bug reported: Nova does not currently support multiple rbd backends. However, Glance does and an operator may point Nova at a Glance with access to multiple RBD clusters. If this happens, Nova will silently download the image from Glance, flatten it, and upload it to the local RBD cluster

[Yahoo-eng-team] [Bug 1853048] Re: Nova not updating VM's XML in KVM

2019-11-18 Thread Dan Smith
Nova does not even call down to the compute node when attributes like display_name are changed. The next time the xml is updated would be when it is regenerated, like during a lifecycle event (hard reboot) or migration. Ceilometer scraping that information out of the libvirt XML underneath nova

[Yahoo-eng-team] [Bug 1820125] [NEW] Libvirt driver ungracefully explodes if unsupported arch is found

2019-03-14 Thread Dan Smith
ognised 2019-03-14 19:11:31.709 6 ERROR nova.compute.manager ** Affects: nova Importance: Undecided Assignee: Dan Smith (danms) Status: In Progress -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Co

[Yahoo-eng-team] [Bug 1798158] [NEW] Non-templated transport_url will fail if not defined in config

2018-10-16 Thread Dan Smith
ind(':') ERROR nova.objects.cell_mapping AttributeError: 'NoneType' object has no attribute 'find' ERROR nova.objects.cell_mapping ** Affects: nova Importance: Undecided Assignee: Dan Smith (danms) Status: In Progress -- You received this bug notification because you are a

[Yahoo-eng-team] [Bug 1755602] [NEW] Ironic computes may not be discovered when node count is less than compute count

2018-03-13 Thread Dan Smith
than what we have today, even discounting this race. We can do what we did before, which is do it once for backports, and then add a mapped bit in master to make it more efficient, allowing it to be included in the scheduler periodic task. ** Affects: nova Importance: Medium Assignee:

[Yahoo-eng-team] [Bug 1746294] [NEW] Scheduler requests unlimited results from placement

2018-01-30 Thread Dan Smith
Importance: Medium Assignee: Dan Smith (danms) Status: In Progress ** Tags: queens-rc-potential scheduler -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1746294

[Yahoo-eng-team] [Bug 1738094] [NEW] TEXT is not large enough to store RequestSpec

2017-12-13 Thread Dan Smith
Public bug reported: This error occurs during Newton's online_data_migration phase: error: (pymysql.err.DataError) (1406, u"Data too long for column 'spec' at row 1") [SQL: u'INSERT INTO request_specs Which comes from RequestSpec.instance_group.members being extremely large ** Affects: nova

[Yahoo-eng-team] [Bug 1719966] [NEW] Microversion 2.47 punches nova in its special place

2017-09-27 Thread Dan Smith
Public bug reported: Testing with 500 instances in ACTIVE, and 500 in ERROR state, using curl to pull all 1000 instances ten times in a row, 2.47 clearly shows a knee in the curve on average response time: https://imgur.com/a/2lmiw We should...fix that and stuff. ** Affects: nova

[Yahoo-eng-team] [Bug 1713095] [NEW] Nova compute driver init happens before conductor is ready

2017-08-25 Thread Dan Smith
Public bug reported: In nova/service.py we poll for conductor readiness before we allow normal service startup behavior. The ironic driver does RPC to conductor in its _refresh_hash_ring() code, which may expect conductor to be up before it actually is. If so, we'll fail to start up because we called to

[Yahoo-eng-team] [Bug 1707071] [NEW] Compute nodes will fight over allocations during migration

2017-07-27 Thread Dan Smith
Public bug reported: As far back as Ocata, compute nodes that manage allocations will end up overwriting allocations from other compute nodes when doing a migration. This stems from the fact that the Resource Tracker was designed to manage a per-compute-node set of accounting, but placement is

[Yahoo-eng-team] [Bug 1698383] [NEW] Resource tracker regressed reporting negative memory

2017-06-16 Thread Dan Smith
Public bug reported: Nova's resource tracker is expected to publish negative values to the scheduler when resources are overcommitted. Nova's scheduler expects this: https://github.com/openstack/nova/blob/a43dbba2b8feea063ed2d0c79780b4c3507cf89b/nova/scheduler/host_manager.py#L215 In change

[Yahoo-eng-team] [Bug 1696125] Re: Detach interface failed - Unable to detach from guest transient domain (pike)

2017-06-09 Thread Dan Smith
** Changed in: nova Status: Fix Released => Confirmed -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1696125 Title: Detach interface failed - Unable to detach

[Yahoo-eng-team] [Bug 1693911] Re: compute node statistics will lie if service records are deleted

2017-05-26 Thread Dan Smith
Dupe of 1692397 ** Changed in: nova Status: New => Won't Fix -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1693911 Title: compute node statistics will lie if

[Yahoo-eng-team] [Bug 1693911] [NEW] compute node statistics will lie if service records are deleted

2017-05-26 Thread Dan Smith
Public bug reported: If a compute node references a deleted service, we will include it in the compute node statistics output. This happens even if the compute node record _is_ deleted, because of our join of the services table, which causes us to get back rows anyway. This results in the stats

[Yahoo-eng-team] [Bug 1684861] Re: Database online_data_migrations in newton fail due to missing keypairs

2017-04-20 Thread Dan Smith
** Changed in: nova Status: New => Invalid -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1684861 Title: Database online_data_migrations in newton fail due to

[Yahoo-eng-team] [Bug 1672625] Re: Instance stuck in schedule state in Ocata release

2017-04-17 Thread Dan Smith
The missed steps are documented here: https://docs.openstack.org/developer/nova/cells.html#first-time-setup That should get you a cell record created, hosts discovered, and back on track. ** Changed in: nova Status: New => Invalid -- You received this bug notification because you are a

[Yahoo-eng-team] [Bug 1670525] [NEW] Nova logs CellMapping objects at DEBUG

2017-03-06 Thread Dan Smith
Public bug reported: This could contain credentials for the DB and MQ ** Affects: nova Importance: Undecided Assignee: Dan Smith (danms) Status: In Progress ** Tags: newton-backport-potential -- You received this bug notification because you are a member of Yahoo

[Yahoo-eng-team] [Bug 1668310] [NEW] PCI device migration cannot continue with old deleted service records

2017-02-27 Thread Dan Smith
Public bug reported: If deleted service records are present in the database, the Service minimum version calculation should ignore them, but it does not. One manifestation of this is the PCI device migration from mitaka/newton will never complete, emitting an error message like this: 2017-02-27

[Yahoo-eng-team] [Bug 1663729] [NEW] CellsV1 regression introduced with flavor migration to api database

2017-02-10 Thread Dan Smith
the main database. ** Affects: nova Importance: Undecided Assignee: Dan Smith (danms) Status: In Progress ** Tags: newton-backport-potential ocata-backport-potential ** Tags added: newton-backport-potential -- You received this bug notification because you are a member of Yahoo

[Yahoo-eng-team] [Bug 1659391] Re: Server list API does not show scheduled servers that are not assigned to any cell

2017-02-06 Thread Dan Smith
Cells are not optional in Nova as of Ocata. Since cells are required, you should not see instances that are not assigned to a cell, because such a thing is not possible (post-scheduling). Creating an instance before nova is fully setup is not valid either. These two things combined are doubly

[Yahoo-eng-team] [Bug 1341420] Re: gap between scheduler selection and claim causes spurious failures when the instance is the last one to fit

2017-02-03 Thread Dan Smith
What you describe is fundamental to how nova works right now. We speculate in the scheduler, and if we race between two, we handle it with a reschedule. Nova specifically states that scheduling every last resource is out of scope. When trying to do that (which is often the use case for ironic)

[Yahoo-eng-team] [Bug 1661312] [NEW] Evacuation will corrupt instance allocations

2017-02-02 Thread Dan Smith
Public bug reported: The following sequence of events will result in a corrupted instance allocation in placement: 1. Instance running on host A, placement has allocations for instance on host A 2. Host A goes down 3. Instance is evacuated to host B, host B creates duplicated allocations in

[Yahoo-eng-team] [Bug 1660160] Re: No host-to-cell mapping found for selected host

2017-01-29 Thread Dan Smith
Something in your config has been preventing compute nodes from creating their compute node records for much longer than the referenced patch has been in place. I picked a random older run and found the same compute node record create failure:

[Yahoo-eng-team] [Bug 1655494] [NEW] Newton scheduler clients should keep trying to report

2017-01-10 Thread Dan Smith
Public bug reported: Newton scheduler clients will stop reporting any time they encounter a setup-related error, which isn't very operator-friendly for the ocata upgrade process. ** Affects: nova Importance: Medium Assignee: Dan Smith (danms) Status: Confirmed ** Tags

[Yahoo-eng-team] [Bug 1652233] Re: mitaka is incompatible with newton - IncompatibleObjectVersion Version 2.1 of InstanceList is not supported

2017-01-03 Thread Dan Smith
Yeah, mixed-version controllers isn't supported. We've made some progress towards being able to support it in master, but it's definitely not going to work in mitaka/newton. You have to upgrade your controllers simultaneously (well, most critically, your conductor services), and then you can have

[Yahoo-eng-team] [Bug 1648840] [NEW] libvirt driver leaves interface residue after failed start

2016-12-09 Thread Dan Smith
on the system, VM creates started failing with "Argument list too long" as libvirt was choking on enumerating the interfaces it had left behind. ** Affects: nova Importance: Medium Assignee: Dan Smith (danms) Status: In Progress ** Affects: nova/newton Importance:

[Yahoo-eng-team] [Bug 1583720] [NEW] "Migration instance not found" is logged repeatedly to nova-compute.log after an instance was deleted

2016-05-19 Thread Dan Smith
4323 DEBUG nova.compute.resource_tracker [req-...] Migration instance not found: Instance 585ac641-... could not be found. ** Affects: nova Importance: Undecided Assignee: Dan Smith (danms) Status: In Progress ** Tags: mitaka-backport-potential -- You received this bug notification b

[Yahoo-eng-team] [Bug 1555287] [NEW] Libvirt driver broken for non-disk-image backends

2016-03-09 Thread Dan Smith
Public bug reported: Recently the ceph job (and any other configuration that doesn't use disk image as the backend storage) started failing like this: 2016-03-09 14:47:29.102 17597 ERROR oslo_messaging.rpc.dispatcher Traceback (most recent call last): 2016-03-09 14:47:29.102 17597 ERROR

[Yahoo-eng-team] [Bug 1540526] [NEW] Too many lazy-loads in predictable situations

2016-02-01 Thread Dan Smith
Public bug reported: During a normal tempest run, way (way) too many object lazy-loads are being triggered, which causes extra RPC and database traffic. In a given tempest run, we should be able to pretty much prevent any lazy-loads in that predictable situation. The only case where we might want

[Yahoo-eng-team] [Bug 1539271] [NEW] Libvirt live migration stalls

2016-01-28 Thread Dan Smith
Public bug reported: The following message in nova gate test logs shows that libvirt live migration can stall on some sort of deadlock: 2016-01-28 16:53:20.878 INFO nova.virt.libvirt.driver [req-692a1f4f-16aa-4d93-a694-1f7eef4df9f6 tempest-LiveBlockMigrationTestJSON-1471114638 tempest-

[Yahoo-eng-team] [Bug 1506089] [NEW] Nova incorrectly calculates service version

2015-10-14 Thread Dan Smith
version 0 Which is clearly wrong (service_version minimum should be 2 not 0) ** Affects: nova Importance: Medium Assignee: Dan Smith (danms) Status: In Progress -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed

[Yahoo-eng-team] [Bug 1503708] [NEW] InstanceV2 backports to V1 lack a context

2015-10-07 Thread Dan Smith
Public bug reported: When we convert a V2 instance to a V1 instance, we don't provide it a context, which could, under some circumstances, cause a failure to lazy-load things we need to construct the older instance. ** Affects: nova Importance: High Assignee: Dan Smith (danms

[Yahoo-eng-team] [Bug 1498023] Re: _cleanup_incomplete_migrations() does not check for shared storage

2015-09-21 Thread Dan Smith
** Changed in: nova Importance: High => Undecided ** Changed in: nova Status: In Progress => Invalid ** Changed in: nova Milestone: liberty-rc1 => None -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack

[Yahoo-eng-team] [Bug 1498023] [NEW] _cleanup_incomplete_migrations() does not check for shared storage

2015-09-21 Thread Dan Smith
Assignee: Dan Smith (danms) Status: Confirmed -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1498023 Title: _cleanup_incomplete_migrations() does not check

[Yahoo-eng-team] [Bug 1387244] Re: Increasing number of InstancePCIRequests.get_by_instance_uuid RPC calls during compute host auditing

2015-09-14 Thread Dan Smith
** Changed in: nova Status: Confirmed => Fix Released -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1387244 Title: Increasing number of

[Yahoo-eng-team] [Bug 1493961] [NEW] nova-conductor object debug does not format

2015-09-09 Thread Dan Smith
Public bug reported: The debug log statement in nova-conductor's object_backport_versions() method doesn't format and looks like this: 2015-09-09 11:26:57.126 DEBUG nova.conductor.manager [req-9ff7962c-c8b8-4579-8943-cbf2ef0be373 demo demo] Backporting %(obj)s to %(ver)s with versions
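One common way to end up with a raw "%(obj)s" template in a log line is to call the logger without passing the mapping of values. The sketch below reproduces that symptom with the standard library logging module only. It is an assumption about the general failure mode, not a statement of what nova's code actually did; the logger name, the `ListHandler` class, and the sample values are hypothetical.

```python
import logging


class ListHandler(logging.Handler):
    """Collect rendered messages in a list so they can be inspected."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        # getMessage() applies %-interpolation only if args were supplied.
        self.messages.append(record.getMessage())


LOG = logging.getLogger("sketch.conductor")  # hypothetical logger name
LOG.setLevel(logging.DEBUG)
LOG.propagate = False
handler = ListHandler()
LOG.addHandler(handler)

values = {"obj": "Instance", "ver": "1.0"}  # hypothetical sample values

# No arguments supplied: the template is emitted verbatim.
LOG.debug("Backporting %(obj)s to %(ver)s")

# Mapping supplied as the single argument: placeholders are filled in.
LOG.debug("Backporting %(obj)s to %(ver)s", values)
```

After both calls, `handler.messages` holds the unformatted template first and the interpolated message second.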

[Yahoo-eng-team] [Bug 1478108] [NEW] Live migration should throttle itself

2015-07-24 Thread Dan Smith
Public bug reported: Nova will accept an unbounded number of live migrations for a single host, which will result in timeouts and failures (at least for libvirt). Since live migrations are seriously IO intensive, allowing this to be unlimited is just never going to be the right thing to do,

[Yahoo-eng-team] [Bug 1471887] [NEW] nova-compute will delete all instances if hostname changes

2015-07-06 Thread Dan Smith
changes, then we will just delete data based on a hunch. Nova-compute needs a better mechanism to detect if an evacuation has actually been requested before deleting the data. See Blueprint robustify-evacuate ** Affects: nova Importance: Undecided Assignee: Dan Smith (danms

[Yahoo-eng-team] [Bug 1470153] [NEW] Nova object relationships ignore List objects

2015-06-30 Thread Dan Smith
Public bug reported: In nova/tests/objects/test_objects.py, we have an important test called test_relationships(). This ensures that we have version mappings between objects that depend on each other, and that those versions and relationships are bumped when one object changes versions. That

[Yahoo-eng-team] [Bug 1470154] [NEW] List objects should use obj_relationships

2015-06-30 Thread Dan Smith
Public bug reported: Nova's List-based objects have something called child_versions, which is a naive mapping of the objects field and the version relationships between the list object and the content object. This was created before we generalized the work in obj_relationships, which normal

[Yahoo-eng-team] [Bug 1450624] [NEW] Nova waits for events from neutron on resize-revert that aren't coming

2015-04-30 Thread Dan Smith
creation failed ** Affects: nova Importance: High Assignee: Dan Smith (danms) Status: In Progress ** Tags: juno-backport-potential kilo-backport-potential libvirt neutron -- You received this bug notification because you are a member of Yahoo! Engineering Team, which

[Yahoo-eng-team] [Bug 1442236] [NEW] Bump compute RPC API to 4.0

2015-04-09 Thread Dan Smith
Public bug reported: We badly need to bump the compute RPC version to 4.0 BEFORE we release kilo. ** Affects: nova Importance: Critical Assignee: Dan Smith (danms) Status: Confirmed -- You received this bug notification because you are a member of Yahoo! Engineering Team

[Yahoo-eng-team] [Bug 1441243] [NEW] EnumField can be None and thus unrestricted

2015-04-07 Thread Dan Smith
Assignee: Dan Smith (danms) Status: In Progress ** Tags: unified-objects -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1441243 Title: EnumField can be None and thus

[Yahoo-eng-team] [Bug 1435586] [NEW] trigger security group refresh gives 'dict' object has no attribute 'uuid'

2015-03-23 Thread Dan Smith
to refresh_instance_security_rules() ** Affects: nova Importance: Undecided Assignee: Dan Smith (danms) Status: Confirmed -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1435586

[Yahoo-eng-team] [Bug 1403162] [NEW] fake_notifier: ValueError: Circular reference detected

2014-12-16 Thread Dan Smith
, in encode chunks = self.iterencode(o, _one_shot=True) File /usr/lib64/python2.7/json/encoder.py, line 270, in iterencode return _iterencode(o, 0) ValueError: Circular reference detected ** Affects: nova Importance: Critical Assignee: Dan Smith (danms

[Yahoo-eng-team] [Bug 1396324] [NEW] Instance object has no attribute get_flavor()

2014-11-25 Thread Dan Smith
27090 TRACE nova.notifications [instance: 74bb24d3-ba69-41e2-b99a-1c35a2331c1b] ** Affects: nova Importance: Medium Assignee: Dan Smith (danms) Status: Confirmed ** Changed in: nova Importance: Undecided => Medium ** Changed in: nova Status: New => Confirmed ** Changed

[Yahoo-eng-team] [Bug 1373106] Re: jogo and sdague are making me sad

2014-09-23 Thread Dan Smith
** Changed in: nova Status: Opinion => Confirmed -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1373106 Title: jogo and sdague are making me sad Status in

[Yahoo-eng-team] [Bug 1370536] [NEW] DB migrations can go unchecked

2014-09-17 Thread Dan Smith
Public bug reported: Currently DB migrations can be added to the tree without the corresponding migration tests. This is bad and means that we have some that are untested in the tree already. ** Affects: nova Importance: Medium Assignee: Dan Smith (danms) Status: In Progress

[Yahoo-eng-team] [Bug 1155800] Re: Cannot delete / confirm / revert resize an instance if nova-compute crashes after VERIFY_RESIZE

2014-09-16 Thread Dan Smith
This is super old, lots has changed since then, and several folks have not been able to reproduce. Please re-open if this is still valid. ** Changed in: nova Importance: High => Undecided ** Changed in: nova Status: Triaged => Invalid ** Changed in: nova Assignee: Dan Smith (danms

[Yahoo-eng-team] [Bug 1361683] [NEW] Instance pci_devices and security_groups refreshing can break backporting

2014-08-26 Thread Dan Smith
Assignee: Dan Smith (danms) Status: Confirmed ** Tags: unified-objects -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1361683 Title: Instance pci_devices

[Yahoo-eng-team] [Bug 1360320] [NEW] Unit tests fail in handle_schedule_error with wrong instance

2014-08-22 Thread Dan Smith
Public bug reported: From http://logs.openstack.org/70/113270/3/check/gate-nova-python26/038b3fa/console.html: 2014-08-21 20:08:33.507 | Traceback (most recent call last): 2014-08-21 20:08:33.507 | File nova/tests/conductor/test_conductor.py, line 1343, in

[Yahoo-eng-team] [Bug 1360333] [NEW] Object hash test fails to detect changes when serialize_args is used

2014-08-22 Thread Dan Smith
Public bug reported: The object hash test will fail to detect method signature changes when something like the serialize_args decorator is used. The test needs to drill down until it finds the remotable level and do the calculation there. ** Affects: nova Importance: Low Assignee: Dan

[Yahoo-eng-team] [Bug 1351020] [NEW] FloatingIP fails to load from database when not associated

2014-07-31 Thread Dan Smith
Importance: Undecided Assignee: Dan Smith (danms) Status: In Progress -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1351020 Title: FloatingIP fails

[Yahoo-eng-team] [Bug 1328245] [NEW] libvirt does not store connection_info after BFV setup

2014-06-09 Thread Dan Smith
which means we don't fully tear down everything at disconnect time. This is present in at least Havana, and I expect it is present in Icehouse and master as well. ** Affects: nova Importance: Medium Assignee: Dan Smith (danms) Status: Confirmed ** Tags: libvirt -- You received

[Yahoo-eng-team] [Bug 1250525] Re: nova-conductor did not start after upgrade

2014-06-04 Thread Dan Smith
Marking this as invalid since there was no follow up to the question of recurrence in the last three months. ** Changed in: nova Status: Incomplete => Invalid -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack

[Yahoo-eng-team] [Bug 1308805] Re: object backport doesn't work

2014-06-04 Thread Dan Smith
I think we determined this was related to a bad backport via mailing list convo. Re-open if not. ** Changed in: nova Status: New => Invalid -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova).

[Yahoo-eng-team] [Bug 1308715] Re: Deadlock on quota_usages

2014-05-06 Thread Dan Smith
** Also affects: nova/icehouse Importance: Undecided Status: New -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1308715 Title: Deadlock on quota_usages

[Yahoo-eng-team] [Bug 1306718] [NEW] Instances become undelete-able if vif unplugging fails

2014-04-11 Thread Dan Smith
: ['ovs-vsctl', '--timeout=120', 'del-port', 'br-int', u'qvo81ce661d-1a'] ** Affects: nova Importance: High Assignee: Dan Smith (danms) Status: Confirmed -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack

[Yahoo-eng-team] [Bug 1284312] [NEW] vmware driver races to create instance images

2014-02-24 Thread Dan Smith
Public bug reported: Change Ia0ebd674345734e7cfa31ccd400fdba93646c554 traded one race condition for another. By ignoring all mkdir() calls that would otherwise fail because an instance directory already exists, two nodes racing to create a single image will corrupt or lose data, or fail in a

[Yahoo-eng-team] [Bug 1280034] [NEW] compute_node_update broken with havana compute nodes

2014-02-13 Thread Dan Smith
Public bug reported: This change: https://review.openstack.org/#/c/66469 changed the format of the data in the values dictionary of compute_node_update. This causes an icehouse conductor to generate a broken SQL query when called from a havana compute node:

[Yahoo-eng-team] [Bug 1276731] [NEW] simple_tenant_usage extension should not rely on looking up flavors

2014-02-05 Thread Dan Smith
Public bug reported: The simple_tenant_usage extension gets the flavor data from the instance and then looks up the flavor from the database to return usage information. Since we now store all of the flavor data in the instance itself, we should use that information instead of what the flavor

[Yahoo-eng-team] [Bug 1275875] [NEW] Virt drivers should use standard image properties

2014-02-03 Thread Dan Smith
Public bug reported: Several virt drivers are using non-standard driver-specific image metadata properties. This creates an API contract between the external user and the driver implementation. These non-standard ones should be marked as deprecated in some way, enforced in v3, etc. We need a

[Yahoo-eng-team] [Bug 1265607] [NEW] Instance.refresh() sends new info_cache objects

2014-01-02 Thread Dan Smith
Public bug reported: If an older node does an Instance.refresh() it will fail because conductor will overwrite the info_cache field with a new InstanceInfoCache object. This happens during the LifecycleEvent handler in nova-compute. ** Affects: nova Importance: Undecided Assignee: Dan

[Yahoo-eng-team] [Bug 1265618] [NEW] image_snapshot_pending state breaks havana nodes

2014-01-02 Thread Dan Smith
Public bug reported: Icehouse introduced a state called image_snapshot_pending which havana nodes do not understand. If they call save with expected_task_state=image_snapshot they will crash on the new state. 2014-01-02 11:58:46.766 TRACE nova.openstack.common.rpc.amqp File

[Yahoo-eng-team] [Bug 1258256] [NEW] Live upgrade from Havana broken by commit 62e9829

2013-12-05 Thread Dan Smith
this properly. ** Affects: nova Importance: Medium Assignee: Dan Smith (danms) Status: Confirmed ** Tags: unified-objects -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net

[Yahoo-eng-team] [Bug 1180618] Re: fault['message'] needs to be updated with exception message

2013-06-05 Thread Dan Smith
I don't think this bug is valid. Isn't the problem just that you're failing to schedule both times and ending up with the same error message? ** Changed in: nova Status: In Progress => Invalid -- You received this bug notification because you are a member of Yahoo! Engineering Team, which

[Yahoo-eng-team] [Bug 1165895] Re: image-create/snapshot image_state property/metadata always 'available'

2013-05-08 Thread Dan Smith
** Changed in: nova Importance: Undecided => Wishlist ** Changed in: nova Status: New => Opinion -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1165895 Title:

[Yahoo-eng-team] [Bug 1161709] Re: confirm-resize failed, after migration. KeyError: 'old_instance_type_memory_mb'

2013-04-01 Thread Dan Smith
Yes, that's the fix I'm talking about. I'm going to mark this bug as invalid since it has already been fixed. ** Changed in: nova Status: Incomplete => Invalid -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack

[Yahoo-eng-team] [Bug 1161496] Re: Boot from volume will attach the VM to all networks

2013-03-28 Thread Dan Smith
OP realized this is a dupe ** Changed in: nova Status: New => Invalid -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to OpenStack Compute (nova). https://bugs.launchpad.net/bugs/1161496 Title: Boot from volume will attach

[Yahoo-eng-team] [Bug 981263] Re: Nova API should present deleted flavors (instance_types) in some cases

2013-02-08 Thread Dan Smith
This was fixed at some point, probably after several recent changes, and is no longer an issue according to the reporter. ** Changed in: nova Status: Triaged => Invalid -- You received this bug notification because you are a member of Yahoo! Engineering Team, which is subscribed to