On Thu, Jul 25, 2019 at 7:00 PM Bob R <[email protected]> wrote:

> I would try 'mv /etc/ceph/osd{,.old}' then run 'ceph-volume  simple scan'
> again. We had some problems upgrading due to OSDs (perhaps initially
> installed as firefly?) missing the 'type' attribute and iirc the
> 'ceph-volume simple scan' command refused to overwrite existing json files
> after I made some changes to ceph-volume.
>

Ooof. I could swear that this issue was fixed already and it took me a
while to find out that it wasn't at all. We saw this a few months ago in
our Long Running Cluster used for dogfooding.

I've created a ticket to track this work at
http://tracker.ceph.com/issues/40987

But what you've done is exactly why we chose to persist the JSON files in
/etc/ceph/osd/*.json: so that an admin can tell if anything is missing
(or incorrect, like in this case) and make the needed changes.
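For example, as a rough illustration only (the file name is the one that
comes up later in this thread), an admin could pretty-print the persisted
JSON and confirm that 'type' and the expected device entries ('data', plus
'block' for bluestore) are present before re-running activation:

# python -m json.tool /etc/ceph/osd/0-93fb5f2f-0273-4c87-a718-886d7e6db983.json
# ceph-volume simple activate --all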



> Bob
>
> On Wed, Jul 24, 2019 at 1:24 PM Alfredo Deza <[email protected]> wrote:
>
>>
>>
>> On Wed, Jul 24, 2019 at 4:15 PM Peter Eisch <[email protected]>
>> wrote:
>>
>>> Hi,
>>>
>>>
>>>
>>> I appreciate the insistence that the directions be followed.  I wholly
>>> agree.  The only liberty I took was to do a ‘yum update’ instead of just
>>> ‘yum update ceph-osd’ and then reboot.  (Also, my MDS runs on the MON hosts,
>>> so it got updated a step early.)
>>>
>>>
>>>
>>> As for the logs:
>>>
>>>
>>>
>>> [2019-07-24 15:07:22,713][ceph_volume.main][INFO  ] Running command:
>>> ceph-volume  simple scan
>>>
>>> [2019-07-24 15:07:22,714][ceph_volume.process][INFO  ] Running command:
>>> /bin/systemctl show --no-pager --property=Id --state=running ceph-osd@*
>>>
>>> [2019-07-24 15:07:27,574][ceph_volume.main][INFO  ] Running command:
>>> ceph-volume  simple activate --all
>>>
>>> [2019-07-24 15:07:27,575][ceph_volume.devices.simple.activate][INFO  ]
>>> activating OSD specified in
>>> /etc/ceph/osd/0-93fb5f2f-0273-4c87-a718-886d7e6db983.json
>>>
>>> [2019-07-24 15:07:27,576][ceph_volume.devices.simple.activate][ERROR ]
>>> Required devices (block and data) not present for bluestore
>>>
>>> [2019-07-24 15:07:27,576][ceph_volume.devices.simple.activate][ERROR ]
>>> bluestore devices found: [u'data']
>>>
>>> [2019-07-24 15:07:27,576][ceph_volume][ERROR ] exception caught by
>>> decorator
>>>
>>> Traceback (most recent call last):
>>>
>>>   File "/usr/lib/python2.7/site-packages/ceph_volume/decorators.py",
>>> line 59, in newfunc
>>>
>>>     return f(*a, **kw)
>>>
>>>   File "/usr/lib/python2.7/site-packages/ceph_volume/main.py", line 148,
>>> in main
>>>
>>>     terminal.dispatch(self.mapper, subcommand_args)
>>>
>>>   File "/usr/lib/python2.7/site-packages/ceph_volume/terminal.py", line
>>> 182, in dispatch
>>>
>>>     instance.main()
>>>
>>>   File
>>> "/usr/lib/python2.7/site-packages/ceph_volume/devices/simple/main.py", line
>>> 33, in main
>>>
>>>     terminal.dispatch(self.mapper, self.argv)
>>>
>>>   File "/usr/lib/python2.7/site-packages/ceph_volume/terminal.py", line
>>> 182, in dispatch
>>>
>>>     instance.main()
>>>
>>>   File
>>> "/usr/lib/python2.7/site-packages/ceph_volume/devices/simple/activate.py",
>>> line 272, in main
>>>
>>>     self.activate(args)
>>>
>>>   File "/usr/lib/python2.7/site-packages/ceph_volume/decorators.py",
>>> line 16, in is_root
>>>
>>>     return func(*a, **kw)
>>>
>>>   File
>>> "/usr/lib/python2.7/site-packages/ceph_volume/devices/simple/activate.py",
>>> line 131, in activate
>>>
>>>     self.validate_devices(osd_metadata)
>>>
>>>   File
>>> "/usr/lib/python2.7/site-packages/ceph_volume/devices/simple/activate.py",
>>> line 62, in validate_devices
>>>
>>>     raise RuntimeError('Unable to activate bluestore OSD due to missing
>>> devices')
>>>
>>> RuntimeError: Unable to activate bluestore OSD due to missing devices
>>>
>>>
>>>
>>> (this is repeated for each of the 16 drives)
>>>
>>>
>>>
>>> Any other thoughts?  (I’ll delete/recreate the OSDs with ceph-deploy
>>> otherwise.)
>>>
>>
>> Try using `ceph-volume simple scan --stdout` so that it doesn't persist
>> data into /etc/ceph/osd/, and inspect the JSON it produces to confirm it
>> captures all the necessary details for the OSDs.
>>
>> Alternatively, I would look at the JSON files already produced in
>> /etc/ceph/osd/ and check whether the details are correct. The `scan`
>> sub-command goes to tremendous effort to cover all the ways ceph-disk
>> could have created an OSD (filestore, bluestore, dmcrypt, etc.), but it is
>> possible that it is hitting a problem here. This is why the tool makes
>> these JSON files available: so that they can be inspected and corrected if
>> anything is off.
>>
>> The details of the scan sub-command can be found at
>> http://docs.ceph.com/docs/master/ceph-volume/simple/scan/ and the JSON
>> structure is described in detail at
>> http://docs.ceph.com/docs/master/ceph-volume/simple/scan/#json-contents
>>
>> In this particular case the tool is refusing to activate what seems to be
>> a bluestore OSD. Is it really a bluestore OSD? If so, it can't find
>> where the data partition is. What does that partition look like (for any of
>> the failing OSDs)? Does it use dmcrypt, and how was it created? (Hopefully
>> with ceph-disk!)
>>
>> If you know the data partition for a given OSD, try passing it to
>> 'scan'. For example, if it is /dev/sda1 you could run `ceph-volume simple
>> scan /dev/sda1` and check its output.
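>>
>> For instance, just as a sketch (substitute the data partition of one of the
>> failing OSDs), this shows the partition and what `scan` would produce
>> without writing anything to /etc/ceph/osd/:
>>
>> # blkid /dev/sda1
>> # ceph-volume simple scan --stdout /dev/sda1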
>>
>>
>>
>>>
>>> peter
>>>
>>>
>>>
>>>
>>>
>>> From: Alfredo Deza <[email protected]>
>>> Date: Wednesday, July 24, 2019 at 3:02 PM
>>> To: Peter Eisch <[email protected]>
>>> Cc: Paul Emmerich <[email protected]>, "[email protected]"
>>> <[email protected]>
>>> Subject: Re: [ceph-users] Upgrading and lost OSDs
>>>
>>>
>>>
>>>
>>>
>>> On Wed, Jul 24, 2019 at 3:49 PM Peter Eisch <[email protected]>
>>> wrote:
>>>
>>>
>>>
>>> I’m at step 6.  I updated/rebooted the host to complete “installing the
>>> new packages and restarting the ceph-osd daemon” on the first OSD host.
>>> All the systemctl definitions to start the OSDs were deleted, and the
>>> contents of the /var/lib/ceph/osd/ceph-* directories were deleted as well.
>>> All the files in /var/lib/ceph/osd-lockbox, for comparison, were untouched
>>> and still present.
>>>
>>>
>>>
>>> Peeking into step 7 I can run ceph-volume:
>>>
>>>
>>>
>>> # ceph-volume simple scan /dev/sda1
>>>
>>> Running command: /usr/sbin/cryptsetup status /dev/sda1
>>>
>>> Running command: /usr/sbin/cryptsetup status
>>> 93fb5f2f-0273-4c87-a718-886d7e6db983
>>>
>>> Running command: /bin/mount -v /dev/sda5 /tmp/tmpF5F8t2
>>>
>>> stdout: mount: /dev/sda5 mounted on /tmp/tmpF5F8t2.
>>>
>>> Running command: /usr/sbin/cryptsetup status /dev/sda5
>>>
>>> Running command: /bin/ceph --cluster ceph --name
>>> client.osd-lockbox.93fb5f2f-0273-4c87-a718-886d7e6db983 --keyring
>>> /tmp/tmpF5F8t2/keyring config-key get
>>> dm-crypt/osd/93fb5f2f-0273-4c87-a718-886d7e6db983/luks
>>>
>>> Running command: /bin/umount -v /tmp/tmpF5F8t2
>>>
>>> stderr: umount: /tmp/tmpF5F8t2 (/dev/sda5) unmounted
>>>
>>> Running command: /usr/sbin/cryptsetup --key-file - --allow-discards
>>> luksOpen /dev/sda1 93fb5f2f-0273-4c87-a718-886d7e6db983
>>>
>>> Running command: /bin/mount -v
>>> /dev/mapper/93fb5f2f-0273-4c87-a718-886d7e6db983 /tmp/tmpYK0WEV
>>>
>>> stdout: mount: /dev/mapper/93fb5f2f-0273-4c87-a718-886d7e6db983 mounted
>>> on /tmp/tmpYK0WEV.
>>>
>>> --> broken symlink found /tmp/tmpYK0WEV/block ->
>>> /dev/mapper/a05b447c-c901-4690-a249-cc1a2d62a110
>>>
>>> Running command: /usr/sbin/cryptsetup status /tmp/tmpYK0WEV/block_dmcrypt
>>>
>>> Running command: /usr/sbin/cryptsetup status
>>> /dev/mapper/93fb5f2f-0273-4c87-a718-886d7e6db983
>>>
>>> Running command: /bin/umount -v /tmp/tmpYK0WEV
>>>
>>> stderr: umount: /tmp/tmpYK0WEV
>>> (/dev/mapper/93fb5f2f-0273-4c87-a718-886d7e6db983) unmounted
>>>
>>> Running command: /usr/sbin/cryptsetup remove
>>> /dev/mapper/93fb5f2f-0273-4c87-a718-886d7e6db983
>>>
>>> --> OSD 0 got scanned and metadata persisted to file:
>>> /etc/ceph/osd/0-93fb5f2f-0273-4c87-a718-886d7e6db983.json
>>>
>>> --> To take over management of this scanned OSD, and disable ceph-disk
>>> and udev, run:
>>>
>>> -->     ceph-volume simple activate 0
>>> 93fb5f2f-0273-4c87-a718-886d7e6db983
>>>
>>> #
>>>
>>> #
>>>
>>> # ceph-volume simple activate 0 93fb5f2f-0273-4c87-a718-886d7e6db983
>>>
>>> --> Required devices (block and data) not present for bluestore
>>>
>>> --> bluestore devices found: [u'data']
>>>
>>> -->  RuntimeError: Unable to activate bluestore OSD due to missing
>>> devices
>>>
>>> #
>>>
>>>
>>>
>>> The tool detected bluestore, or rather, it failed to find a journal
>>> associated with /dev/sda1. Scanning a single partition can cause that.
>>> There is a flag (--stdout) to print the findings to STDOUT instead of
>>> persisting them in /etc/ceph/osd/.
>>>
>>>
>>>
>>> Since this is a "whole system" upgrade, the upgrade documentation's
>>> instructions need to be followed:
>>>
>>>
>>>
>>> ceph-volume simple scan
>>> ceph-volume simple activate --all
>>>
>>>
>>>
>>> If the `scan` command doesn't display any information (not even with the
>>> --stdout flag), then the logs at /var/log/ceph/ceph-volume.log need to be
>>> inspected. It would be useful to check for any findings in there.
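>>>
>>> For instance (just a sketch; any way of paging the log works), pulling out
>>> the errors and the commands the tool ran can narrow things down:
>>>
>>> # grep -E 'ERROR|Running command' /var/log/ceph/ceph-volume.log | tail -n 50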
>>>
>>>
>>>
>>>
>>> Okay, this created /etc/ceph/osd/*.json.  This is cool.  Is there a
>>> command or option which will read these files and mount the devices?
>>>
>>>
>>>
>>> peter
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>> From: Alfredo Deza <[email protected]>
>>> Date: Wednesday, July 24, 2019 at 2:20 PM
>>> To: Peter Eisch <[email protected]>
>>> Cc: Paul Emmerich <[email protected]>, "[email protected]"
>>> <[email protected]>
>>> Subject: Re: [ceph-users] Upgrading and lost OSDs
>>>
>>>
>>>
>>> On Wed, Jul 24, 2019 at 2:56 PM Peter Eisch <[email protected]>
>>> wrote:
>>>
>>> Hi Paul,
>>>
>>> To do better to answer you question, I'm following:
>>> http://docs.ceph.com/docs/nautilus/releases/nautilus/
>>>
>>> At step 6, upgrade OSDs, I jumped on an OSD host and did a full 'yum
>>> update' for patching the host and rebooted to pick up the current centos
>>> kernel.
>>>
>>>
>>>
>>> If you are at Step 6 then it is *crucial* to understand that the tooling
>>> used to create the OSDs is no longer available and Step 7 *is absolutely
>>> required*.
>>>
>>>
>>>
>>> ceph-volume has to scan the system and report all the OSDs it finds so
>>> that it can persist them as /etc/ceph/osd/*.json files, which can later be
>>> "activated".
>>>
>>>
>>>
>>>
>>> I didn't run any commands specific to updating just the ceph RPMs in
>>> this process.
>>>
>>>
>>>
>>> It is not clear if you are at Step 6 and wondering why OSDs are not up,
>>> or you are past that and ceph-volume wasn't able to detect anything.
>>>
>>>
>>>
>>> peter
>>>
>>>
>>>
>>> From: Paul Emmerich <[email protected]>
>>> Date: Wednesday, July 24, 2019 at 1:39 PM
>>> To: Peter Eisch <[email protected]>
>>> Cc: Xavier Trilla <[email protected]>, "
>>> [email protected]" <[email protected]>
>>> Subject: Re: [ceph-users] Upgrading and lost OSDs
>>>
>>> On Wed, Jul 24, 2019 at 8:36 PM Peter Eisch <[email protected]> wrote:
>>> # lsblk
>>> NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
>>> sda 8:0 0 1.7T 0 disk
>>> ├─sda1 8:1 0 100M 0 part
>>> ├─sda2 8:2 0 1.7T 0 part
>>> └─sda5 8:5 0 10M 0 part
>>> sdb 8:16 0 1.7T 0 disk
>>> ├─sdb1 8:17 0 100M 0 part
>>> ├─sdb2 8:18 0 1.7T 0 part
>>> └─sdb5 8:21 0 10M 0 part
>>> sdc 8:32 0 1.7T 0 disk
>>> ├─sdc1 8:33 0 100M 0 part
>>>
>>> That's ceph-disk, which was removed; run "ceph-volume simple scan"
>>>
>>>
>>> --
>>> Paul Emmerich
>>>
>>> Looking for help with your Ceph cluster? Contact us at
>>> https://croit.io
>>>
>>> croit GmbH
>>> Freseniusstr. 31h
>>> 81247 München
>>>
>>> http://www.croit.io
>>> Tel: +49 89 1896585 90
>>>
>>>
>>> ...
>>> I'm thinking the OSDs would start (I can recreate the .service
>>> definitions in systemd) if the above were mounted the way they are
>>> on another of my hosts:
>>> # lsblk
>>> NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT
>>> sda 8:0 0 1.7T 0 disk
>>> ├─sda1 8:1 0 100M 0 part
>>> │ └─97712be4-1234-4acc-8102-2265769053a5 253:17 0 98M 0 crypt
>>> /var/lib/ceph/osd/ceph-16
>>> ├─sda2 8:2 0 1.7T 0 part
>>> │ └─049b7160-1234-4edd-a5dc-fe00faca8d89 253:16 0 1.7T 0 crypt
>>> └─sda5 8:5 0 10M 0 part
>>> /var/lib/ceph/osd-lockbox/97712be4-9674-4acc-1234-2265769053a5
>>> sdb 8:16 0 1.7T 0 disk
>>> ├─sdb1 8:17 0 100M 0 part
>>> │ └─f03f0298-1234-42e9-8b28-f3016e44d1e2 253:26 0 98M 0 crypt
>>> /var/lib/ceph/osd/ceph-17
>>> ├─sdb2 8:18 0 1.7T 0 part
>>> │ └─51177019-1234-4963-82d1-5006233f5ab2 253:30 0 1.7T 0 crypt
>>> └─sdb5 8:21 0 10M 0 part
>>> /var/lib/ceph/osd-lockbox/f03f0298-1234-42e9-8b28-f3016e44d1e2
>>> sdc 8:32 0 1.7T 0 disk
>>> ├─sdc1 8:33 0 100M 0 part
>>> │ └─0184df0c-1234-404d-92de-cb71b1047abf 253:8 0 98M 0 crypt
>>> /var/lib/ceph/osd/ceph-18
>>> ├─sdc2 8:34 0 1.7T 0 part
>>> │ └─fdad7618-1234-4021-a63e-40d973712e7b 253:13 0 1.7T 0 crypt
>>> ...
>>>
>>> Thank you for your time on this,
>>>
>>> peter
>>>
>>> From: Xavier Trilla <[email protected]>
>>> Date: Wednesday, July 24, 2019 at 1:25 PM
>>> To: Peter Eisch <[email protected]>
>>> Cc: "[email protected]" <[email protected]>
>>> Subject: Re: [ceph-users] Upgrading and lost OSDs
>>>
>>> Hi Peter,
>>>
>>> I'm not sure, but maybe after some changes the OSDs are not being
>>> recognized by the ceph scripts.
>>>
>>> Ceph used to use udev to detect the OSDs and then moved to LVM. Which
>>> kind of OSDs are you running, bluestore or filestore? Which version did you
>>> use to create them?
>>>
>>> Cheers!
>>>
>>> On 24 Jul 2019, at 20:04, Peter Eisch <[email protected]> wrote:
>>> Hi,
>>>
>>> I’m working through updating from 12.2.12/luminous to 14.2.2/nautilus
>>> on CentOS 7.6. The managers are updated alright:
>>>
>>> # ceph -s
>>>   cluster:
>>>     id:     2fdb5976-1234-4b29-ad9c-1ca74a9466ec
>>>     health: HEALTH_WARN
>>>             Degraded data redundancy: 24177/9555955 objects degraded
>>> (0.253%), 7 pgs degraded, 1285 pgs undersized
>>>             3 monitors have not enabled msgr2
>>>  ...
>>>
>>> I updated ceph on an OSD host with 'yum update' and then rebooted to grab
>>> the current kernel. Along the way, the contents of all the directories in
>>> /var/lib/ceph/osd/ceph-*/ were deleted. Thus I have 16 OSDs down from this.
>>> I can manage the undersized PGs, but I'd like to get these drives working
>>> again without deleting each OSD and recreating them.
>>>
>>> So far I've pulled the respective cephx key into the 'keyring' file and
>>> populated 'bluestore' into the 'type' files but I'm unsure how to get the
>>> lockboxes mounted to where I can get the OSDs running. The osd-lockbox
>>> directory is otherwise untouched from when the OSDs were deployed.
>>>
>>> Is there a way to run ceph-deploy or some other tool to rebuild the
>>> mounts for the drives?
>>>
>>> peter
>>>
>>>
>>
>
_______________________________________________
ceph-users mailing list
[email protected]
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
