Hi,

On 9/9/25 at 17:34, Yuri Weinstein wrote:

> * Dashboard: Added support for NVMe/TCP (gateway groups, multiple
> namespaces),

I have just upgraded a test cluster, and NVMe-oF no longer works.

The previously running version was quay.ceph.io/ceph-ci/ceph:squid-nvmeof, and
NVMe-oF worked there.

The gateways appear to be running, and the subsystem and namespaces are still
configured, but an initiator does not see any namespaces.

# ceph orch ps --daemon_type nvmeof
NAME                              HOST           PORTS                   STATUS         REFRESHED  AGE  MEM USE  MEM LIM  VERSION  IMAGE ID      CONTAINER ID
nvmeof.hosa.b236asbrce101.ccaujq  b236asbrce101  *:5500,4420,8009,10008  running (13h)    36s ago  13h     128M        -  1.5.7    2c3e7ef350b8  74672e87a945
nvmeof.hosa.b236asbrce102.gnbwsc  b236asbrce102  *:5500,4420,8009,10008  running (13h)    36s ago  13h     120M        -  1.5.7    2c3e7ef350b8  1929979ab7e1
nvmeof.hosa.b236asbrce103.bthewi  b236asbrce103  *:5500,4420,8009,10008  running (13h)    36s ago  13h     122M        -  1.5.7    2c3e7ef350b8  14885171d1a5
nvmeof.hosa.b236asbrce104.vcgzvg  b236asbrce104  *:5500,4420,8009,10008  running (13h)    36s ago  13h     122M        -  1.5.7    2c3e7ef350b8  7b00b0f3356a
# ceph nvmeof subsystem list
+---------------------------------+------------------+--------------------+---------------+-------+--------------+--------------+--------------+-------------------+
|Nqn                              |Serial Number     |Model Number        |Namespace Count|Subtype|Max Namespaces|Has Dhchap Key|Allow Any Host|Created Without Key|
+---------------------------------+------------------+--------------------+---------------+-------+--------------+--------------+--------------+-------------------+
|nqn.2025-08.com.example:HOSA.hosa|Ceph30844860597194|Ceph bdev Controller|2              |NVMe   |128           |False         |True          |False              |
+---------------------------------+------------------+--------------------+---------------+-------+--------------+--------------+--------------+-------------------+
# ceph nvmeof namespace list nqn.2025-08.com.example:HOSA.hosa
+-----------------------------------------+---------+--------+--------+--------------+----------+-----------+-----------+------------+-------------+------------+-----+----+------------------------------------+---------------------------------+-----------+-------------------+---------+
|Bdev Name                                |Rbd Image|Rbd Pool|Lb Group|Rbd Image Size|Block Size|R/W Ios/Sec|R/W Mbs/Sec|Read Mbs/Sec|Write Mbs/Sec|Auto Visible|Hosts|Nsid|Uuid                                |Ns Subsystem Nqn                 |Trash Image|Disable Auto Resize|Read Only|
+-----------------------------------------+---------+--------+--------+--------------+----------+-----------+-----------+------------+-------------+------------+-----+----+------------------------------------+---------------------------------+-----------+-------------------+---------+
|bdev_57f2c81a-0b07-406b-84aa-c62d66b5f10d|ns01     |nvmeof  |5       |100GB         |512B      |0          |0          |0           |0            |True        |[]   |1   |57f2c81a-0b07-406b-84aa-c62d66b5f10d|nqn.2025-08.com.example:HOSA.hosa|False      |False              |False    |
|bdev_b4bf328a-8c67-4879-bf13-a799884d6dbe|ns02     |nvmeof  |5       |200GB         |512B      |0          |0          |0           |0            |True        |[]   |2   |b4bf328a-8c67-4879-bf13-a799884d6dbe|nqn.2025-08.com.example:HOSA.hosa|False      |False              |False    |
+-----------------------------------------+---------+--------+--------+--------------+----------+-----------+-----------+------------+-------------+------------+-----+----+------------------------------------+---------------------------------+-----------+-------------------+---------+
# ceph nvmeof listener list nqn.2025-08.com.example:HOSA.hosa
+-------------+------+--------------+------+------+-------+
|Host Name    |Trtype|Traddr        |Secure|Adrfam|Trsvcid|
+-------------+------+--------------+------+------+-------+
|b236asbrce101|TCP   |10.105.141.195|False |ipv4  |4420   |
|b236asbrce102|TCP   |10.105.141.196|False |ipv4  |4420   |
|b236asbrce103|TCP   |10.105.141.197|False |ipv4  |4420   |
|b236asbrce104|TCP   |10.105.141.198|False |ipv4  |4420   |
+-------------+------+--------------+------+------+-------+

On the initiator host:

# nvme discover -t tcp -a 10.105.141.195 -s 8009 -v
warning: use hostid which does not match uuid in hostnqn
nvme0: nqn.2014-08.org.nvmexpress.discovery connected

Discovery Log Number of Records 4, Generation counter 6
=====Discovery Log Entry 0======
trtype:  tcp
adrfam:  ipv4
subtype: nvme subsystem
treq:    not required
portid:  0
trsvcid: 4420
subnqn:  nqn.2025-08.com.example:HOSA.hosa
traddr:  10.105.141.195
eflags:  none
sectype: none
=====Discovery Log Entry 1======
trtype:  tcp
adrfam:  ipv4
subtype: nvme subsystem
treq:    not required
portid:  1
trsvcid: 4420
subnqn:  nqn.2025-08.com.example:HOSA.hosa
traddr:  10.105.141.196
eflags:  none
sectype: none
=====Discovery Log Entry 2======
trtype:  tcp
adrfam:  ipv4
subtype: nvme subsystem
treq:    not required
portid:  2
trsvcid: 4420
subnqn:  nqn.2025-08.com.example:HOSA.hosa
traddr:  10.105.141.197
eflags:  none
sectype: none
=====Discovery Log Entry 3======
trtype:  tcp
adrfam:  ipv4
subtype: nvme subsystem
treq:    not required
portid:  3
trsvcid: 4420
subnqn:  nqn.2025-08.com.example:HOSA.hosa
traddr:  10.105.141.198
eflags:  none
sectype: none
nvme0: nqn.2014-08.org.nvmexpress.discovery disconnected

# nvme connect-all -t tcp -a 10.105.141.195 -s 8009 -v
warning: use hostid which does not match uuid in hostnqn
nvme0: nqn.2014-08.org.nvmexpress.discovery connected
nvme1: nqn.2025-08.com.example:HOSA.hosa connected
nvme2: nqn.2025-08.com.example:HOSA.hosa connected
nvme3: nqn.2025-08.com.example:HOSA.hosa connected
nvme4: nqn.2025-08.com.example:HOSA.hosa connected
nvme0: nqn.2014-08.org.nvmexpress.discovery disconnected

So far so good, but no namespaces show up:

# nvme list
Node                  Generic               SN                   Model                                    Namespace  Usage                      Format           FW Rev
--------------------- --------------------- -------------------- ---------------------------------------- ---------- -------------------------- ---------------- --------
# nvme list -v
Subsystem        Subsystem-NQN                                                                                Controllers
---------------- -------------------------------------------------------------------------------------------- ----------------
nvme-subsys1     nqn.2025-08.com.example:HOSA.hosa                                                            nvme1, nvme2, nvme3, nvme4

Device   SN                   MN                                       FR       TxPort Address        Slot   Subsystem    Namespaces
-------- -------------------- ---------------------------------------- -------- ------ -------------- ------ ------------ ----------------
nvme1    Ceph30844860597194   Ceph bdev Controller                     25.05    tcp    traddr=10.105.141.195,trsvcid=4420,src_addr=10.105.141.212        nvme-subsys1
nvme2    Ceph30844860597194   Ceph bdev Controller                     25.05    tcp    traddr=10.105.141.196,trsvcid=4420,src_addr=10.105.141.212        nvme-subsys1
nvme3    Ceph30844860597194   Ceph bdev Controller                     25.05    tcp    traddr=10.105.141.197,trsvcid=4420,src_addr=10.105.141.212        nvme-subsys1
nvme4    Ceph30844860597194   Ceph bdev Controller                     25.05    tcp    traddr=10.105.141.198,trsvcid=4420,src_addr=10.105.141.212        nvme-subsys1

Device       Generic      NSID       Usage                      Format           Controllers
------------ ------------ ---------- -------------------------- ---------------- ----------------

# nvme list-ns /dev/nvme1 -v -a
opcode       : 06
flags        : 00
rsvd1        : 0000
nsid         : 00000000
cdw2         : 00000000
cdw3         : 00000000
data_len     : 00001000
metadata_len : 00000000
addr         : 616b6af40000
metadata     : 0
cdw10        : 00000010
cdw11        : 00000000
cdw12        : 00000000
cdw13        : 00000000
cdw14        : 00000000
cdw15        : 00000000
timeout_ms   : 00000000
result       : 00000000
err          : 2
latency      : 569 us
NVMe status: Invalid Field in Command: A reserved coded value or an unsupported value in a defined field(0x2)
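
For what it is worth, the failing command is an Identify (opcode 06) whose
cdw10 low byte, the CNS field, is 0x10. In the NVMe base specification that
value requests the Allocated Namespace ID list, which is what the -a flag of
nvme list-ns asks for; without -a, the Active Namespace ID list (CNS 0x02) is
requested instead. A minimal shell sketch for decoding that field follows; the
function name decode_cns is made up for illustration, and it only knows these
two CNS values:

```shell
# Hypothetical helper: decode the CNS field (bits 7:0 of cdw10) of an
# NVMe Identify command, given cdw10 as printed by nvme-cli (hex, no 0x).
decode_cns() {
    # Interpret the argument as hexadecimal and keep only the low byte.
    cns=$(( 0x$1 & 0xff ))
    case "$cns" in
        2)  echo "CNS 0x02: Active Namespace ID list" ;;
        16) echo "CNS 0x10: Allocated Namespace ID list" ;;
        *)  printf 'CNS 0x%02x: not decoded in this sketch\n' "$cns" ;;
    esac
}

# cdw10 value taken from the nvme list-ns output above.
decode_cns 00000010
```

Whether the gateway rejects only CNS 0x10 or also CNS 0x02 cannot be told from
the output above; running nvme list-ns /dev/nvme1 without -a would show that.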

How do I make it work again?

Regards
--
Robert Sander
Linux Consultant

Heinlein Consulting GmbH
Schwedter Str. 8/9b, 10119 Berlin

https://www.heinlein-support.de

Tel: +49 30 405051 - 0
Fax: +49 30 405051 - 19

Amtsgericht Berlin-Charlottenburg - HRB 220009 B
Geschäftsführer: Peer Heinlein - Sitz: Berlin
_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io
