chenBright commented on PR #3290:
URL: https://github.com/apache/brpc/pull/3290#issuecomment-4545520919
> Is the error occurring during startup or a runtime error?
The error occurred at runtime.
> Could you please provide relevant environment information, including OS and CPU details
Some environment information:
```shell
uname -r
5.10.134-16.3.al8.x86_64
lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 24.04.1 LTS
Release: 24.04
Codename: noble
```
```shell
lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 52 bits physical, 57 bits virtual
Byte Order: Little Endian
CPU(s): 192
On-line CPU(s) list: 0-191
Vendor ID: GenuineIntel
BIOS Vendor ID: Intel(R) Corporation
Model name: Intel(R) Xeon(R) Platinum 8469C
BIOS Model name: Intel(R) Xeon(R) Platinum 8469C CPU @ 2.6GHz
BIOS CPU family: 179
CPU family: 6
Model: 143
Thread(s) per core: 2
Core(s) per socket: 48
Socket(s): 2
Stepping: 8
CPU(s) scaling MHz: 82%
CPU max MHz: 3800.0000
CPU min MHz: 800.0000
BogoMIPS: 5200.00
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cat_l2 cdp_l3 invpcid_single intel_ppin cdp_l2 ssbd mba ibrs ibpb stibp ibrs_enhanced tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a avx512f avx512dq rdseed adx smap avx512ifma clflushopt clwb intel_pt avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local split_lock_detect avx_vnni avx512_bf16 wbnoinvd dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp hwp_pkg_req hfi avx512vbmi umip pku ospke waitpkg avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg tme avx512_vpopcntdq rdpid bus_lock_detect cldemote movdiri movdir64b enqcmd fsrm uintr md_clear serialize tsxldtrk pconfig arch_lbr amx_bf16 avx512_fp16 amx_tile amx_int8 flush_l1d arch_capabilities
Virtualization features:
Virtualization: VT-x
Caches (sum of all):
L1d: 4.5 MiB (96 instances)
L1i: 3 MiB (96 instances)
L2: 192 MiB (96 instances)
L3: 195 MiB (2 instances)
NUMA:
NUMA node(s): 2
NUMA node0 CPU(s): 0-47,96-143
NUMA node1 CPU(s): 48-95,144-191
Vulnerabilities:
Itlb multihit: Not affected
L1tf: Not affected
Mds: Not affected
Meltdown: Not affected
Mmio stale data: Not affected
Retbleed: Not affected
Spec store bypass: Mitigation; Speculative Store Bypass disabled via prctl and seccomp
Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Spectre v2: Mitigation; Enhanced IBRS, IBPB conditional, RSB filling, PBRSB-eIBRS SW sequence
Srbds: Not affected
Tsx async abort: Not affected
```
Complete runtime log:
```shell
./ubring_performance_client -use_ubring=true -echo_attachment=true -attachment_size=6291456
I0526 23:04:09.249178 98087 0 /workspace/cgm/brpc/src/brpc/server.cpp:1232 StartInternal] Server[DummyServerOf(./ubring_performance_client)] is serving on port=8001.
I0526 23:04:09.249319 98087 0 /workspace/cgm/brpc/src/brpc/server.cpp:1235 StartInternal] Check out http://k8s-al-sh-gpu-rdma-h20-0032:8001 in web browser.
[Threads: 1, Depth: 1, Attachment: 6291456B, UBRING: yes, Echo: yes]
I0526 23:04:09.257395 98087 0 /workspace/cgm/brpc/src/brpc/ubshm/shm/shm_mgr.cpp:72 ShmMgrInit] shm mgr init success, shm type=1
I0526 23:04:09.267279 98099 0 /workspace/cgm/brpc/src/brpc/ubshm/ub_ring.cpp:269 UbrTrxHBCallback] Heartbeat cannot be started, wait connected state.
Avg-Latency: 0, 90th-Latency: 0, 99th-Latency: 0, 99.9th-Latency: 0, Throughput: 29.9741MB/s, QPS: 0k, Server CPU-utilization: 0%, Client CPU-utilization: 101%
[Threads: 2, Depth: 1, Attachment: 6291456B, UBRING: yes, Echo: yes]
I0526 23:04:30.303327 98099 0 /workspace/cgm/brpc/src/brpc/ubshm/ub_ring.cpp:269 UbrTrxHBCallback] Heartbeat cannot be started, wait connected state.
Avg-Latency: 0, 90th-Latency: 0, 99th-Latency: 0, 99.9th-Latency: 0, Throughput: 0.299211MB/s, QPS: 0k, Server CPU-utilization: 0%, Client CPU-utilization: 102%
[Threads: 4, Depth: 1, Attachment: 6291456B, UBRING: yes, Echo: yes]
W0526 23:04:50.475469 98092 4294969093 /workspace/cgm/brpc/src/brpc/ubshm/ub_endpoint.cpp:385 ProcessHandshakeAtClient] Fail to get hello message from server:brpc::Socket{id=5 fd=14 addr=0.0.0.0:8002:57824} (0x564645f47910): Got EOF
W0526 23:04:50.475563 98087 0 /workspace/cgm/brpc/example/ubring_performance/client.cpp:131 Init] RPC call failed, retrying... (3 left): [E1014]Fail to complete ubring handshake from brpc::Socket{id=5 fd=14 addr=0.0.0.0:8002:57824} (0x564645f47910): Got EOF
W0526 23:04:51.475721 98087 0 /workspace/cgm/brpc/example/ubring_performance/client.cpp:131 Init] RPC call failed, retrying... (2 left): [E112]Not connected to 0.0.0.0:8002 yet, server_id=5
W0526 23:04:52.475883 98087 0 /workspace/cgm/brpc/example/ubring_performance/client.cpp:131 Init] RPC call failed, retrying... (1 left): [E112]Not connected to 0.0.0.0:8002 yet, server_id=5
E0526 23:04:53.476011 98087 0 /workspace/cgm/brpc/example/ubring_performance/client.cpp:135 Init] RPC call failed after multiple retries
```
```shell
./ubring_performance_server -use_ubring=true
I0526 23:00:15.982886 97452 0 /brpc/src/brpc/ubshm/shm/shm_mgr.cpp:72 ShmMgrInit] shm mgr init success, shm type=1
I0526 23:00:15.997779 97452 0 /brpc/src/brpc/server.cpp:1232 StartInternal] Server[test::PerfTestServiceImpl] is serving on port=8002.
I0526 23:00:15.998154 97452 0 /brpc/src/brpc/server.cpp:1235 StartInternal] Check out http://k8s-al-sh-gpu-rdma-h20-0032:8002 in web browser.
I0526 23:00:46.670268 97457 4294969601 /brpc/src/brpc/ubshm/ub_ring.cpp:1021 UbrTrxCloseCheck] Trx close skipped, already closing, trx local name=UBRING_127.0.0.1:35304_S
I0526 23:00:46.670297 97457 4294969601 /brpc/src/brpc/ubshm/ub_ring.cpp:62 UbrTrxClose] Trx close skipped, already closing, local name=UBRING_127.0.0.1:35304_S
I0526 23:00:56.666588 97464 0 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:185 IpcShmRemoteFree] IPC free remote shm=UBRING_127.0.0.1:35304_C success.
I0526 23:00:56.666952 97464 0 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:78 IpcShmMunmap] IPC unmap shm=UBRING_127.0.0.1:35304_S length=4194304 success.
I0526 23:00:56.667327 97464 0 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:185 IpcShmRemoteFree] IPC free remote shm=UBRING_127.0.0.1:35304_C success.
E0526 23:02:17.708842 97484 4294969601 /brpc/src/brpc/ubshm/common/common.h:173 HasTimedOut] task time out 5 seconds.
W0526 23:02:17.708876 97484 4294969601 /brpc/src/brpc/ubshm/ub_ring.cpp:85 UbrTrxClose] Local shm UBRING_127.0.0.1:41854_S wait for the peer to close timed out, force cleanup.
I0526 23:02:17.709291 97484 4294969601 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:185 IpcShmRemoteFree] IPC free remote shm=UBRING_127.0.0.1:41854_C success.
I0526 23:02:17.709631 97484 4294969601 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:78 IpcShmMunmap] IPC unmap shm=UBRING_127.0.0.1:41854_S length=4194304 success.
I0526 23:02:17.709974 97484 4294969601 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:185 IpcShmRemoteFree] IPC free remote shm=UBRING_127.0.0.1:41854_C success.
[1] 97452 bus error (core dumped) ./ubring_performance_server -use_ubring=true
```
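A note on reading `QPS: 0k` in the client log: with a 6291456-byte (6 MiB) attachment, the reported throughput may simply correspond to only a few requests per second, which would print as `0k`. The sketch below is plain arithmetic on the logged numbers, not a diagnosis. It assumes the throughput figure counts the attachment bytes once per request, and since the log does not say whether `MB` means 10^6 or 2^20 bytes, it computes both. The function name `requests_per_second` is made up for this illustration.

```python
# Values copied from the client log; everything else is an assumption.
ATTACHMENT_BYTES = 6291456  # -attachment_size=6291456, i.e. 6 * 2**20

# threads -> throughput in MB/s as printed by the client
REPORTED_THROUGHPUT = {1: 29.9741, 2: 0.299211}


def requests_per_second(mb_per_s, attachment_bytes, bytes_per_mb):
    """Implied request rate if each request moves one attachment.

    Assumption: throughput counts the attachment once per request.
    If the echoed copy is counted as well, halve the result.
    """
    return mb_per_s * bytes_per_mb / attachment_bytes


for threads, mb_per_s in REPORTED_THROUGHPUT.items():
    for label, unit in (("MB = 10^6 B", 10**6), ("MB = 2^20 B", 2**20)):
        rate = requests_per_second(mb_per_s, ATTACHMENT_BYTES, unit)
        print(f"threads={threads} ({label}): {rate:.3f} requests/s")
```

Under either unit the 1-thread run comes out at roughly 5 requests per second and the 2-thread run at well under 1, so both are far below the 1000 requests per second needed to show as `1k`.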
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]