chenBright commented on PR #3290:
URL: https://github.com/apache/brpc/pull/3290#issuecomment-4545520919

   > Is the error occurring during startup or a runtime error?
   
   The error occurred at runtime.
   
   > Could you please provide relevant environment information, including OS and CPU details
   
   Some environment information:
   
   ```shell
   uname -r
   
   5.10.134-16.3.al8.x86_64
   
   lsb_release -a
   
   No LSB modules are available.
   Distributor ID:      Ubuntu
   Description: Ubuntu 24.04.1 LTS
   Release:     24.04
   Codename:    noble
   ```
   
   ```shell
   lscpu
   
   Architecture:            x86_64
     CPU op-mode(s):        32-bit, 64-bit
     Address sizes:         52 bits physical, 57 bits virtual
     Byte Order:            Little Endian
   CPU(s):                  192
     On-line CPU(s) list:   0-191
   Vendor ID:               GenuineIntel
     BIOS Vendor ID:        Intel(R) Corporation
     Model name:            Intel(R) Xeon(R) Platinum 8469C
       BIOS Model name:     Intel(R) Xeon(R) Platinum 8469C  CPU @ 2.6GHz
       BIOS CPU family:     179
       CPU family:          6
       Model:               143
       Thread(s) per core:  2
       Core(s) per socket:  48
       Socket(s):           2
       Stepping:            8
       CPU(s) scaling MHz:  82%
       CPU max MHz:         3800.0000
       CPU min MHz:         800.0000
       BogoMIPS:            5200.00
       Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault epb cat_l3 cat_l2 cdp_l3 invpcid_single intel_ppin cdp_l2 ssbd mba ibrs ibpb stibp ibrs_enhanced tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm cqm rdt_a avx512f avx512dq rdseed adx smap avx512ifma clflushopt clwb intel_pt avx512cd sha_ni avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local split_lock_detect avx_vnni avx512_bf16 wbnoinvd dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp hwp_pkg_req hfi avx512vbmi umip pku ospke waitpkg avx512_vbmi2 gfni vaes vpclmulqdq avx512_vnni avx512_bitalg tme avx512_vpopcntdq rdpid bus_lock_detect cldemote movdiri movdir64b enqcmd fsrm uintr md_clear serialize tsxldtrk pconfig arch_lbr amx_bf16 avx512_fp16 amx_tile amx_int8 flush_l1d arch_capabilities
   Virtualization features:
     Virtualization:        VT-x
   Caches (sum of all):
     L1d:                   4.5 MiB (96 instances)
     L1i:                   3 MiB (96 instances)
     L2:                    192 MiB (96 instances)
     L3:                    195 MiB (2 instances)
   NUMA:
     NUMA node(s):          2
     NUMA node0 CPU(s):     0-47,96-143
     NUMA node1 CPU(s):     48-95,144-191
   Vulnerabilities:
     Itlb multihit:         Not affected
     L1tf:                  Not affected
     Mds:                   Not affected
     Meltdown:              Not affected
     Mmio stale data:       Not affected
     Retbleed:              Not affected
     Spec store bypass:     Mitigation; Speculative Store Bypass disabled via prctl and seccomp
     Spectre v1:            Mitigation; usercopy/swapgs barriers and __user pointer sanitization
     Spectre v2:            Mitigation; Enhanced IBRS, IBPB conditional, RSB filling, PBRSB-eIBRS SW sequence
     Srbds:                 Not affected
     Tsx async abort:       Not affected
   ```
   
   Complete runtime log:
   
   ```shell
   ./ubring_performance_client -use_ubring=true -echo_attachment=true -attachment_size=6291456
   I0526 23:04:09.249178 98087     0 /workspace/cgm/brpc/src/brpc/server.cpp:1232 StartInternal] Server[DummyServerOf(./ubring_performance_client)] is serving on port=8001.
   I0526 23:04:09.249319 98087     0 /workspace/cgm/brpc/src/brpc/server.cpp:1235 StartInternal] Check out http://k8s-al-sh-gpu-rdma-h20-0032:8001 in web browser.
   [Threads: 1, Depth: 1, Attachment: 6291456B, UBRING: yes, Echo: yes]
   I0526 23:04:09.257395 98087     0 /workspace/cgm/brpc/src/brpc/ubshm/shm/shm_mgr.cpp:72 ShmMgrInit] shm mgr init success, shm type=1
   I0526 23:04:09.267279 98099     0 /workspace/cgm/brpc/src/brpc/ubshm/ub_ring.cpp:269 UbrTrxHBCallback] Heartbeat cannot be started, wait connected state.
   Avg-Latency: 0, 90th-Latency: 0, 99th-Latency: 0, 99.9th-Latency: 0, Throughput: 29.9741MB/s, QPS: 0k, Server CPU-utilization: 0%, Client CPU-utilization: 101%
   [Threads: 2, Depth: 1, Attachment: 6291456B, UBRING: yes, Echo: yes]
   I0526 23:04:30.303327 98099     0 /workspace/cgm/brpc/src/brpc/ubshm/ub_ring.cpp:269 UbrTrxHBCallback] Heartbeat cannot be started, wait connected state.
   Avg-Latency: 0, 90th-Latency: 0, 99th-Latency: 0, 99.9th-Latency: 0, Throughput: 0.299211MB/s, QPS: 0k, Server CPU-utilization: 0%, Client CPU-utilization: 102%
   [Threads: 4, Depth: 1, Attachment: 6291456B, UBRING: yes, Echo: yes]
   W0526 23:04:50.475469 98092 4294969093 /workspace/cgm/brpc/src/brpc/ubshm/ub_endpoint.cpp:385 ProcessHandshakeAtClient] Fail to get hello message from server:brpc::Socket{id=5 fd=14 addr=0.0.0.0:8002:57824} (0x564645f47910): Got EOF
   W0526 23:04:50.475563 98087     0 /workspace/cgm/brpc/example/ubring_performance/client.cpp:131 Init] RPC call failed, retrying... (3 left): [E1014]Fail to complete ubring handshake from brpc::Socket{id=5 fd=14 addr=0.0.0.0:8002:57824} (0x564645f47910): Got EOF
   W0526 23:04:51.475721 98087     0 /workspace/cgm/brpc/example/ubring_performance/client.cpp:131 Init] RPC call failed, retrying... (2 left): [E112]Not connected to 0.0.0.0:8002 yet, server_id=5
   W0526 23:04:52.475883 98087     0 /workspace/cgm/brpc/example/ubring_performance/client.cpp:131 Init] RPC call failed, retrying... (1 left): [E112]Not connected to 0.0.0.0:8002 yet, server_id=5
   E0526 23:04:53.476011 98087     0 /workspace/cgm/brpc/example/ubring_performance/client.cpp:135 Init] RPC call failed after multiple retries
   ```
   ```shell
   ./ubring_performance_server -use_ubring=true
   I0526 23:00:15.982886 97452     0 /brpc/src/brpc/ubshm/shm/shm_mgr.cpp:72 ShmMgrInit] shm mgr init success, shm type=1
   I0526 23:00:15.997779 97452     0 /brpc/src/brpc/server.cpp:1232 StartInternal] Server[test::PerfTestServiceImpl] is serving on port=8002.
   I0526 23:00:15.998154 97452     0 /brpc/src/brpc/server.cpp:1235 StartInternal] Check out http://k8s-al-sh-gpu-rdma-h20-0032:8002 in web browser.
   I0526 23:00:46.670268 97457 4294969601 /brpc/src/brpc/ubshm/ub_ring.cpp:1021 UbrTrxCloseCheck] Trx close skipped, already closing, trx local name=UBRING_127.0.0.1:35304_S
   I0526 23:00:46.670297 97457 4294969601 /brpc/src/brpc/ubshm/ub_ring.cpp:62 UbrTrxClose] Trx close skipped, already closing, local name=UBRING_127.0.0.1:35304_S
   I0526 23:00:56.666588 97464     0 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:185 IpcShmRemoteFree] IPC free remote shm=UBRING_127.0.0.1:35304_C success.
   I0526 23:00:56.666952 97464     0 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:78 IpcShmMunmap] IPC unmap shm=UBRING_127.0.0.1:35304_S length=4194304 success.
   I0526 23:00:56.667327 97464     0 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:185 IpcShmRemoteFree] IPC free remote shm=UBRING_127.0.0.1:35304_C success.
   E0526 23:02:17.708842 97484 4294969601 /brpc/src/brpc/ubshm/common/common.h:173 HasTimedOut] task time out 5 seconds.
   W0526 23:02:17.708876 97484 4294969601 /brpc/src/brpc/ubshm/ub_ring.cpp:85 UbrTrxClose] Local shm UBRING_127.0.0.1:41854_S wait for the peer to close timed out, force cleanup.
   I0526 23:02:17.709291 97484 4294969601 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:185 IpcShmRemoteFree] IPC free remote shm=UBRING_127.0.0.1:41854_C success.
   I0526 23:02:17.709631 97484 4294969601 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:78 IpcShmMunmap] IPC unmap shm=UBRING_127.0.0.1:41854_S length=4194304 success.
   I0526 23:02:17.709974 97484 4294969601 /brpc/src/brpc/ubshm/shm/shm_ipc.cpp:185 IpcShmRemoteFree] IPC free remote shm=UBRING_127.0.0.1:41854_C success.
   [1]    97452 bus error (core dumped)  ./ubring_performance_server -use_ubring=true
   ```
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

