Editing crashreport #71896

ReasonCrashing FunctionWhere to cut BacktraceReports Count
ASSERTION( md->md_handler != handler ) failedlnet_assert_handler_unusedlnet_assert_handler_unused
LNetNIFini
lnet_unconfigure
genl_family_rcv_msg_doit
genl_family_rcv_msg
genl_rcv_msg
netlink_rcv_skb
genl_rcv
netlink_unicast
netlink_sendmsg
____sys_sendmsg
___sys_sendmsg
__sys_sendmsg
do_syscall_64
entry_SYSCALL_64_after_hwframe
23

Added fields:

Match messages in logs
(every line would be required to be present in log output
Copy from "Messages before crash" column below):
Match messages in full crash
(every line would be required to be present in crash log output
Copy from "Full Crash" column below):
Limit to a test:
(Copy from below "Failing text"):
Delete these reports as invalid (real bug in review or some such)
Bug or comment:
Extra info:

Failures list (last 100):

Failing TestFull CrashMessages before crashComment
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1191577:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1191577:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1191577 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1.el9_5.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit+0xdc/0x130
genl_family_rcv_msg+0x14d/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __kmem_cache_alloc_node+0x18f/0x2e0
? netlink_realloc_groups+0xbe/0x120
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x248/0x370
netlink_sendmsg+0x206/0x440
____sys_sendmsg+0x38b/0x3b0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? syscall_exit_work+0x103/0x130
? do_syscall_64+0x6b/0xf0
? __sys_recvmsg+0x56/0xa0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5f/0xf0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? handle_mm_fault+0x116/0x270
? do_user_addr_fault+0x1d6/0x6a0
? do_syscall_64+0x6b/0xf0
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x78/0x80
RIP: 0033:0x7f86d270fa17
LNet: 1190047:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.25.232@tcp
LNet: Removed LNI 10.240.25.232@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.25.232@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.176@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.176@tcp
LNet: There was an unexpected network error while writing to 10.240.28.176: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-70vm1.onyx.whamcloud.com to discover 10.240.28.176@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-70vm1.onyx.whamcloud.com to discover 10.240.28.176@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.176@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-105vm13 to discover 10.240.25.232@tcp
Lustre: DEBUG MARKER: Force onyx-105vm13 to discover 10.240.25.232@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1190920
Lustre: DEBUG MARKER: Wait for 1190920
LNet: There was an unexpected network error while writing to 10.240.28.176: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1190920
Lustre: DEBUG MARKER: Finished wait on 1190920
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1134379:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1134379:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1134379 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1.el9_5.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit+0xdc/0x130
genl_family_rcv_msg+0x14d/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __skb_datagram_iter+0x7c/0x2b0
? __pfx_simple_copy_to_iter+0x10/0x10
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x248/0x370
netlink_sendmsg+0x206/0x440
____sys_sendmsg+0x38b/0x3b0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? syscall_exit_work+0x103/0x130
? do_syscall_64+0x6b/0xf0
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? __sys_recvmsg+0x56/0xa0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5f/0xf0
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x78/0x80
RIP: 0033:0x7f8070d0fa17
LNet: 1132972:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.25.239@tcp
LNet: Removed LNI 10.240.25.239@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.25.239@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.161@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl net_drop add -s *@tcp -d *@tcp -r 1 -e local_error
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.161@tcp
LNet: There was an unexpected network error while writing to 10.240.25.161: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lnetctl net_drop del -a
Lustre: DEBUG MARKER: /usr/sbin/lnetctl fault delay add -s *@tcp -d 10.240.25.161@tcp -r 1 -m GET -l 3
Lustre: DEBUG MARKER: /usr/sbin/lnetctl net_drop add -s *@tcp -d 10.240.25.161@tcp -r 1 -m GET -e local_error
Lustre: DEBUG MARKER: /usr/sbin/lnetctl fault delay add -s *@tcp -d 10.240.25.161@tcp -r 1 -m PUT -l 6
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-70vm8.onyx.whamcloud.com to discover 10.240.25.161@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-70vm8.onyx.whamcloud.com to discover 10.240.25.161@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.161@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-66vm10 to discover 10.240.25.239@tcp
Lustre: DEBUG MARKER: Force onyx-66vm10 to discover 10.240.25.239@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1133780
Lustre: DEBUG MARKER: Wait for 1133780
LNet: There was an unexpected network error while writing to 10.240.25.161: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1133780
Lustre: DEBUG MARKER: Finished wait on 1133780
Lustre: DEBUG MARKER: /usr/sbin/lnetctl fault delay del -a
Lustre: DEBUG MARKER: /usr/sbin/lnetctl net_drop del -a
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1036612:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1036612:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1036612 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.38.1.el9_5.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit+0xdc/0x130
genl_family_rcv_msg+0x14d/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x248/0x370
netlink_sendmsg+0x206/0x440
____sys_sendmsg+0x38b/0x3b0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? netlink_recvmsg+0x212/0x290
? _copy_to_user+0x1a/0x30
? move_addr_to_user+0x4b/0xe0
? ____sys_recvmsg+0xeb/0x1b0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5f/0xf0
? idr_get_next_ul+0xb6/0xf0
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? netlink_setsockopt+0x2f7/0x430
? do_sock_setsockopt+0xb7/0x180
? __sys_setsockopt+0x75/0xc0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? do_user_addr_fault+0x1d6/0x6a0
? syscall_exit_work+0x103/0x130
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x78/0x80
RIP: 0033:0x7fbdf9b0fa17
LNet: 1035205:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.22.240@tcp
LNet: Removed LNI 10.240.22.240@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.22.240@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.190@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl fault drop add -s *@tcp -d *@tcp -r 1 -e local_error
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.190@tcp
LNet: There was an unexpected network error while writing to 10.240.28.190: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lnetctl fault drop del -a
Lustre: DEBUG MARKER: /usr/sbin/lnetctl fault delay add -s *@tcp -d 10.240.28.190@tcp -r 1 -m GET -l 3
Lustre: DEBUG MARKER: /usr/sbin/lnetctl net_drop add -s *@tcp -d 10.240.28.190@tcp -r 1 -m GET -e local_error
Lustre: DEBUG MARKER: /usr/sbin/lnetctl fault delay add -s *@tcp -d 10.240.28.190@tcp -r 1 -m PUT -l 6
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-32vm1.onyx.whamcloud.com to discover 10.240.28.190@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-32vm1.onyx.whamcloud.com to discover 10.240.28.190@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.190@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-106vm7 to discover 10.240.22.240@tcp
Lustre: DEBUG MARKER: Force onyx-106vm7 to discover 10.240.22.240@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1036013
LNet: There was an unexpected network error while writing to 10.240.28.190: rc = -22
Lustre: DEBUG MARKER: Wait for 1036013
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1036013
Lustre: DEBUG MARKER: Finished wait on 1036013
Lustre: DEBUG MARKER: /usr/sbin/lnetctl fault delay del -a
Lustre: DEBUG MARKER: /usr/sbin/lnetctl net_drop del -a
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1136674:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1136674:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1136674 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.38.1.el9_5.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit+0xdc/0x130
genl_family_rcv_msg+0x14d/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __check_object_size.part.0+0x35/0xd0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x248/0x370
netlink_sendmsg+0x206/0x440
____sys_sendmsg+0x38b/0x3b0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? do_syscall_64+0x6b/0xf0
? __sys_recvmsg+0x56/0xa0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? __check_object_size.part.0+0x35/0xd0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5f/0xf0
? ___sys_recvmsg+0x88/0xd0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? __sys_recvmsg+0x56/0xa0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x78/0x80
RIP: 0033:0x7f86aab0fa17
LNet: 1135229:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.43.140@tcp
LNet: Removed LNI 10.240.43.140@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.43.140@tcp [8/256/0/180]
LNet: Accept all, port 7988
Autotest: Test running for 255 minutes (lustre-reviews_review-ldiskfs_113261.33)
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.40.34@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.40.34@tcp
LNet: There was an unexpected network error while writing to 10.240.40.34: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force trevis-91vm1.trevis.whamcloud.com to discover 10.240.40.34@tcp \(in background\)
Lustre: DEBUG MARKER: Force trevis-91vm1.trevis.whamcloud.com to discover 10.240.40.34@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.40.34@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force trevis-47vm7 to discover 10.240.43.140@tcp
Lustre: DEBUG MARKER: Force trevis-47vm7 to discover 10.240.43.140@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1136077
Lustre: DEBUG MARKER: Wait for 1136077
LNet: There was an unexpected network error while writing to 10.240.40.34: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1136077
Lustre: DEBUG MARKER: Finished wait on 1136077
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1143920:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1143920:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1143920 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? krealloc+0xa5/0xd0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? move_addr_to_user+0x4b/0xe0
? ____sys_recvmsg+0xeb/0x1b0
? __import_iovec+0x46/0x150
? import_iovec+0x17/0x20
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? __sys_setsockopt+0xdc/0x1d0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
entry_SYSCALL_64_after_hwframe+0x77/0xe1
RIP: 0033:0x7f5c92f0f917
LNet: 1142520:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.24.89@tcp
LNet: Removed LNI 10.240.24.89@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.24.89@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.29.78@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.29.78@tcp
LNet: There was an unexpected network error while writing to 10.240.29.78: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-50vm2.onyx.whamcloud.com to discover 10.240.29.78@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-50vm2.onyx.whamcloud.com to discover 10.240.29.78@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.29.78@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-113vm11 to discover 10.240.24.89@tcp
Lustre: DEBUG MARKER: Force onyx-113vm11 to discover 10.240.24.89@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1143323
Lustre: DEBUG MARKER: Wait for 1143323
LNet: There was an unexpected network error while writing to 10.240.29.78: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1143323
Lustre: DEBUG MARKER: Finished wait on 1143323
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1365282:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1365282:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1365282 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 1.16.0-4.module+el8.8.0+1454+0b2cbfb8 04/01/2014
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? krealloc+0xa5/0xd0
? _copy_to_iter+0x1d4/0x630
? _copy_to_iter+0x1d4/0x630
genl_rcv_msg+0x47/0xa0
? __check_object_size.part.0+0x47/0xd0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? __import_iovec+0x46/0x150
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
? idr_get_next_ul+0xb6/0xf0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? do_syscall_64+0x69/0x90
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x77/0xe1
RIP: 0033:0x7fb3c130f917
LNet: 1363811:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.39.124@tcp
LNet: Removed LNI 10.240.39.124@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.39.124@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.39.89@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.39.89@tcp
LNet: There was an unexpected network error while writing to 10.240.39.89: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force trevis-39vm1.trevis.whamcloud.com to discover 10.240.39.89@tcp \(in background\)
Lustre: DEBUG MARKER: Force trevis-39vm1.trevis.whamcloud.com to discover 10.240.39.89@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.39.89@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force trevis-37vm6 to discover 10.240.39.124@tcp
Lustre: DEBUG MARKER: Force trevis-37vm6 to discover 10.240.39.124@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1364649
Lustre: DEBUG MARKER: Wait for 1364649
LNet: There was an unexpected network error while writing to 10.240.39.89: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1364649
Lustre: DEBUG MARKER: Finished wait on 1364649
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1129184:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1129184:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1129184 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __kmem_cache_alloc_node+0x1c7/0x2d0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? _copy_to_user+0x1a/0x30
? move_addr_to_user+0x4b/0xe0
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
? idr_get_next_ul+0xb6/0xf0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x77/0xe1
RIP: 0033:0x7fc0e770f917
LNet: 1127783:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.24.51@tcp
LNet: Removed LNI 10.240.24.51@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.24.51@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.185@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.185@tcp
LNet: There was an unexpected network error while writing to 10.240.28.185: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-48vm4.onyx.whamcloud.com to discover 10.240.28.185@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-48vm4.onyx.whamcloud.com to discover 10.240.28.185@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.185@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-106vm2 to discover 10.240.24.51@tcp
Lustre: DEBUG MARKER: Force onyx-106vm2 to discover 10.240.24.51@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1128587
Lustre: DEBUG MARKER: Wait for 1128587
LNet: There was an unexpected network error while writing to 10.240.28.185: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1128587
Lustre: DEBUG MARKER: Finished wait on 1128587
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1140984:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1140984:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1140984 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? skb_queue_tail+0x1b/0x50
? sock_def_readable+0x10/0xc0
? __netlink_sendskb+0x67/0x90
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? __import_iovec+0x46/0x150
? import_iovec+0x17/0x20
? idr_get_next_ul+0xb6/0xf0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
? do_user_addr_fault+0x1d6/0x6a0
? do_syscall_64+0x69/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x77/0xe1
RIP: 0033:0x7f8e5af0f917
LNet: 1139584:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.24.152@tcp
LNet: Removed LNI 10.240.24.152@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.24.152@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.193@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.193@tcp
LNet: There was an unexpected network error while writing to 10.240.28.193: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-53vm5.onyx.whamcloud.com to discover 10.240.28.193@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-53vm5.onyx.whamcloud.com to discover 10.240.28.193@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.193@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-106vm10 to discover 10.240.24.152@tcp
Lustre: DEBUG MARKER: Force onyx-106vm10 to discover 10.240.24.152@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1140387
Lustre: DEBUG MARKER: Wait for 1140387
LNet: There was an unexpected network error while writing to 10.240.28.193: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1140387
Lustre: DEBUG MARKER: Finished wait on 1140387
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1153163:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1153163:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1153163 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-503.15.1.el9_5.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit+0xdc/0x130
genl_family_rcv_msg+0x14d/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __check_object_size.part.0+0x47/0xd0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x248/0x370
netlink_sendmsg+0x206/0x440
____sys_sendmsg+0x38b/0x3b0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? idr_get_next_ul+0xb6/0xf0
? __wake_up+0x40/0x60
? netlink_setsockopt+0x2f7/0x430
? do_sock_setsockopt+0xb7/0x180
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5f/0xf0
? ___sys_recvmsg+0x88/0xd0
? __handle_mm_fault+0x2fb/0x690
? __sys_recvmsg+0x56/0xa0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
entry_SYSCALL_64_after_hwframe+0x78/0x80
RIP: 0033:0x7fa52310fa17
LNet: 1151705:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.25.235@tcp
LNet: Removed LNI 10.240.25.235@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.25.235@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.24.229@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.24.229@tcp
LNet: There was an unexpected network error while writing to 10.240.24.229: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-70vm4.onyx.whamcloud.com to discover 10.240.24.229@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-70vm4.onyx.whamcloud.com to discover 10.240.24.229@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.24.229@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-57vm2 to discover 10.240.25.235@tcp
Lustre: DEBUG MARKER: Force onyx-57vm2 to discover 10.240.25.235@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1152542
Lustre: DEBUG MARKER: Wait for 1152542
LNet: There was an unexpected network error while writing to 10.240.24.229: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1152542
Lustre: DEBUG MARKER: Finished wait on 1152542
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1153217:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1153217:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1153217 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-503.15.1.el9_5.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit+0xdc/0x130
genl_family_rcv_msg+0x14d/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __pfx_genl_rcv_msg+0x10/0x10
? netlink_rcv_skb+0x84/0x100
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x248/0x370
netlink_sendmsg+0x206/0x440
____sys_sendmsg+0x38b/0x3b0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? __sys_setsockopt+0x75/0xc0
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5f/0xf0
? __import_iovec+0x46/0x150
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
? ___sys_recvmsg+0x88/0xd0
? __pfx_lru_add_fn+0x10/0x10
? folio_batch_move_lru+0xd3/0x150
? __sys_recvmsg+0x56/0xa0
? __sys_recvmsg+0x56/0xa0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? handle_mm_fault+0x116/0x270
? do_user_addr_fault+0x1d6/0x6a0
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x78/0x80
RIP: 0033:0x7f277df0fa17
LNet: 1151761:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.25.238@tcp
LNet: Removed LNI 10.240.25.238@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.25.238@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.224@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.224@tcp
LNet: There was an unexpected network error while writing to 10.240.25.224: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-70vm7.onyx.whamcloud.com to discover 10.240.25.224@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-70vm7.onyx.whamcloud.com to discover 10.240.25.224@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.224@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-69vm13 to discover 10.240.25.238@tcp
Lustre: DEBUG MARKER: Force onyx-69vm13 to discover 10.240.25.238@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1152596
Lustre: DEBUG MARKER: Wait for 1152596
LNet: There was an unexpected network error while writing to 10.240.25.224: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1152596
Lustre: DEBUG MARKER: Finished wait on 1152596
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1128144:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1128144:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1128144 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 1.16.0-4.module+el8.8.0+1454+0b2cbfb8 04/01/2014
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? _copy_to_iter+0x1d4/0x630
? sock_def_readable+0x10/0xc0
? _copy_to_iter+0x1d4/0x630
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
? idr_get_next_ul+0xb6/0xf0
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? netlink_setsockopt+0x281/0x460
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? do_syscall_64+0x69/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x77/0xe1
RIP: 0033:0x7f32b630f917
LNet: 1126744:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.24.210@tcp
LNet: Removed LNI 10.240.24.210@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.24.210@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.195@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.195@tcp
LNet: There was an unexpected network error while writing to 10.240.28.195: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-56vm3.onyx.whamcloud.com to discover 10.240.28.195@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-56vm3.onyx.whamcloud.com to discover 10.240.28.195@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.195@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-106vm12 to discover 10.240.24.210@tcp
Lustre: DEBUG MARKER: Force onyx-106vm12 to discover 10.240.24.210@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1127547
Lustre: DEBUG MARKER: Wait for 1127547
LNet: There was an unexpected network error while writing to 10.240.28.195: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1127547
Lustre: DEBUG MARKER: Finished wait on 1127547
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1137822:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1137822:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1137822 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? netlink_ack+0x157/0x240
? __pfx_genl_rcv_msg+0x10/0x10
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? netlink_setsockopt+0x281/0x460
? __sys_setsockopt+0xdc/0x1d0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
entry_SYSCALL_64_after_hwframe+0x77/0xe1
RIP: 0033:0x7f634c50f917
LNet: 1136422:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.43.235@tcp
LNet: Removed LNI 10.240.43.235@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.43.235@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.44.85@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.44.85@tcp
LNet: There was an unexpected network error while writing to 10.240.44.85: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force trevis-96vm4.trevis.whamcloud.com to discover 10.240.44.85@tcp \(in background\)
Lustre: DEBUG MARKER: Force trevis-96vm4.trevis.whamcloud.com to discover 10.240.44.85@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.44.85@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force trevis-103vm10 to discover 10.240.43.235@tcp
Lustre: DEBUG MARKER: Force trevis-103vm10 to discover 10.240.43.235@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1137225
Lustre: DEBUG MARKER: Wait for 1137225
LNet: There was an unexpected network error while writing to 10.240.44.85: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1137225
Lustre: DEBUG MARKER: Finished wait on 1137225
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1154644:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1154644:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1154644 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __kmem_cache_alloc_node+0x1c7/0x2d0
? __alloc_skb+0x8e/0x1d0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? __sys_setsockopt+0xdc/0x1d0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x69/0x90
entry_SYSCALL_64_after_hwframe+0x77/0xe1
RIP: 0033:0x7fb4ab50f917
LNet: 1153244:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.27.188@tcp
LNet: Removed LNI 10.240.27.188@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.27.188@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.193@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.193@tcp
LNet: There was an unexpected network error while writing to 10.240.28.193: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-152vm9.onyx.whamcloud.com to discover 10.240.28.193@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-152vm9.onyx.whamcloud.com to discover 10.240.28.193@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.193@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-106vm10 to discover 10.240.27.188@tcp
Lustre: DEBUG MARKER: Force onyx-106vm10 to discover 10.240.27.188@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1154047
Lustre: DEBUG MARKER: Wait for 1154047
LNet: There was an unexpected network error while writing to 10.240.28.193: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1154047
Lustre: DEBUG MARKER: Finished wait on 1154047
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1141620:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1141620:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1141620 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-503.22.1.el9_5.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x43 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit+0xdc/0x130
genl_family_rcv_msg+0x14d/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x248/0x370
netlink_sendmsg+0x206/0x440
____sys_sendmsg+0x38b/0x3b0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? netlink_recvmsg+0x212/0x290
? __check_object_size.part.0+0x35/0xd0
? _copy_to_user+0x1a/0x30
? move_addr_to_user+0x4b/0xe0
? ____sys_recvmsg+0xeb/0x1b0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5f/0xf0
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? netlink_setsockopt+0x2f7/0x430
? do_sock_setsockopt+0xb7/0x180
? __sys_setsockopt+0x75/0xc0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? syscall_exit_to_user_mode+0x19/0x40
? do_syscall_64+0x6b/0xf0
? do_syscall_64+0x6b/0xf0
? do_user_addr_fault+0x1d6/0x6a0
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x78/0x80
RIP: 0033:0x7fe72f90fa17
LNet: 1140220:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.28.193@tcp
LNet: Removed LNI 10.240.28.193@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.28.193@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.130@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.130@tcp
LNet: There was an unexpected network error while writing to 10.240.23.130: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-106vm10.onyx.whamcloud.com to discover 10.240.23.130@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-106vm10.onyx.whamcloud.com to discover 10.240.23.130@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.130@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-39vm7 to discover 10.240.28.193@tcp
Lustre: DEBUG MARKER: Force onyx-39vm7 to discover 10.240.28.193@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1141023
Lustre: DEBUG MARKER: Wait for 1141023
LNet: There was an unexpected network error while writing to 10.240.23.130: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1141023
Lustre: DEBUG MARKER: Finished wait on 1141023
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1122043:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1122043:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1122043 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-427.31.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __kmem_cache_alloc_node+0x1c7/0x2d0
? __alloc_skb+0x8e/0x1d0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
? idr_get_next_ul+0xb6/0xf0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7f4a1cb0f917
LNet: 1120641:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.23.207@tcp
LNet: Removed LNI 10.240.23.207@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.23.207@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.187@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.187@tcp
LNet: There was an unexpected network error while writing to 10.240.23.187: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-43vm4.onyx.whamcloud.com to discover 10.240.23.187@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-43vm4.onyx.whamcloud.com to discover 10.240.23.187@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.187@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-42vm4 to discover 10.240.23.207@tcp
Lustre: DEBUG MARKER: Force onyx-42vm4 to discover 10.240.23.207@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1121446
Lustre: DEBUG MARKER: Wait for 1121446
LNet: There was an unexpected network error while writing to 10.240.23.187: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1121446
Lustre: DEBUG MARKER: Finished wait on 1121446
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1123793:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1123793:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1123793 Comm: lnetctl Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.31.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? ctrl_getfamily+0x16c/0x1b0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? __import_iovec+0x46/0x150
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
? ___sys_recvmsg+0x88/0xd0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? netlink_setsockopt+0x281/0x460
? __sys_setsockopt+0xdc/0x1d0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7f0395d0f917
LNet: 1122391:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.27.244@tcp
LNet: Removed LNI 10.240.27.244@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.27.244@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.241@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.241@tcp
LNet: There was an unexpected network error while writing to 10.240.25.241: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-96vm5.onyx.whamcloud.com to discover 10.240.25.241@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-96vm5.onyx.whamcloud.com to discover 10.240.25.241@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.241@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-70vm10 to discover 10.240.27.244@tcp
Lustre: DEBUG MARKER: Force onyx-70vm10 to discover 10.240.27.244@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1123196
Lustre: DEBUG MARKER: Wait for 1123196
LNet: There was an unexpected network error while writing to 10.240.25.241: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1123196
Lustre: DEBUG MARKER: Finished wait on 1123196
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1097910:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1097910:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1097910 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-427.31.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __check_object_size.part.0+0x47/0xd0
genl_rcv_msg+0x47/0xa0
? __pfx_simple_copy_to_iter+0x10/0x10
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? __sys_setsockopt+0xdc/0x1d0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x22/0x40
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7fdf4d10f917
LNet: 1096508:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.23.247@tcp
LNet: Removed LNI 10.240.23.247@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.23.247@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.207@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.207@tcp
LNet: There was an unexpected network error while writing to 10.240.23.207: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-45vm4.onyx.whamcloud.com to discover 10.240.23.207@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-45vm4.onyx.whamcloud.com to discover 10.240.23.207@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.23.207@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-43vm4 to discover 10.240.23.247@tcp
Lustre: DEBUG MARKER: Force onyx-43vm4 to discover 10.240.23.247@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1097313
Lustre: DEBUG MARKER: Wait for 1097313
LNet: There was an unexpected network error while writing to 10.240.23.207: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1097313
Lustre: DEBUG MARKER: Finished wait on 1097313
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1090725:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1090725:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1090725 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-427.31.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? netlink_rcv_skb+0x84/0x100
? _copy_to_iter+0x1d4/0x630
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7f9b0410f917
LNet: 1089323:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.27.46@tcp
LNet: Removed LNI 10.240.27.46@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.27.46@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.27.53@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.27.53@tcp
LNet: There was an unexpected network error while writing to 10.240.27.53: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-86vm7.onyx.whamcloud.com to discover 10.240.27.53@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-86vm7.onyx.whamcloud.com to discover 10.240.27.53@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.27.53@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-86vm14 to discover 10.240.27.46@tcp
Lustre: DEBUG MARKER: Force onyx-86vm14 to discover 10.240.27.46@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1090128
Lustre: DEBUG MARKER: Wait for 1090128
LNet: There was an unexpected network error while writing to 10.240.27.53: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1090128
Lustre: DEBUG MARKER: Finished wait on 1090128
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1104577:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1104577:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1104577 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-427.31.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? __sys_setsockopt+0xdc/0x1d0
? syscall_exit_work+0x103/0x130
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? do_syscall_64+0x69/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7fb95e70f917
LNet: 1103179:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.26.99@tcp
LNet: Removed LNI 10.240.26.99@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.26.99@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.26.108@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.26.108@tcp
LNet: There was an unexpected network error while writing to 10.240.26.108: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-76vm4.onyx.whamcloud.com to discover 10.240.26.108@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-76vm4.onyx.whamcloud.com to discover 10.240.26.108@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.26.108@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-76vm13 to discover 10.240.26.99@tcp
Lustre: DEBUG MARKER: Force onyx-76vm13 to discover 10.240.26.99@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1103980
Lustre: DEBUG MARKER: Wait for 1103980
LNet: There was an unexpected network error while writing to 10.240.26.108: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1103980
Lustre: DEBUG MARKER: Finished wait on 1103980
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1085356:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1085356:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1085356 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-427.31.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? krealloc+0xa5/0xd0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? move_addr_to_user+0x4b/0xe0
? ____sys_recvmsg+0xeb/0x1b0
? __import_iovec+0x46/0x150
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? netlink_setsockopt+0x281/0x460
? __sys_setsockopt+0xdc/0x1d0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7f5a6630f917
LNet: 1083955:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.28.167@tcp
LNet: Removed LNI 10.240.28.167@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.28.167@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.175@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.175@tcp
LNet: There was an unexpected network error while writing to 10.240.28.175: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-105vm4.onyx.whamcloud.com to discover 10.240.28.175@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-105vm4.onyx.whamcloud.com to discover 10.240.28.175@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.28.175@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-105vm12 to discover 10.240.28.167@tcp
Lustre: DEBUG MARKER: Force onyx-105vm12 to discover 10.240.28.167@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1084759
Lustre: DEBUG MARKER: Wait for 1084759
LNet: There was an unexpected network error while writing to 10.240.28.175: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1084759
Lustre: DEBUG MARKER: Finished wait on 1084759
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1102693:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1102693:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1102693 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-427.31.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? netlink_rcv_skb+0x84/0x100
? _copy_to_iter+0x1d4/0x630
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? _raw_spin_unlock_irqrestore+0xa/0x30
? __wake_up+0x40/0x60
? netlink_setsockopt+0x281/0x460
? __sys_setsockopt+0xdc/0x1d0
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7fa139b0f917
LNet: 1101290:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.26.139@tcp
LNet: Removed LNI 10.240.26.139@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.26.139@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.26.147@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.26.147@tcp
LNet: There was an unexpected network error while writing to 10.240.26.147: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-78vm4.onyx.whamcloud.com to discover 10.240.26.147@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-78vm4.onyx.whamcloud.com to discover 10.240.26.147@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.26.147@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-78vm12 to discover 10.240.26.139@tcp
Lustre: DEBUG MARKER: Force onyx-78vm12 to discover 10.240.26.139@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1102096
Lustre: DEBUG MARKER: Wait for 1102096
LNet: There was an unexpected network error while writing to 10.240.26.147: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1102096
Lustre: DEBUG MARKER: Finished wait on 1102096
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1100766:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1100766:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 0 PID: 1100766 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-427.31.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? __alloc_skb+0x8e/0x1d0
? __alloc_skb+0x8e/0x1d0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x24c/0x4c0
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? __import_iovec+0x46/0x150
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
? do_syscall_64+0x69/0x90
? do_syscall_64+0x69/0x90
? exc_page_fault+0x62/0x150
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7f481c30f917
LNet: 1099364:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.39.4@tcp
LNet: Removed LNI 10.240.39.4@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.39.4@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.39.5@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.39.5@tcp
LNet: There was an unexpected network error while writing to 10.240.39.5: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force trevis-33vm1.trevis.whamcloud.com to discover 10.240.39.5@tcp \(in background\)
Lustre: DEBUG MARKER: Force trevis-33vm1.trevis.whamcloud.com to discover 10.240.39.5@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.39.5@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force trevis-33vm2 to discover 10.240.39.4@tcp
Lustre: DEBUG MARKER: Force trevis-33vm2 to discover 10.240.39.4@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1100169
Lustre: DEBUG MARKER: Wait for 1100169
LNet: There was an unexpected network error while writing to 10.240.39.5: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1100169
Lustre: DEBUG MARKER: Finished wait on 1100169
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
sanity-lnet test 212: Check discovery refcount loss bug (LU-14627)
LNetError: 1062187:0:(lib-md.c:259:lnet_assert_handler_unused()) ASSERTION( md->md_handler != handler ) failed:
LNetError: 1062187:0:(lib-md.c:259:lnet_assert_handler_unused()) LBUG
CPU: 1 PID: 1062187 Comm: lnetctl Kdump: loaded Tainted: G W OE ------- --- 5.14.0-427.24.1.el9_4.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
Call Trace:
<TASK>
dump_stack_lvl+0x34/0x48
lbug_with_loc.cold+0x5/0x58 [libcfs]
lnet_assert_handler_unused+0x9c/0xd0 [lnet]
? __pfx_lnet_discovery_event_handler+0x10/0x10 [lnet]
LNetNIFini+0x9f/0x150 [lnet]
lnet_unconfigure+0x66/0x80 [lnet]
genl_family_rcv_msg_doit.isra.0+0xcb/0x120
genl_family_rcv_msg+0x14c/0x220
? __pfx_lnet_net_conf_cmd+0x10/0x10 [lnet]
? krealloc+0xa5/0xd0
genl_rcv_msg+0x47/0xa0
? __pfx_genl_rcv_msg+0x10/0x10
netlink_rcv_skb+0x57/0x100
genl_rcv+0x24/0x40
netlink_unicast+0x23e/0x360
netlink_sendmsg+0x238/0x480
____sys_sendmsg+0x31f/0x340
? import_iovec+0x17/0x20
? copy_msghdr_from_user+0x6d/0xa0
___sys_sendmsg+0x88/0xd0
? copy_msghdr_from_user+0x6d/0xa0
? __kmem_cache_alloc_node+0x1c7/0x2d0
? netlink_realloc_groups+0xbe/0x120
? idr_get_next_ul+0xb6/0xf0
__sys_sendmsg+0x59/0xa0
do_syscall_64+0x5c/0x90
? do_syscall_64+0x69/0x90
? do_syscall_64+0x69/0x90
? syscall_exit_work+0x103/0x130
? syscall_exit_to_user_mode+0x22/0x40
? do_syscall_64+0x69/0x90
entry_SYSCALL_64_after_hwframe+0x72/0xdc
RIP: 0033:0x7f4ee630f917
LNet: 1060787:0:(lib-ptl.c:966:lnet_clear_lazy_portal()) Active lazy portal 0 on exit
LNet: Removed LNI 10.240.25.232@tcp
LNet: Removed LNI 10.240.25.232@tcp1
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Key type .llcrypt unregistered
Key type ._llcrypt unregistered
Key type ._llcrypt registered
Key type .llcrypt registered
libcfs: HW NUMA nodes: 1, HW CPU cores: 2, npartitions: 2
alg: No test for adler32 (adler32-zlib)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure -a
LNet: Added LNI 10.240.25.232@tcp [8/256/0/180]
LNet: Accept all, port 7988
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Initial discovery
Lustre: DEBUG MARKER: Initial discovery
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.242@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: Fail local discover ping to set LNET_PEER_REDISCOVER flag
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.242@tcp
LNet: There was an unexpected network error while writing to 10.240.25.242: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-70vm1.onyx.whamcloud.com to discover 10.240.25.242@tcp \(in background\)
Lustre: DEBUG MARKER: Force onyx-70vm1.onyx.whamcloud.com to discover 10.240.25.242@tcp (in background)
Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover --force 10.240.25.242@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Force onyx-70vm11 to discover 10.240.25.232@tcp
Lustre: DEBUG MARKER: Force onyx-70vm11 to discover 10.240.25.232@tcp
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Wait for 1061590
Lustre: DEBUG MARKER: Wait for 1061590
LNet: There was an unexpected network error while writing to 10.240.25.242: rc = -22
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Finished wait on 1061590
Lustre: DEBUG MARKER: Finished wait on 1061590
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Link to test
Return to new crashes list