Editing crashreport #71310

Reason: BUG: kernel NULL pointer dereference
Crashing Function: osd_iit_iget
Where to cut Backtrace:
osd_preload_next
osd_inode_iteration
osd_otable_it_next
lfsck_master_oit_engine
lfsck_master_engine
kthread
ret_from_fork
Reports Count: 26

Failures list (last 100):

Failing Test | Full Crash | Messages before crash | Comment
sanity-lfsck test 23b: LFSCK can repair dangling name entry (2)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0
Oops: 0000 [#1] PREEMPT SMP NOPTI
CPU: 0 PID: 391116 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-570.62.1_lustre.ddn1.el9.x86_64 #1
Hardware name: Red Hat KVM/RHEL, BIOS 1.16.3-2.el9_5.1 04/01/2014
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ff29590889dafbf8 EFLAGS: 00010256
RAX: 0000000000000000 RBX: ff1e8430aec50000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ff1e8430b41e46e8 R08: 0000000000000000 R09: 0000000000000000
R10: ff1e8430ce1b0ca0 R11: 0000000000000000 R12: 0000000000004e5c
R13: ff1e843097ef6880 R14: 0000000000000002 R15: ff1e84309f69c000
FS: 0000000000000000(0000) GS:ff1e8430fba00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000044e10002 CR4: 0000000000771ef0
PKRU: 55555554
Call Trace:
<TASK>
? srso_alias_return_thunk+0x5/0xfbef5
? show_trace_log_lvl+0x26e/0x2df
? show_trace_log_lvl+0x26e/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? osd_iit_iget+0x25f/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xc0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4f7/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x4a0
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x2e7/0x1180 [lfsck]
? __wake_up_common+0x75/0xa0
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
lfsck_master_engine+0x142/0x960 [lfsck]
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xdd/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x29/0x50
</TASK>
Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) lraft(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace intel_rapl_msr fscache intel_rapl_common netfs kvm_amd ccp rfkill kvm iTCO_wdt iTCO_vendor_support pcspkr i2c_i801 virtio_balloon i2c_smbus lpc_ich joydev sunrpc drm fuse dm_mod ext4 mbcache jbd2 ahci crct10dif_pclmul libahci crc32_pclmul crc32c_intel libata ghash_clmulni_intel virtio_net net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=145 fail_loc=0x1621
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0 fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -C
Link to test
sanity-lfsck test 17: LFSCK can repair multiple references
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0
Oops: 0000 [#1] PREEMPT SMP NOPTI
CPU: 1 PID: 996629 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-570.62.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM/RHEL, BIOS 1.16.3-2.el9_5.1 04/01/2014
RIP: 0010:osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ff524f0ec7823c00 EFLAGS: 00010256
RAX: 0000000000000000 RBX: ff3ab9bd85700000 RCX: 0000000000000004
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ff3ab9bdc88dd8a0 R08: 0000000000000000 R09: 0000000000000000
R10: ff3ab9bdd45fb8a0 R11: 0000000000000002 R12: 0000000000004e47
R13: ff3ab9bd9a6bc628 R14: 0000000000000002 R15: ff3ab9bda4bcf000
FS: 0000000000000000(0000) GS:ff3ab9bdfbb00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000107a10004 CR4: 0000000000771ef0
PKRU: 55555554
Call Trace:
<TASK>
? srso_alias_return_thunk+0x5/0xfbef5
? show_trace_log_lvl+0x26e/0x2df
? show_trace_log_lvl+0x26e/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs]
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x501/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x4a0
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x2e7/0x1180 [lfsck]
? __wake_up_common+0x75/0xa0
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
lfsck_master_engine+0x142/0x960 [lfsck]
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xdd/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x29/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common kvm_amd ccp kvm iTCO_wdt iTCO_vendor_support i2c_i801 pcspkr i2c_smbus virtio_balloon lpc_ich joydev fuse drm dm_mod ext4 mbcache jbd2 crct10dif_pclmul crc32_pclmul crc32c_intel ahci libahci libata ghash_clmulni_intel virtio_net virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0 fail_loc=0x1614
Lustre: *** cfs_fail_loc=1614, val=0***
Lustre: Skipped 2 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 fail_val=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r
Link to test
sanity-lfsck test 17: LFSCK can repair multiple references
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0
Oops: 0000 [#1] PREEMPT SMP NOPTI
CPU: 0 PID: 339933 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.ddn1.el9.x86_64 #1
Hardware name: Red Hat KVM/RHEL, BIOS 1.16.3-2.el9_5.1 04/01/2014
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ff70f9b843013bf8 EFLAGS: 00010256
RAX: 0000000000000000 RBX: ff48fc860de18000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ff48fc86014f77b0 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e7c
R13: ff48fc8605abc620 R14: 0000000000000002 R15: ff48fc86014f6000
FS: 0000000000000000(0000) GS:ff48fc863ba00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000163810006 CR4: 0000000000771ef0
PKRU: 55555554
Call Trace:
<TASK>
? srso_alias_return_thunk+0x5/0xfbef5
? show_trace_log_lvl+0x26e/0x2df
? show_trace_log_lvl+0x26e/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? osd_iit_iget+0x25f/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4f7/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x550
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x2e7/0x1180 [lfsck]
? __wake_up_common+0x75/0xa0
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
lfsck_master_engine+0x142/0x960 [lfsck]
? srso_alias_return_thunk+0x5/0xfbef5
? srso_alias_return_thunk+0x5/0xfbef5
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xdd/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x29/0x50
</TASK>
Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) lraft(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 intel_rapl_msr dns_resolver intel_rapl_common nfs kvm_amd lockd grace ccp fscache netfs kvm rfkill iTCO_wdt iTCO_vendor_support virtio_balloon i2c_i801 pcspkr lpc_ich i2c_smbus joydev sunrpc drm fuse ext4 mbcache jbd2 ahci libahci libata crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel virtio_net net_failover failover virtio_blk serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0 fail_loc=0x1614
Lustre: *** cfs_fail_loc=1614, val=0***
Lustre: Skipped 2 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 fail_val=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r
Link to test
sanity-lfsck test 29c: verify linkEA size limitation
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 335152 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffa5db00a1fc58 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8e219a9b0000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8e21b17217d8 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e5b
R13: ffff8e21b7cd94c0 R14: 0000000000000002 R15: ffff8e21a6bf8000
FS: 0000000000000000(0000) GS:ffff8e21bbc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000103dc0003 CR4: 00000000003706f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs]
? osd_iit_iget+0x1f0/0x4d0 [osd_ldiskfs]
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x504/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x550
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? _raw_spin_unlock_irqrestore+0xa/0x30
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata virtio_net crct10dif_pclmul crc32_pclmul crc32c_intel net_failover failover ghash_clmulni_intel virtio_blk serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n debug
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=ha
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=super+ioctl+neterror+warning+dlmtrace+error+emerg+ha+rpctrace+vfstrace+config+console+lfsck
Lustre: 315954:0:(osd_internal.h:1339:osd_trans_exec_op()) lustre-MDT0000: opcode 2: before 260 < left 401, rollback = 2
Lustre: 315954:0:(osd_internal.h:1339:osd_trans_exec_op()) Skipped 19 previous similar messages
Lustre: 315954:0:(osd_handler.c:2072:osd_trans_dump_creds()) create: 0/0/0, destroy: 0/0/0
Lustre: 315954:0:(osd_handler.c:2072:osd_trans_dump_creds()) Skipped 19 previous similar messages
Lustre: 315954:0:(osd_handler.c:2079:osd_trans_dump_creds()) attr_set: 2/2/0, xattr_set: 6/401/0
Lustre: 315954:0:(osd_handler.c:2079:osd_trans_dump_creds()) Skipped 19 previous similar messages
Lustre: 315954:0:(osd_handler.c:2086:osd_trans_dump_creds()) write: 5/21/0, punch: 0/0/0, quota 1/3/0
Lustre: 315954:0:(osd_handler.c:2086:osd_trans_dump_creds()) Skipped 19 previous similar messages
Lustre: 315954:0:(osd_handler.c:2096:osd_trans_dump_creds()) insert: 2/72/0, delete: 2/2/0
Lustre: 315954:0:(osd_handler.c:2096:osd_trans_dump_creds()) Skipped 19 previous similar messages
Lustre: 315954:0:(osd_handler.c:2103:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 1/1/0
Lustre: 315954:0:(osd_handler.c:2103:osd_trans_dump_creds()) Skipped 19 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test
sanity-lfsck test 31e: Re-generate the lost slave LMV EA for striped directory (1)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 381603 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.ddn1.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb9738a657c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff9783b6c60000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff9783bb291710 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e69
R13: ffff9783b4fc3bc0 R14: 0000000000000002 R15: ffff9783d5f4b000
FS: 0000000000000000(0000) GS:ffff97843cd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 00000000b8610004 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x550
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1150 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? _raw_spin_unlock_irqrestore+0xa/0x30
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon pcspkr i2c_piix4 joydev drm fuse ext4 mbcache jbd2 virtio_net ata_generic crct10dif_pclmul crc32_pclmul net_failover crc32c_intel ata_piix ghash_clmulni_intel libata virtio_blk failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162a fail_val=0
Lustre: 349063:0:(osd_internal.h:1332:osd_trans_exec_op()) lustre-MDT0000: opcode 8: before 260 < left 271, rollback = 9
Lustre: 349063:0:(osd_internal.h:1332:osd_trans_exec_op()) Skipped 18 previous similar messages
Lustre: 349063:0:(osd_handler.c:1962:osd_trans_dump_creds()) create: 3/12/0, destroy: 0/0/0
Lustre: 349063:0:(osd_handler.c:1962:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: 349063:0:(osd_handler.c:1969:osd_trans_dump_creds()) attr_set: 1/1/0, xattr_set: 5/314/0
Lustre: 349063:0:(osd_handler.c:1969:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: 349063:0:(osd_handler.c:1976:osd_trans_dump_creds()) write: 5/55/0, punch: 0/0/0, quota 1/3/0
Lustre: 349063:0:(osd_handler.c:1976:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: 349063:0:(osd_handler.c:1986:osd_trans_dump_creds()) insert: 13/271/0, delete: 0/0/0
Lustre: 349063:0:(osd_handler.c:1986:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: 349063:0:(osd_handler.c:1993:osd_trans_dump_creds()) ref_add: 7/7/0, ref_del: 0/0/0
Lustre: 349063:0:(osd_handler.c:1993:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test
sanity-lfsck test 39: LFSCK does not break foreign dir and reverse is also true
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 359346 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb44980fc7c58 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff91a307028000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff91a3003c5760 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e87
R13: ffff91a3017ce360 R14: 0000000000000000 R15: ffff91a3003c6000
FS: 0000000000000000(0000) GS:ffff91a37fc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 00000000883ea006 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs]
? osd_iit_iget+0x1f0/0x4d0 [osd_ldiskfs]
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x504/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x550
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? _raw_spin_unlock_irqrestore+0xa/0x30
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon i2c_piix4 pcspkr joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test
sanity-lfsck test 31d: Set broken striped directory (modified after broken) as read-only
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 372885 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.ddn1.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb66b42dc7c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff9e8a87708000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff9e8a7b037350 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e5d
R13: ffff9e8a891699a0 R14: 0000000000000002 R15: ffff9e8a81044000
FS: 0000000000000000(0000) GS:ffff9e8affc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000046eec006 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x550
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? _raw_spin_unlock_irqrestore+0xa/0x30
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill intel_rapl_msr intel_rapl_common i2c_piix4 virtio_balloon pcspkr joydev sunrpc drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1629
Lustre: 343023:0:(osd_internal.h:1331:osd_trans_exec_op()) lustre-MDT0000: opcode 8: before 260 < left 271, rollback = 9
Lustre: 343023:0:(osd_internal.h:1331:osd_trans_exec_op()) Skipped 18 previous similar messages
Lustre: 343023:0:(osd_handler.c:1955:osd_trans_dump_creds()) create: 3/12/0, destroy: 0/0/0
Lustre: 343023:0:(osd_handler.c:1955:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: 343023:0:(osd_handler.c:1962:osd_trans_dump_creds()) attr_set: 1/1/0, xattr_set: 6/403/0
Lustre: 343023:0:(osd_handler.c:1962:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: 343023:0:(osd_handler.c:1969:osd_trans_dump_creds()) write: 5/55/0, punch: 0/0/0, quota 1/3/0
Lustre: 343023:0:(osd_handler.c:1969:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: 343023:0:(osd_handler.c:1979:osd_trans_dump_creds()) insert: 13/271/0, delete: 0/0/0
Lustre: 343023:0:(osd_handler.c:1979:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: 343023:0:(osd_handler.c:1986:osd_trans_dump_creds()) ref_add: 7/7/0, ref_del: 0/0/0
Lustre: 343023:0:(osd_handler.c:1986:osd_trans_dump_creds()) Skipped 18 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0
Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds1' ' /proc/mounts || true
Lustre: DEBUG MARKER: umount -d /mnt/lustre-mds1
Lustre: Failing over lustre-MDT0000
Lustre: server umount lustre-MDT0000 complete
LDISKFS-fs (dm-3): unmounting filesystem 834a400e-2536-4308-b1ee-d516827694eb.
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1; mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1
LDISKFS-fs (dm-3): mounted filesystem 834a400e-2536-4308-b1ee-d516827694eb r/w with ordered data mode. Quota mode: journalled.
Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect
Lustre: Skipped 1 previous similar message
Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 4 clients reconnect
Lustre: Skipped 1 previous similar message
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-24vm1.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-24vm1.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-24vm1.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-24vm1.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null
Lustre: lustre-MDT0000: Denying connection for new client 6c5c174d-a1f2-4e4a-960f-9a7b6cb9afba (at 10.240.43.22@tcp), waiting for 4 known clients (1 recovered, 1 in progress, and 0 evicted) to recover in 1:06
Lustre: Skipped 3 previous similar messages
Lustre: lustre-MDT0000: Recovery over after 0:05, of 4 clients 4 recovered and 0 were evicted.
Lustre: Skipped 1 previous similar message
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test
sanity-lfsck test 31h: Repair the corrupted shard's name entry
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 1525286 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.38.1_lustre.ddn1.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb04d03a37c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff9a9bef0a8000 RCX: 0000000000000004
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff9a9c19b640f8 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e71
R13: ffff9a9bc6864140 R14: 0000000000000002 R15: ffff9a9c1bf4d000
FS: 0000000000000000(0000) GS:ffff9a9c7fc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000090410006 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x550
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rdma_ucm(OE) rdma_cm(OE) iw_cm(OE) ib_ipoib(OE) ib_cm(OE) ib_umad(OE) mlx5_ib(OE) mlx5_core(OE) mlxdevm(OE) ib_uverbs(OE) ib_core(OE) psample mlxfw(OE) mlx_compat(OE) macsec tls pci_hyperv_intf intel_rapl_msr intel_rapl_common rfkill pcspkr virtio_balloon i2c_piix4 joydev sunrpc drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul libata crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162c fail_val=0
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
sanity-lfsck test 18h: LFSCK can repair crashed PFL extent range
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 157797 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffa20ac836bc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8b6ef8c30000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8b6ef6505300 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e55
R13: ffff8b6ef25914c0 R14: 0000000000000002 R15: ffff8b6ef6504000
FS: 0000000000000000(0000) GS:ffff8b6f7fc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 00000000018b6006 CR4: 00000000001706f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? down_write+0xe/0x60
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x223/0x550
? try_to_wake_up+0x254/0x5d0
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common i2c_piix4 virtio_balloon pcspkr joydev fuse drm ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul crc32_pclmul crc32c_intel libata virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
sanity-lfsck test 23c: LFSCK can repair dangling name entry (3)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 320275 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.ddn1.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb777490efc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff9e6d81db8000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff9e6d7002f760 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e45
R13: ffff9e6d46136360 R14: 0000000000000002 R15: ffff9e6d7002d000
FS: 0000000000000000(0000) GS:ffff9e6dffc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000075010004 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x223/0x550
? try_to_wake_up+0x254/0x5d0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs intel_rapl_msr intel_rapl_common rfkill virtio_balloon pcspkr i2c_piix4 joydev sunrpc drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel libata virtio_blk virtio_net net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n debug
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param debug=-1 debug_mb=150
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=super+ioctl+neterror+warning+dlmtrace+error+emerg+ha+rpctrace+vfstrace+config+console+lfsck
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0x9f fail_loc=0x1621
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0 fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=10 fail_loc=0x1602
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -C
sanity-lfsck test 29c: verify linkEA size limitation
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 327356 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb81b8bc77c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff95ee414d8000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff95ee462b6760 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e62
R13: ffff95ee4502b200 R14: 0000000000000002 R15: ffff95ee337bf000
FS: 0000000000000000(0000) GS:ffff95eebfc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 000000009b610004 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x223/0x550
? try_to_wake_up+0x3e2/0x5d0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon i2c_piix4 joydev pcspkr drm fuse ext4 mbcache jbd2 ata_generic crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ata_piix ghash_clmulni_intel libata virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n debug
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=0
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=super+ioctl+neterror+warning+dlmtrace+error+emerg+ha+rpctrace+vfstrace+config+console+lfsck
Lustre: 310285:0:(osd_internal.h:1324:osd_trans_exec_op()) lustre-MDT0000: opcode 2: before 260 < left 401, rollback = 2
Lustre: 310285:0:(osd_internal.h:1324:osd_trans_exec_op()) Skipped 83 previous similar messages
Lustre: 310285:0:(osd_handler.c:1961:osd_trans_dump_creds()) create: 0/0/0, destroy: 0/0/0
Lustre: 310285:0:(osd_handler.c:1961:osd_trans_dump_creds()) Skipped 83 previous similar messages
Lustre: 310285:0:(osd_handler.c:1968:osd_trans_dump_creds()) attr_set: 2/2/0, xattr_set: 6/401/0
Lustre: 310285:0:(osd_handler.c:1968:osd_trans_dump_creds()) Skipped 83 previous similar messages
Lustre: 310285:0:(osd_handler.c:1975:osd_trans_dump_creds()) write: 5/21/0, punch: 0/0/0, quota 1/3/0
Lustre: 310285:0:(osd_handler.c:1975:osd_trans_dump_creds()) Skipped 83 previous similar messages
Lustre: 310285:0:(osd_handler.c:1985:osd_trans_dump_creds()) insert: 2/72/0, delete: 2/2/0
Lustre: 310285:0:(osd_handler.c:1985:osd_trans_dump_creds()) Skipped 83 previous similar messages
Lustre: 310285:0:(osd_handler.c:1992:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 1/1/0
Lustre: 310285:0:(osd_handler.c:1992:osd_trans_dump_creds()) Skipped 83 previous similar messages
LustreError: 307371:0:(mdt_reint.c:2511:mdt_reint_migrate()) lustre-MDT0000: migrate [0x2400013a2:0x11:0x0]/f0 failed: rc = -75
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
sanity-lfsck test 18g: Find out orphan OST-object and repair it (7)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 157752 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.22.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb99787f37c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8dc43a278000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8dc441856300 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e81
R13: ffff8dc44889ed20 R14: 0000000000000000 R15: ffff8dc441854000
FS: 0000000000000000(0000) GS:ffff8dc4bfd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000031dfc005 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? __schedule+0x231/0x550
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? _raw_spin_unlock_irqrestore+0xa/0x30
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon i2c_piix4 pcspkr joydev fuse drm ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net net_failover ghash_clmulni_intel failover virtio_blk serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
sanity-lfsck test 18b: Find out orphan OST-object and repair it (2)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 304609 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffff9a90c6953c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8b153e3a8000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 000000000000000a RDI: 0000000000000000
RBP: ffff8b15255d5648 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e64
R13: ffff8b15556dd9a0 R14: 0000000000000002 R15: ffff8b15255d4000
FS: 0000000000000000(0000) GS:ffff8b15bfc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000038410005 CR4: 00000000001706f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x223/0x550
? try_to_wake_up+0x254/0x5d0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs intel_rapl_msr rfkill intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev sunrpc drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net libata ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1616
Lustre: *** cfs_fail_loc=1616, val=0***
Lustre: Skipped 3 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout --dryrun -o -r
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0000.lfsck_layout
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0000.lfsck_layout
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r -o
sanity-lfsck test 18d: Find out orphan OST-object and repair it (4)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 153739 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb93180fc7c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff9e24984c0000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000005 RDI: 0000000000000000
RBP: ffff9e24bb362328 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e63
R13: ffff9e2483d65e80 R14: 0000000000000002 R15: ffff9e24bb367000
FS: 0000000000000000(0000) GS:ffff9e24bbd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000103736005 CR4: 00000000003706e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x223/0x550
? try_to_wake_up+0x254/0x5d0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon pcspkr i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net virtio_blk ghash_clmulni_intel net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds2' ' /proc/mounts || true
Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds2
Lustre: lustre-MDT0001: Not available for connect from 10.240.24.214@tcp (stopping)
Lustre: Skipped 26 previous similar messages
Lustre: lustre-MDT0001 is waiting for obd_unlinked_exports more than 8 seconds. The obd refcount = 3. Is it stuck?
LDISKFS-fs (dm-3): unmounting filesystem 710190c8-0742-4318-9b16-4684beb799e3.
Lustre: server umount lustre-MDT0001 complete
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds4' ' /proc/mounts || true
Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds4
LDISKFS-fs (dm-4): unmounting filesystem b53e1310-47a0-46db-9c03-488ef6a0f246.
Lustre: server umount lustre-MDT0003 complete
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds2' ' /proc/mounts);
Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds4' ' /proc/mounts);
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-96vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-96vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds2
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds2_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds2_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds2_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds2_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds2; mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2
LDISKFS-fs (dm-3): mounted filesystem 710190c8-0742-4318-9b16-4684beb799e3 r/w with ordered data mode. Quota mode: journalled.
Lustre: lustre-MDT0001: Imperative Recovery not enabled, recovery window 60-180
Lustre: Skipped 7 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds2_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds2_flakey 2>/dev/null
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-96vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-96vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds4
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds4_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds4_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds4_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds4_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds4; mount -t lustre -o localrecov /dev/mapper/mds4_flakey /mnt/lustre-mds4
LDISKFS-fs (dm-4): mounted filesystem b53e1310-47a0-46db-9c03-488ef6a0f246 r/w with ordered data mode. Quota mode: journalled.
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds4_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds4_flakey 2>/dev/null
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm6.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: onyx-56vm6.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Using TIMEOUT=20
Lustre: DEBUG MARKER: Using TIMEOUT=20
Lustre: DEBUG MARKER: [ -f /sys/module/mgc/parameters/mgc_requeue_timeout_min ] && echo 1 > /sys/module/mgc/parameters/mgc_requeue_timeout_min; exit 0
sanity-lfsck test 31h: Repair the corrupted shard's name entry
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 343430 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffff9ccb88cdfc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8d7889a40000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8d787b94e710 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e50
R13: ffff8d788c1b0b00 R14: 0000000000000002 R15: ffff8d7874c91000
FS: 0000000000000000(0000) GS:ffff8d78ffd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000070e10001 CR4: 00000000001706e0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic crct10dif_pclmul crc32_pclmul crc32c_intel ata_piix ghash_clmulni_intel virtio_net libata net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162c fail_val=0
Lustre: 313034:0:(osd_internal.h:1312:osd_trans_exec_op()) lustre-MDT0000: opcode 8: before 259 < left 271, rollback = 9
Lustre: 313034:0:(osd_internal.h:1312:osd_trans_exec_op()) Skipped 58 previous similar messages
Lustre: 313034:0:(osd_handler.c:1949:osd_trans_dump_creds()) create: 3/12/1, destroy: 0/0/0
Lustre: 313034:0:(osd_handler.c:1949:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: 313034:0:(osd_handler.c:1956:osd_trans_dump_creds()) attr_set: 1/1/0, xattr_set: 6/403/0
Lustre: 313034:0:(osd_handler.c:1956:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: 313034:0:(osd_handler.c:1963:osd_trans_dump_creds()) write: 5/55/0, punch: 0/0/0, quota 1/3/0
Lustre: 313034:0:(osd_handler.c:1963:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: 313034:0:(osd_handler.c:1973:osd_trans_dump_creds()) insert: 13/271/0, delete: 0/0/0
Lustre: 313034:0:(osd_handler.c:1973:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: 313034:0:(osd_handler.c:1980:osd_trans_dump_creds()) ref_add: 7/7/0, ref_del: 0/0/0
Lustre: 313034:0:(osd_handler.c:1980:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test
sanity-lfsck test 20b: Handle the orphan with dummy LOV EA slot properly - PFL case
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 317164 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffff9f53cbf13c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8de83a498000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8de842efd7b0 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e89
R13: ffff8de8564b8fe0 R14: 0000000000000002 R15: ffff8de83d580000
FS: 0000000000000000(0000) GS:ffff8de8bfd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 000000003d82a003 CR4: 00000000000606e0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover failover virtio_blk serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1616
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x161a
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=1
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 fail_val=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r -o
Link to test
sanity-lfsck test 28: Skip the failed MDT(s) when handle orphan MDT-objects
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 330643 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffff9f498a8fbc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8ad63ef38000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8ad6bc498800 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e54
R13: ffff8ad644c5cb00 R14: 0000000000000002 R15: ffff8ad647938000
FS: 0000000000000000(0000) GS:ffff8ad6bcc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 000000000381c002 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1624
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test
sanity-lfsck test 36b: rebuild LOV EA for mirrored file (2)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 179178 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffbc3907457c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff941337900000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff94134d186300 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e62
R13: ffff9413357da840 R14: 0000000000000002 R15: ffff941333c19000
FS: 0000000000000000(0000) GS:ffff9413bfc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000067610006 CR4: 00000000003706f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon pcspkr i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic crct10dif_pclmul crc32_pclmul crc32c_intel ata_piix libata virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Link to test
sanity-lfsck test 2e: namespace LFSCK can verify remote object linkEA
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 219169 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffaf1d44c4bc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8daefd630000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 000000000000000a RDI: 0000000000000000
RBP: ffff8daef7a50698 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e79
R13: ffff8daefd08c140 R14: 0000000000000002 R15: ffff8daef679a000
FS: 0000000000000000(0000) GS:ffff8daf7fd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 000000000562e005 CR4: 00000000000606e0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs sunrpc intel_rapl_msr rfkill intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev fuse drm ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul libata crc32_pclmul crc32c_intel virtio_net virtio_blk net_failover ghash_clmulni_intel failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1603
Lustre: *** cfs_fail_loc=1603, val=0***
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test
sanity-lfsck test 18h: LFSCK can repair crashed PFL extent range
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 933992 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.31.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffb6bd85bcbc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff9c9d33d80000 RCX: 0000000000000004
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff9c9d171a6738 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e64
R13: ffff9c9d35a36840 R14: 0000000000000002 R15: ffff9c9d171a4000
FS: 0000000000000000(0000) GS:ffff9c9dbfd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 00000000ac010002 CR4: 00000000000606e0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0000.lfsck_layout |
awk '/^status/ { print $2 }'
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x223/0x550
? try_to_wake_up+0x254/0x5d0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs sunrpc rfkill intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162f
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r -o
Link to test
conf-sanity test 61b: large xattr
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 993232 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffaa58870fbc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff9b622a790000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000005 RDI: 0000000000000000
RBP: ffff9b610d13e418 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 000000000000d061
R13: ffff9b6239693200 R14: 0000000000000002 R15: ffff9b622c58d000
FS: 0000000000000000(0000) GS:ffff9b623bc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 000000010361c002 CR4: 00000000003706f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_flakey nfsv3 nfs_acl loop tls dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev fuse drm ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: libcfs]
CR2: 0000000000000000
Lustre: DEBUG MARKER: dumpe2fs -h /dev/mapper/mds1_flakey 2>&1 |
Lustre: DEBUG MARKER: dumpe2fs -h /dev/mapper/mds1_flakey 2>&1 |
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1; mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1
LDISKFS-fs (dm-4): mounted filesystem with ordered data mode. Quota mode: journalled.
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds3_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3; mount -t lustre -o localrecov /dev/mapper/mds3_flakey /mnt/lustre-mds3
LDISKFS-fs (dm-5): mounted filesystem with ordered data mode. Quota mode: journalled.
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm9.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm9.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount \(FULL\|IDLE\) osc.lustre-OST0000-osc-[-0-9a-f]\*.ost_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount \(FULL\|IDLE\) osc.lustre-OST0000-osc-[-0-9a-f]\*.ost_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds1' ' /proc/mounts || true
Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds1
Lustre: server umount lustre-MDT0000 complete
LDISKFS-fs (dm-4): unmounting filesystem.
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds3' ' /proc/mounts || true
Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds3
LDISKFS-fs (dm-5): unmounting filesystem.
Lustre: server umount lustre-MDT0002 complete
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: debugfs -R 'stat /ROOT/panda' /dev/mapper/mds1_flakey | grep trusted.big
Lustre: DEBUG MARKER: debugfs -w -R "ln <167> /lost+found" /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1; mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1
LDISKFS-fs (dm-4): mounted filesystem with ordered data mode. Quota mode: journalled.
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds3_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3; mount -t lustre -o localrecov /dev/mapper/mds3_flakey /mnt/lustre-mds3
LDISKFS-fs (dm-5): mounted filesystem with ordered data mode. Quota mode: journalled.
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-\*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-*.mds_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm9.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: onyx-144vm9.onyx.whamcloud.com: executing set_default_debug -1 all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount \(FULL\|IDLE\) osc.lustre-OST0000-osc-[-0-9a-f]\*.ost_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount \(FULL\|IDLE\) osc.lustre-OST0000-osc-[-0-9a-f]\*.ost_server_uuid
Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace
Link to test
sanity-lfsck test 28: Skip the failed MDT(s) when handle orphan MDT-objects
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 328180 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08
RSP: 0018:ffffacaeca8ffc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8b917ebd0000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8b9186938788 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e78
R13: ffff8b91757a0b00 R14: 0000000000000002 R15: ffff8b9174120000
FS: 0000000000000000(0000) GS:ffff8b91ffd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000038010001 CR4: 00000000000606e0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs]
? __getblk_gfp+0x28/0xd0
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul crc32_pclmul crc32c_intel libata virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1624
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test
sanity-lfsck test 22b: LFSCK can repair unmatched pairs (2)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 321164 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2b8/0x510 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 38 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 15 48 98 48 c1 e0 04 48 01 c7 41 8b 46
RSP: 0018:ffffb63647137c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8b9980ef0000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 000000000000000a RDI: 0000000000000000
RBP: ffff8b9983f10000 R08: 0000000000000000 R09: 0000000000000000
R10: ffff8b99845fd4c0 R11: 0000000000000018 R12: 0000000000004e45
R13: 0000000000000002 R14: ffff8b9987d0d670 R15: 0000000000000000
FS: 0000000000000000(0000) GS:ffff8b99ffd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 000000001bc10006 CR4: 00000000000606e0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2b8/0x510 [osd_ldiskfs]
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs sunrpc intel_rapl_msr intel_rapl_common rfkill pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover failover virtio_blk serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x161e
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -A -r
Link to test
sanity-lfsck test 18d: Find out orphan OST-object and repair it (4)
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 1 PID: 283224 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2b8/0x510 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 38 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 15 48 98 48 c1 e0 04 48 01 c7 41 8b 46
RSP: 0018:ffffa55006fabc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8fc0b69a8000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8fc0c1ae5000 R08: 0000000000000000 R09: 0000000000000000
R10: ffff8fc0cf386840 R11: 0000000000000018 R12: 0000000000004e8d
R13: 0000000000000002 R14: ffff8fc0b329a7d8 R15: 0000000000000000
FS: 0000000000000000(0000) GS:ffff8fc13fd00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000032a78006 CR4: 00000000000606e0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2b8/0x510 [osd_ldiskfs]
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill intel_rapl_msr intel_rapl_common i2c_piix4 sunrpc virtio_balloon pcspkr joydev drm fuse ext4 mbcache jbd2 ata_generic virtio_net ata_piix crct10dif_pclmul libata crc32_pclmul crc32c_intel net_failover ghash_clmulni_intel failover virtio_blk serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1618
Lustre: *** cfs_fail_loc=1618, val=0***
Lustre: Skipped 3 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0
Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds1' ' /proc/mounts || true
Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds1
LustreError: lustre-MDT0000-osp-MDT0002: operation mds_statfs to node 0@lo failed: rc = -107
LustreError: Skipped 9 previous similar messages
Lustre: server umount lustre-MDT0000 complete
LDISKFS-fs (dm-3): unmounting filesystem.
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Lustre: DEBUG MARKER: modprobe dm-flakey;
LustreError: lustre-MDT0000: not available for connect from 10.240.39.30@tcp (no target). If you are running an HA pair check that the target is mounted on the other server.
LustreError: Skipped 114 previous similar messages
Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds3' ' /proc/mounts || true
Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds3
LustreError: MGC10.240.39.29@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail
LustreError: Skipped 3 previous similar messages
Lustre: lustre-MDT0002 is waiting for obd_unlinked_exports more than 8 seconds. The obd refcount = 3. Is it stuck?
LDISKFS-fs (dm-4): unmounting filesystem.
Lustre: server umount lustre-MDT0002 complete
Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null &&
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds1' ' /proc/mounts);
Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds3' ' /proc/mounts);
Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds1' ' /proc/mounts);
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1; mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1
LDISKFS-fs (dm-3): mounted filesystem with ordered data mode. Quota mode: journalled.
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm7.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm7.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3
Lustre: DEBUG MARKER: modprobe dm-flakey;
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey >/dev/null 2>&1
Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey 2>&1
Lustre: DEBUG MARKER: test -b /dev/mapper/mds3_flakey
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey
Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3; mount -t lustre -o localrecov /dev/mapper/mds3_flakey /mnt/lustre-mds3
LDISKFS-fs (dm-4): mounted filesystem with ordered data mode. Quota mode: journalled.
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check
Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm7.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm7.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -P debug_raw_pointers=Y
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm2.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: trevis-34vm2.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all
Lustre: DEBUG MARKER: lctl get_param -n timeout
Lustre: DEBUG MARKER: /usr/sbin/lctl mark Using TIMEOUT=20
Lustre: DEBUG MARKER: Using TIMEOUT=20
Lustre: DEBUG MARKER: [ -f /sys/module/mgc/parameters/mgc_requeue_timeout_min ] && echo 1 > /sys/module/mgc/parameters/mgc_requeue_timeout_min; exit 0
Lustre: DEBUG MARKER: lctl dl | grep ' IN osc ' 2>/dev/null | wc -l
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -P lod.*.mdt_hash=crush
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r -o -c -d
Link to test
sanity-lfsck test 18f: Skip the failed OST(s) when handle orphan OST-objects
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 153929 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2b8/0x510 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 38 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 15 48 98 48 c1 e0 04 48 01 c7 41 8b 46
RSP: 0018:ffffb83887647c50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff8bedaf798000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff8bedbb667000 R08: 0000000000000000 R09: 0000000000000000
R10: ffff8bedcb014620 R11: 0000000000000018 R12: 0000000000004e79
R13: 0000000000000002 R14: ffff8bedbb664300 R15: 0000000000000000
FS: 0000000000000000(0000) GS:ffff8bee3fc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 0000000001bb8003 CR4: 00000000000606f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2b8/0x510 [osd_ldiskfs]
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? lfsck_layout_get_lovea+0x29/0xc0 [lfsck]
? down_write+0xe/0x60
? lfsck_layout_master_exec_oit+0x464/0x1190 [lfsck]
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? lu_object_put+0x222/0x410 [obdclass]
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon pcspkr i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata virtio_net crct10dif_pclmul crc32_pclmul crc32c_intel net_failover virtio_blk failover ghash_clmulni_intel serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1616
Lustre: *** cfs_fail_loc=1616, val=0***
Lustre: Skipped 1 previous similar message
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0001.lfsck_layout |
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0003.lfsck_layout |
Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0001.lfsck_layout
Link to test
sanity-lfsck test 31h: Repair the corrupted shard's name entry
BUG: kernel NULL pointer dereference, address: 0000000000000000
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#1] PREEMPT SMP PTI
CPU: 0 PID: 315721 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1
Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
RIP: 0010:osd_iit_iget+0x2b8/0x510 [osd_ldiskfs]
Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 38 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 15 48 98 48 c1 e0 04 48 01 c7 41 8b 46
RSP: 0018:ffffaeed0143fc50 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff9850bd408000 RCX: 0000000000000003
RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000
RBP: ffff9850b8385000 R08: 0000000000000000 R09: 0000000000000000
R10: ffff9850863eb200 R11: 0000000000000018 R12: 0000000000004e47
R13: 0000000000000002 R14: ffff9850b8382788 R15: 0000000000000000
FS: 0000000000000000(0000) GS:ffff98513fc00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 000000009ea10003 CR4: 00000000001706f0
Call Trace:
<TASK>
? show_trace_log_lvl+0x1c4/0x2df
? show_trace_log_lvl+0x1c4/0x2df
? osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
? __die_body.cold+0x8/0xd
? page_fault_oops+0x134/0x170
? kernelmode_fixup_or_oops+0x84/0x110
? exc_page_fault+0x62/0x150
? asm_exc_page_fault+0x22/0x30
? osd_iit_iget+0x2b8/0x510 [osd_ldiskfs]
osd_preload_next+0x8f/0xa0 [osd_ldiskfs]
osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs]
? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs]
? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs]
? _raw_spin_unlock_irqrestore+0xa/0x30
? prepare_to_wait_event+0x5d/0x180
osd_otable_it_next+0x1af/0x620 [osd_ldiskfs]
? __pfx_var_wake_function+0x10/0x10
lfsck_master_oit_engine+0x246/0x1140 [lfsck]
? prepare_to_wait_event+0x5d/0x180
lfsck_master_engine+0x1a2/0xb40 [lfsck]
? __schedule+0x212/0x550
? try_to_wake_up+0x22a/0x5b0
? __pfx_autoremove_wake_function+0x10/0x10
? __pfx_lfsck_master_engine+0x10/0x10 [lfsck]
kthread+0xe0/0x100
? __pfx_kthread+0x10/0x10
ret_from_fork+0x2c/0x50
</TASK>
Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common i2c_piix4 virtio_balloon pcspkr joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul libata crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey]
CR2: 0000000000000000
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162c fail_val=0
Lustre: 286350:0:(osd_internal.h:1333:osd_trans_exec_op()) lustre-MDT0000: opcode 8: before 260 < left 271, rollback = 9
Lustre: 286350:0:(osd_internal.h:1333:osd_trans_exec_op()) Skipped 58 previous similar messages
Lustre: 286350:0:(osd_handler.c:1969:osd_trans_dump_creds()) create: 3/12/0, destroy: 0/0/0
Lustre: 286350:0:(osd_handler.c:1969:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: 286350:0:(osd_handler.c:1976:osd_trans_dump_creds()) attr_set: 1/1/0, xattr_set: 6/403/0
Lustre: 286350:0:(osd_handler.c:1976:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: 286350:0:(osd_handler.c:1983:osd_trans_dump_creds()) write: 5/55/0, punch: 0/0/0, quota 1/3/0
Lustre: 286350:0:(osd_handler.c:1983:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: 286350:0:(osd_handler.c:1993:osd_trans_dump_creds()) insert: 13/271/0, delete: 0/0/0
Lustre: 286350:0:(osd_handler.c:1993:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: 286350:0:(osd_handler.c:2000:osd_trans_dump_creds()) ref_add: 7/7/0, ref_del: 0/0/0
Lustre: 286350:0:(osd_handler.c:2000:osd_trans_dump_creds()) Skipped 58 previous similar messages
Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0
Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A
Link to test