| Match messages in logs (every line would be required to be present in log output Copy from "Messages before crash" column below): | |
| Match messages in full crash (every line would be required to be present in crash log output Copy from "Full Crash" column below): | |
| Limit to a test: (Copy from below "Failing text"): | |
| Delete these reports as invalid (real bug in review or some such) | |
| Bug or comment: | |
| Extra info: |
| Failing Test | Full Crash | Messages before crash | Comment |
|---|---|---|---|
| sanity-lfsck test 23b: LFSCK can repair dangling name entry (2) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 Oops: 0000 [#1] PREEMPT SMP NOPTI CPU: 0 PID: 391116 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-570.62.1_lustre.ddn1.el9.x86_64 #1 Hardware name: Red Hat KVM/RHEL, BIOS 1.16.3-2.el9_5.1 04/01/2014 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ff29590889dafbf8 EFLAGS: 00010256 RAX: 0000000000000000 RBX: ff1e8430aec50000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ff1e8430b41e46e8 R08: 0000000000000000 R09: 0000000000000000 R10: ff1e8430ce1b0ca0 R11: 0000000000000000 R12: 0000000000004e5c R13: ff1e843097ef6880 R14: 0000000000000002 R15: ff1e84309f69c000 FS: 0000000000000000(0000) GS:ff1e8430fba00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000044e10002 CR4: 0000000000771ef0 PKRU: 55555554 Call Trace: <TASK> ? srso_alias_return_thunk+0x5/0xfbef5 ? show_trace_log_lvl+0x26e/0x2df ? show_trace_log_lvl+0x26e/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? osd_iit_iget+0x25f/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xc0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4f7/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x4a0 ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x2e7/0x1180 [lfsck] ? __wake_up_common+0x75/0xa0 ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 lfsck_master_engine+0x142/0x960 [lfsck] ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xdd/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x29/0x50 </TASK> Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) lraft(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace intel_rapl_msr fscache intel_rapl_common netfs kvm_amd ccp rfkill kvm iTCO_wdt iTCO_vendor_support pcspkr i2c_i801 virtio_balloon i2c_smbus lpc_ich joydev sunrpc drm fuse dm_mod ext4 mbcache jbd2 ahci crct10dif_pclmul libahci crc32_pclmul crc32c_intel libata ghash_clmulni_intel virtio_net net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=145 fail_loc=0x1621 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0 fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -C | Link to test |
| sanity-lfsck test 17: LFSCK can repair multiple references | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 Oops: 0000 [#1] PREEMPT SMP NOPTI CPU: 1 PID: 996629 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-570.62.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM/RHEL, BIOS 1.16.3-2.el9_5.1 04/01/2014 RIP: 0010:osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ff524f0ec7823c00 EFLAGS: 00010256 RAX: 0000000000000000 RBX: ff3ab9bd85700000 RCX: 0000000000000004 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ff3ab9bdc88dd8a0 R08: 0000000000000000 R09: 0000000000000000 R10: ff3ab9bdd45fb8a0 R11: 0000000000000002 R12: 0000000000004e47 R13: ff3ab9bd9a6bc628 R14: 0000000000000002 R15: ff3ab9bda4bcf000 FS: 0000000000000000(0000) GS:ff3ab9bdfbb00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000107a10004 CR4: 0000000000771ef0 PKRU: 55555554 Call Trace: <TASK> ? srso_alias_return_thunk+0x5/0xfbef5 ? show_trace_log_lvl+0x26e/0x2df ? show_trace_log_lvl+0x26e/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs] osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x501/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x4a0 ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x2e7/0x1180 [lfsck] ? __wake_up_common+0x75/0xa0 ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 lfsck_master_engine+0x142/0x960 [lfsck] ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xdd/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x29/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common kvm_amd ccp kvm iTCO_wdt iTCO_vendor_support i2c_i801 pcspkr i2c_smbus virtio_balloon lpc_ich joydev fuse drm dm_mod ext4 mbcache jbd2 crct10dif_pclmul crc32_pclmul crc32c_intel ahci libahci libata ghash_clmulni_intel virtio_net virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0 fail_loc=0x1614 Lustre: *** cfs_fail_loc=1614, val=0*** Lustre: Skipped 2 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 fail_val=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r | Link to test |
| sanity-lfsck test 17: LFSCK can repair multiple references | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 Oops: 0000 [#1] PREEMPT SMP NOPTI CPU: 0 PID: 339933 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.ddn1.el9.x86_64 #1 Hardware name: Red Hat KVM/RHEL, BIOS 1.16.3-2.el9_5.1 04/01/2014 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ff70f9b843013bf8 EFLAGS: 00010256 RAX: 0000000000000000 RBX: ff48fc860de18000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ff48fc86014f77b0 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e7c R13: ff48fc8605abc620 R14: 0000000000000002 R15: ff48fc86014f6000 FS: 0000000000000000(0000) GS:ff48fc863ba00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000163810006 CR4: 0000000000771ef0 PKRU: 55555554 Call Trace: <TASK> ? srso_alias_return_thunk+0x5/0xfbef5 ? show_trace_log_lvl+0x26e/0x2df ? show_trace_log_lvl+0x26e/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? osd_iit_iget+0x25f/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4f7/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x550 ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x2e7/0x1180 [lfsck] ? __wake_up_common+0x75/0xa0 ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 lfsck_master_engine+0x142/0x960 [lfsck] ? srso_alias_return_thunk+0x5/0xfbef5 ? srso_alias_return_thunk+0x5/0xfbef5 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xdd/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x29/0x50 </TASK> Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) lraft(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 intel_rapl_msr dns_resolver intel_rapl_common nfs kvm_amd lockd grace ccp fscache netfs kvm rfkill iTCO_wdt iTCO_vendor_support virtio_balloon i2c_i801 pcspkr lpc_ich i2c_smbus joydev sunrpc drm fuse ext4 mbcache jbd2 ahci libahci libata crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel virtio_net net_failover failover virtio_blk serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0 fail_loc=0x1614 Lustre: *** cfs_fail_loc=1614, val=0*** Lustre: Skipped 2 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 fail_val=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r | Link to test |
| sanity-lfsck test 29c: verify linkEA size limitation | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 335152 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffa5db00a1fc58 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8e219a9b0000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8e21b17217d8 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e5b R13: ffff8e21b7cd94c0 R14: 0000000000000002 R15: ffff8e21a6bf8000 FS: 0000000000000000(0000) GS:ffff8e21bbc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000103dc0003 CR4: 00000000003706f0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs] ? osd_iit_iget+0x1f0/0x4d0 [osd_ldiskfs] osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x504/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x550 ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata virtio_net crct10dif_pclmul crc32_pclmul crc32c_intel net_failover failover ghash_clmulni_intel virtio_blk serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n debug Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=ha Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=super+ioctl+neterror+warning+dlmtrace+error+emerg+ha+rpctrace+vfstrace+config+console+lfsck Lustre: 315954:0:(osd_internal.h:1339:osd_trans_exec_op()) lustre-MDT0000: opcode 2: before 260 < left 401, rollback = 2 Lustre: 315954:0:(osd_internal.h:1339:osd_trans_exec_op()) Skipped 19 previous similar messages Lustre: 315954:0:(osd_handler.c:2072:osd_trans_dump_creds()) create: 0/0/0, destroy: 0/0/0 Lustre: 315954:0:(osd_handler.c:2072:osd_trans_dump_creds()) Skipped 19 previous similar messages Lustre: 315954:0:(osd_handler.c:2079:osd_trans_dump_creds()) attr_set: 2/2/0, xattr_set: 6/401/0 Lustre: 315954:0:(osd_handler.c:2079:osd_trans_dump_creds()) Skipped 19 previous similar messages Lustre: 315954:0:(osd_handler.c:2086:osd_trans_dump_creds()) write: 5/21/0, punch: 0/0/0, quota 1/3/0 Lustre: 315954:0:(osd_handler.c:2086:osd_trans_dump_creds()) Skipped 19 previous similar messages Lustre: 315954:0:(osd_handler.c:2096:osd_trans_dump_creds()) insert: 2/72/0, delete: 2/2/0 Lustre: 315954:0:(osd_handler.c:2096:osd_trans_dump_creds()) Skipped 19 previous similar messages Lustre: 315954:0:(osd_handler.c:2103:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 1/1/0 Lustre: 315954:0:(osd_handler.c:2103:osd_trans_dump_creds()) Skipped 19 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 31e: Re-generate the lost slave LMV EA for striped directory (1) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 381603 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.ddn1.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb9738a657c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff9783b6c60000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff9783bb291710 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e69 R13: ffff9783b4fc3bc0 R14: 0000000000000002 R15: ffff9783d5f4b000 FS: 0000000000000000(0000) GS:ffff97843cd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 00000000b8610004 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x550 ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1150 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon pcspkr i2c_piix4 joydev drm fuse ext4 mbcache jbd2 virtio_net ata_generic crct10dif_pclmul crc32_pclmul net_failover crc32c_intel ata_piix ghash_clmulni_intel libata virtio_blk failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162a fail_val=0 Lustre: 349063:0:(osd_internal.h:1332:osd_trans_exec_op()) lustre-MDT0000: opcode 8: before 260 < left 271, rollback = 9 Lustre: 349063:0:(osd_internal.h:1332:osd_trans_exec_op()) Skipped 18 previous similar messages Lustre: 349063:0:(osd_handler.c:1962:osd_trans_dump_creds()) create: 3/12/0, destroy: 0/0/0 Lustre: 349063:0:(osd_handler.c:1962:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: 349063:0:(osd_handler.c:1969:osd_trans_dump_creds()) attr_set: 1/1/0, xattr_set: 5/314/0 Lustre: 349063:0:(osd_handler.c:1969:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: 349063:0:(osd_handler.c:1976:osd_trans_dump_creds()) write: 5/55/0, punch: 0/0/0, quota 1/3/0 Lustre: 349063:0:(osd_handler.c:1976:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: 349063:0:(osd_handler.c:1986:osd_trans_dump_creds()) insert: 13/271/0, delete: 0/0/0 Lustre: 349063:0:(osd_handler.c:1986:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: 349063:0:(osd_handler.c:1993:osd_trans_dump_creds()) ref_add: 7/7/0, ref_del: 0/0/0 Lustre: 349063:0:(osd_handler.c:1993:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 39: LFSCK does not break foreign dir and reverse is also true | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 359346 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb44980fc7c58 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff91a307028000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff91a3003c5760 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e87 R13: ffff91a3017ce360 R14: 0000000000000000 R15: ffff91a3003c6000 FS: 0000000000000000(0000) GS:ffff91a37fc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 00000000883ea006 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x23f/0x4d0 [osd_ldiskfs] ? osd_iit_iget+0x1f0/0x4d0 [osd_ldiskfs] osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x504/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x550 ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon i2c_piix4 pcspkr joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 31d: Set broken striped directory (modified after broken) as read-only | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 372885 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.40.1_lustre.ddn1.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb66b42dc7c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff9e8a87708000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff9e8a7b037350 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e5d R13: ffff9e8a891699a0 R14: 0000000000000002 R15: ffff9e8a81044000 FS: 0000000000000000(0000) GS:ffff9e8affc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000046eec006 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x550 ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill intel_rapl_msr intel_rapl_common i2c_piix4 virtio_balloon pcspkr joydev sunrpc drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1629 Lustre: 343023:0:(osd_internal.h:1331:osd_trans_exec_op()) lustre-MDT0000: opcode 8: before 260 < left 271, rollback = 9 Lustre: 343023:0:(osd_internal.h:1331:osd_trans_exec_op()) Skipped 18 previous similar messages Lustre: 343023:0:(osd_handler.c:1955:osd_trans_dump_creds()) create: 3/12/0, destroy: 0/0/0 Lustre: 343023:0:(osd_handler.c:1955:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: 343023:0:(osd_handler.c:1962:osd_trans_dump_creds()) attr_set: 1/1/0, xattr_set: 6/403/0 Lustre: 343023:0:(osd_handler.c:1962:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: 343023:0:(osd_handler.c:1969:osd_trans_dump_creds()) write: 5/55/0, punch: 0/0/0, quota 1/3/0 Lustre: 343023:0:(osd_handler.c:1969:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: 343023:0:(osd_handler.c:1979:osd_trans_dump_creds()) insert: 13/271/0, delete: 0/0/0 Lustre: 343023:0:(osd_handler.c:1979:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: 343023:0:(osd_handler.c:1986:osd_trans_dump_creds()) ref_add: 7/7/0, ref_del: 0/0/0 Lustre: 343023:0:(osd_handler.c:1986:osd_trans_dump_creds()) Skipped 18 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds1' ' /proc/mounts || true Lustre: DEBUG MARKER: umount -d /mnt/lustre-mds1 Lustre: Failing over lustre-MDT0000 Lustre: server umount lustre-MDT0000 complete LDISKFS-fs (dm-3): unmounting filesystem 834a400e-2536-4308-b1ee-d516827694eb. Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1; mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 LDISKFS-fs (dm-3): mounted filesystem 834a400e-2536-4308-b1ee-d516827694eb r/w with ordered data mode. Quota mode: journalled. Lustre: lustre-MDT0000: in recovery but waiting for the first client to connect Lustre: Skipped 1 previous similar message Lustre: lustre-MDT0000: Will be in recovery for at least 1:00, or until 4 clients reconnect Lustre: Skipped 1 previous similar message Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-24vm1.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-24vm1.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-24vm1.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-24vm1.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null Lustre: lustre-MDT0000: Denying connection for new client 6c5c174d-a1f2-4e4a-960f-9a7b6cb9afba (at 10.240.43.22@tcp), waiting for 4 known clients (1 recovered, 1 in progress, and 0 evicted) to recover in 1:06 Lustre: Skipped 3 previous similar messages Lustre: lustre-MDT0000: Recovery over after 0:05, of 4 clients 4 recovered and 0 were evicted. Lustre: Skipped 1 previous similar message Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 31h: Repair the corrupted shard's name entry | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 1525286 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.38.1_lustre.ddn1.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb04d03a37c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff9a9bef0a8000 RCX: 0000000000000004 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff9a9c19b640f8 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e71 R13: ffff9a9bc6864140 R14: 0000000000000002 R15: ffff9a9c1bf4d000 FS: 0000000000000000(0000) GS:ffff9a9c7fc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000090410006 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x550 ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rdma_ucm(OE) rdma_cm(OE) iw_cm(OE) ib_ipoib(OE) ib_cm(OE) ib_umad(OE) mlx5_ib(OE) mlx5_core(OE) mlxdevm(OE) ib_uverbs(OE) ib_core(OE) psample mlxfw(OE) mlx_compat(OE) macsec tls pci_hyperv_intf intel_rapl_msr intel_rapl_common rfkill pcspkr virtio_balloon i2c_piix4 joydev sunrpc drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul libata crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162c fail_val=0 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 18h: LFSCK can repair crashed PFL extent range | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 157797 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffa20ac836bc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8b6ef8c30000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8b6ef6505300 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e55 R13: ffff8b6ef25914c0 R14: 0000000000000002 R15: ffff8b6ef6504000 FS: 0000000000000000(0000) GS:ffff8b6f7fc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 00000000018b6006 CR4: 00000000001706f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? down_write+0xe/0x60 ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x223/0x550 ? try_to_wake_up+0x254/0x5d0 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common i2c_piix4 virtio_balloon pcspkr joydev fuse drm ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul crc32_pclmul crc32c_intel libata virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Link to test | |
| sanity-lfsck test 23c: LFSCK can repair dangling name entry (3) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 320275 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.ddn1.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb777490efc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff9e6d81db8000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff9e6d7002f760 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e45 R13: ffff9e6d46136360 R14: 0000000000000002 R15: ffff9e6d7002d000 FS: 0000000000000000(0000) GS:ffff9e6dffc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000075010004 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x223/0x550 ? try_to_wake_up+0x254/0x5d0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs intel_rapl_msr intel_rapl_common rfkill virtio_balloon pcspkr i2c_piix4 joydev sunrpc drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel libata virtio_blk virtio_net net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n debug Lustre: DEBUG MARKER: /usr/sbin/lctl set_param debug=-1 debug_mb=150 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=super+ioctl+neterror+warning+dlmtrace+error+emerg+ha+rpctrace+vfstrace+config+console+lfsck Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0x9f fail_loc=0x1621 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=0 fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=10 fail_loc=0x1602 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -C | Link to test |
| sanity-lfsck test 29c: verify linkEA size limitation | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 327356 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb81b8bc77c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff95ee414d8000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff95ee462b6760 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e62 R13: ffff95ee4502b200 R14: 0000000000000002 R15: ffff95ee337bf000 FS: 0000000000000000(0000) GS:ffff95eebfc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 000000009b610004 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x223/0x550 ? try_to_wake_up+0x3e2/0x5d0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon i2c_piix4 joydev pcspkr drm fuse ext4 mbcache jbd2 ata_generic crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ata_piix ghash_clmulni_intel libata virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n debug Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=0 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=super+ioctl+neterror+warning+dlmtrace+error+emerg+ha+rpctrace+vfstrace+config+console+lfsck Lustre: 310285:0:(osd_internal.h:1324:osd_trans_exec_op()) lustre-MDT0000: opcode 2: before 260 < left 401, rollback = 2 Lustre: 310285:0:(osd_internal.h:1324:osd_trans_exec_op()) Skipped 83 previous similar messages Lustre: 310285:0:(osd_handler.c:1961:osd_trans_dump_creds()) create: 0/0/0, destroy: 0/0/0 Lustre: 310285:0:(osd_handler.c:1961:osd_trans_dump_creds()) Skipped 83 previous similar messages Lustre: 310285:0:(osd_handler.c:1968:osd_trans_dump_creds()) attr_set: 2/2/0, xattr_set: 6/401/0 Lustre: 310285:0:(osd_handler.c:1968:osd_trans_dump_creds()) Skipped 83 previous similar messages Lustre: 310285:0:(osd_handler.c:1975:osd_trans_dump_creds()) write: 5/21/0, punch: 0/0/0, quota 1/3/0 Lustre: 310285:0:(osd_handler.c:1975:osd_trans_dump_creds()) Skipped 83 previous similar messages Lustre: 310285:0:(osd_handler.c:1985:osd_trans_dump_creds()) insert: 2/72/0, delete: 2/2/0 Lustre: 310285:0:(osd_handler.c:1985:osd_trans_dump_creds()) Skipped 83 previous similar messages Lustre: 310285:0:(osd_handler.c:1992:osd_trans_dump_creds()) ref_add: 1/1/0, ref_del: 1/1/0 Lustre: 310285:0:(osd_handler.c:1992:osd_trans_dump_creds()) Skipped 83 previous similar messages LustreError: 307371:0:(mdt_reint.c:2511:mdt_reint_migrate()) lustre-MDT0000: migrate [0x2400013a2:0x11:0x0]/f0 failed: rc = -75 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 18g: Find out orphan OST-object and repair it (7) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 157752 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-503.22.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb99787f37c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8dc43a278000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8dc441856300 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e81 R13: ffff8dc44889ed20 R14: 0000000000000000 R15: ffff8dc441854000 FS: 0000000000000000(0000) GS:ffff8dc4bfd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000031dfc005 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? __schedule+0x231/0x550 ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon i2c_piix4 pcspkr joydev fuse drm ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net net_failover ghash_clmulni_intel failover virtio_blk serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Link to test | |
| sanity-lfsck test 18b: Find out orphan OST-object and repair it (2) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 304609 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffff9a90c6953c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8b153e3a8000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 000000000000000a RDI: 0000000000000000 RBP: ffff8b15255d5648 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e64 R13: ffff8b15556dd9a0 R14: 0000000000000002 R15: ffff8b15255d4000 FS: 0000000000000000(0000) GS:ffff8b15bfc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000038410005 CR4: 00000000001706f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x223/0x550 ? try_to_wake_up+0x254/0x5d0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs intel_rapl_msr rfkill intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev sunrpc drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net libata ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1616 Lustre: *** cfs_fail_loc=1616, val=0*** Lustre: Skipped 3 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout --dryrun -o -r Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0000.lfsck_layout Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0000.lfsck_layout Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r -o | Link to test |
| sanity-lfsck test 18d: Find out orphan OST-object and repair it (4) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 153739 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.42.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb93180fc7c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff9e24984c0000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000005 RDI: 0000000000000000 RBP: ffff9e24bb362328 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e63 R13: ffff9e2483d65e80 R14: 0000000000000002 R15: ffff9e24bb367000 FS: 0000000000000000(0000) GS:ffff9e24bbd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000103736005 CR4: 00000000003706e0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x223/0x550 ? try_to_wake_up+0x254/0x5d0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon pcspkr i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net virtio_blk ghash_clmulni_intel net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds2' ' /proc/mounts || true Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds2 Lustre: lustre-MDT0001: Not available for connect from 10.240.24.214@tcp (stopping) Lustre: Skipped 26 previous similar messages Lustre: lustre-MDT0001 is waiting for obd_unlinked_exports more than 8 seconds. The obd refcount = 3. Is it stuck? LDISKFS-fs (dm-3): unmounting filesystem 710190c8-0742-4318-9b16-4684beb799e3. Lustre: server umount lustre-MDT0001 complete Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds4' ' /proc/mounts || true Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds4 LDISKFS-fs (dm-4): unmounting filesystem b53e1310-47a0-46db-9c03-488ef6a0f246. Lustre: server umount lustre-MDT0003 complete Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds2' ' /proc/mounts); Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds4' ' /proc/mounts); Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-96vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-96vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds2 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds2_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds2_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds2_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds2_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds2; mount -t lustre -o localrecov /dev/mapper/mds2_flakey /mnt/lustre-mds2 LDISKFS-fs (dm-3): mounted filesystem 710190c8-0742-4318-9b16-4684beb799e3 r/w with ordered data mode. Quota mode: journalled. Lustre: lustre-MDT0001: Imperative Recovery not enabled, recovery window 60-180 Lustre: Skipped 7 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: e2label /dev/mapper/mds2_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds2_flakey 2>/dev/null Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-96vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-96vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds4 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds4_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds4_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds4_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds4_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds4; mount -t lustre -o localrecov /dev/mapper/mds4_flakey /mnt/lustre-mds4 LDISKFS-fs (dm-4): mounted filesystem b53e1310-47a0-46db-9c03-488ef6a0f246 r/w with ordered data mode. Quota mode: journalled. Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-147vm15.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: e2label /dev/mapper/mds4_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds4_flakey 2>/dev/null Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-56vm7.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-56vm6.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: onyx-56vm6.onyx.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark Using TIMEOUT=20 Lustre: DEBUG MARKER: Using TIMEOUT=20 Lustre: DEBUG MARKER: [ -f /sys/module/mgc/parameters/mgc_requeue_timeout_min ] && echo 1 > /sys/module/mgc/parameters/mgc_requeue_timeout_min; exit 0 | Link to test |
| sanity-lfsck test 31h: Repair the corrupted shard's name entry | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 343430 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffff9ccb88cdfc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8d7889a40000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8d787b94e710 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e50 R13: ffff8d788c1b0b00 R14: 0000000000000002 R15: ffff8d7874c91000 FS: 0000000000000000(0000) GS:ffff8d78ffd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000070e10001 CR4: 00000000001706e0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic crct10dif_pclmul crc32_pclmul crc32c_intel ata_piix ghash_clmulni_intel virtio_net libata net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162c fail_val=0 Lustre: 313034:0:(osd_internal.h:1312:osd_trans_exec_op()) lustre-MDT0000: opcode 8: before 259 < left 271, rollback = 9 Lustre: 313034:0:(osd_internal.h:1312:osd_trans_exec_op()) Skipped 58 previous similar messages Lustre: 313034:0:(osd_handler.c:1949:osd_trans_dump_creds()) create: 3/12/1, destroy: 0/0/0 Lustre: 313034:0:(osd_handler.c:1949:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: 313034:0:(osd_handler.c:1956:osd_trans_dump_creds()) attr_set: 1/1/0, xattr_set: 6/403/0 Lustre: 313034:0:(osd_handler.c:1956:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: 313034:0:(osd_handler.c:1963:osd_trans_dump_creds()) write: 5/55/0, punch: 0/0/0, quota 1/3/0 Lustre: 313034:0:(osd_handler.c:1963:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: 313034:0:(osd_handler.c:1973:osd_trans_dump_creds()) insert: 13/271/0, delete: 0/0/0 Lustre: 313034:0:(osd_handler.c:1973:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: 313034:0:(osd_handler.c:1980:osd_trans_dump_creds()) ref_add: 7/7/0, ref_del: 0/0/0 Lustre: 313034:0:(osd_handler.c:1980:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 20b: Handle the orphan with dummy LOV EA slot properly - PFL case | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 317164 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffff9f53cbf13c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8de83a498000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8de842efd7b0 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e89 R13: ffff8de8564b8fe0 R14: 0000000000000002 R15: ffff8de83d580000 FS: 0000000000000000(0000) GS:ffff8de8bfd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 000000003d82a003 CR4: 00000000000606e0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover failover virtio_blk serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1616 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x161a Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_val=1 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 fail_val=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r -o | Link to test |
| sanity-lfsck test 28: Skip the failed MDT(s) when handle orphan MDT-objects | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 330643 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffff9f498a8fbc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8ad63ef38000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8ad6bc498800 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e54 R13: ffff8ad644c5cb00 R14: 0000000000000002 R15: ffff8ad647938000 FS: 0000000000000000(0000) GS:ffff8ad6bcc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 000000000381c002 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1624 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 36b: rebuild LOV EA for mirrored file (2) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 179178 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffbc3907457c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff941337900000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff94134d186300 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e62 R13: ffff9413357da840 R14: 0000000000000002 R15: ffff941333c19000 FS: 0000000000000000(0000) GS:ffff9413bfc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000067610006 CR4: 00000000003706f0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon pcspkr i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic crct10dif_pclmul crc32_pclmul crc32c_intel ata_piix libata virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Link to test | |
| sanity-lfsck test 2e: namespace LFSCK can verify remote object linkEA | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 219169 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffaf1d44c4bc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8daefd630000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 000000000000000a RDI: 0000000000000000 RBP: ffff8daef7a50698 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e79 R13: ffff8daefd08c140 R14: 0000000000000002 R15: ffff8daef679a000 FS: 0000000000000000(0000) GS:ffff8daf7fd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 000000000562e005 CR4: 00000000000606e0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs sunrpc intel_rapl_msr rfkill intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev fuse drm ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul libata crc32_pclmul crc32c_intel virtio_net virtio_blk net_failover ghash_clmulni_intel failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1603 Lustre: *** cfs_fail_loc=1603, val=0*** Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 18h: LFSCK can repair crashed PFL extent range | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 933992 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-427.31.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffb6bd85bcbc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff9c9d33d80000 RCX: 0000000000000004 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff9c9d171a6738 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e64 R13: ffff9c9d35a36840 R14: 0000000000000002 R15: ffff9c9d171a4000 FS: 0000000000000000(0000) GS:ffff9c9dbfd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 00000000ac010002 CR4: 00000000000606e0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ae/0x500 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0000.lfsck_layout | awk '/^status/ { print $2 }' lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x223/0x550 ? try_to_wake_up+0x254/0x5d0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs sunrpc rfkill intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover virtio_blk failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162f Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r -o | Link to test |
| conf-sanity test 61b: large xattr | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 993232 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffaa58870fbc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff9b622a790000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000005 RDI: 0000000000000000 RBP: ffff9b610d13e418 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 000000000000d061 R13: ffff9b6239693200 R14: 0000000000000002 R15: ffff9b622c58d000 FS: 0000000000000000(0000) GS:ffff9b623bc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 000000010361c002 CR4: 00000000003706f0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_flakey nfsv3 nfs_acl loop tls dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev fuse drm ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: libcfs] CR2: 0000000000000000 | Lustre: DEBUG MARKER: dumpe2fs -h /dev/mapper/mds1_flakey 2>&1 | Lustre: DEBUG MARKER: dumpe2fs -h /dev/mapper/mds1_flakey 2>&1 | Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1; mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 LDISKFS-fs (dm-4): mounted filesystem with ordered data mode. Quota mode: journalled. Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds3_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3; mount -t lustre -o localrecov /dev/mapper/mds3_flakey /mnt/lustre-mds3 LDISKFS-fs (dm-5): mounted filesystem with ordered data mode. Quota mode: journalled. Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm9.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm9.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount \(FULL\|IDLE\) osc.lustre-OST0000-osc-[-0-9a-f]\*.ost_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount \(FULL\|IDLE\) osc.lustre-OST0000-osc-[-0-9a-f]\*.ost_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds1' ' /proc/mounts || true Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds1 Lustre: server umount lustre-MDT0000 complete LDISKFS-fs (dm-4): unmounting filesystem. Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds3' ' /proc/mounts || true Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds3 LDISKFS-fs (dm-5): unmounting filesystem. Lustre: server umount lustre-MDT0002 complete Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: debugfs -R 'stat /ROOT/panda' /dev/mapper/mds1_flakey | grep trusted.big Lustre: DEBUG MARKER: debugfs -w -R "ln <167> /lost+found" /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1; mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 LDISKFS-fs (dm-4): mounted filesystem with ordered data mode. Quota mode: journalled. Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds3_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3; mount -t lustre -o localrecov /dev/mapper/mds3_flakey /mnt/lustre-mds3 LDISKFS-fs (dm-5): mounted filesystem with ordered data mode. Quota mode: journalled. Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm12.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm13.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0000-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0001-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0002-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-\*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount FULL mdc.lustre-MDT0003-mdc-*.mds_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm9.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: onyx-144vm9.onyx.whamcloud.com: executing set_default_debug -1 all Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount \(FULL\|IDLE\) osc.lustre-OST0000-osc-[-0-9a-f]\*.ost_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl mark onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount \(FULL\|IDLE\) osc.lustre-OST0000-osc-[-0-9a-f]\*.ost_server_uuid Lustre: DEBUG MARKER: onyx-144vm7.onyx.whamcloud.com: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Lustre: DEBUG MARKER: onyx-144vm8.onyx.whamcloud.com: executing wait_import_state_mount (FULL|IDLE) osc.lustre-OST0000-osc-[-0-9a-f]*.ost_server_uuid Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace | Link to test |
| sanity-lfsck test 28: Skip the failed MDT(s) when handle orphan MDT-objects | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 328180 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 37 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 14 48 98 48 c1 e0 04 48 01 c7 8b 45 08 RSP: 0018:ffffacaeca8ffc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8b917ebd0000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8b9186938788 R08: 0000000000000000 R09: 0000000000000000 R10: 0000000000000000 R11: 0000000000000018 R12: 0000000000004e78 R13: ffff8b91757a0b00 R14: 0000000000000002 R15: ffff8b9174120000 FS: 0000000000000000(0000) GS:ffff8b91ffd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000038010001 CR4: 00000000000606e0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2ab/0x4f0 [osd_ldiskfs] ? __getblk_gfp+0x28/0xd0 osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul crc32_pclmul crc32c_intel libata virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1624 Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |
| sanity-lfsck test 22b: LFSCK can repair unmatched pairs (2) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 321164 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2b8/0x510 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 38 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 15 48 98 48 c1 e0 04 48 01 c7 41 8b 46 RSP: 0018:ffffb63647137c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8b9980ef0000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 000000000000000a RDI: 0000000000000000 RBP: ffff8b9983f10000 R08: 0000000000000000 R09: 0000000000000000 R10: ffff8b99845fd4c0 R11: 0000000000000018 R12: 0000000000004e45 R13: 0000000000000002 R14: ffff8b9987d0d670 R15: 0000000000000000 FS: 0000000000000000(0000) GS:ffff8b99ffd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 000000001bc10006 CR4: 00000000000606e0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2b8/0x510 [osd_ldiskfs] osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs sunrpc intel_rapl_msr intel_rapl_common rfkill pcspkr virtio_balloon i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata crct10dif_pclmul crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel net_failover failover virtio_blk serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x161e Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -A -r | Link to test |
| sanity-lfsck test 18d: Find out orphan OST-object and repair it (4) | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 1 PID: 283224 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2b8/0x510 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 38 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 15 48 98 48 c1 e0 04 48 01 c7 41 8b 46 RSP: 0018:ffffa55006fabc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8fc0b69a8000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8fc0c1ae5000 R08: 0000000000000000 R09: 0000000000000000 R10: ffff8fc0cf386840 R11: 0000000000000018 R12: 0000000000004e8d R13: 0000000000000002 R14: ffff8fc0b329a7d8 R15: 0000000000000000 FS: 0000000000000000(0000) GS:ffff8fc13fd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000032a78006 CR4: 00000000000606e0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2b8/0x510 [osd_ldiskfs] osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill intel_rapl_msr intel_rapl_common i2c_piix4 sunrpc virtio_balloon pcspkr joydev drm fuse ext4 mbcache jbd2 ata_generic virtio_net ata_piix crct10dif_pclmul libata crc32_pclmul crc32c_intel net_failover ghash_clmulni_intel failover virtio_blk serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1618 Lustre: *** cfs_fail_loc=1618, val=0*** Lustre: Skipped 3 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds1' ' /proc/mounts || true Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds1 LustreError: lustre-MDT0000-osp-MDT0002: operation mds_statfs to node 0@lo failed: rc = -107 LustreError: Skipped 9 previous similar messages Lustre: server umount lustre-MDT0000 complete LDISKFS-fs (dm-3): unmounting filesystem. Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && Lustre: DEBUG MARKER: modprobe dm-flakey; LustreError: lustre-MDT0000: not available for connect from 10.240.39.30@tcp (no target). If you are running an HA pair check that the target is mounted on the other server. LustreError: Skipped 114 previous similar messages Lustre: DEBUG MARKER: grep -c /mnt/lustre-mds3' ' /proc/mounts || true Lustre: DEBUG MARKER: umount -d -f /mnt/lustre-mds3 LustreError: MGC10.240.39.29@tcp: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail LustreError: Skipped 3 previous similar messages Lustre: lustre-MDT0002 is waiting for obd_unlinked_exports more than 8 seconds. The obd refcount = 3. Is it stuck? LDISKFS-fs (dm-4): unmounting filesystem. Lustre: server umount lustre-MDT0002 complete Lustre: DEBUG MARKER: lsmod | grep lnet > /dev/null && Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds1' ' /proc/mounts); Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds3' ' /proc/mounts); Lustre: DEBUG MARKER: running=$(grep -c /mnt/lustre-mds1' ' /proc/mounts); Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds1_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds1; mount -t lustre -o localrecov /dev/mapper/mds1_flakey /mnt/lustre-mds1 LDISKFS-fs (dm-3): mounted filesystem with ordered data mode. Quota mode: journalled. Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds1_flakey 2>/dev/null Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm7.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm7.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3 Lustre: DEBUG MARKER: modprobe dm-flakey; Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey >/dev/null 2>&1 Lustre: DEBUG MARKER: dmsetup status /dev/mapper/mds3_flakey 2>&1 Lustre: DEBUG MARKER: test -b /dev/mapper/mds3_flakey Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey Lustre: DEBUG MARKER: mkdir -p /mnt/lustre-mds3; mount -t lustre -o localrecov /dev/mapper/mds3_flakey /mnt/lustre-mds3 LDISKFS-fs (dm-4): mounted filesystem with ordered data mode. Quota mode: journalled. Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n health_check Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lustre/tests/racer:/usr/lib64/lustre/../lustre-iokit/sgpdd-survey:/usr/lib64/lustre/tests:/usr/lib64/lustre/u Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm6.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}' Lustre: DEBUG MARKER: e2label /dev/mapper/mds3_flakey 2>/dev/null Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm7.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm7.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -P debug_raw_pointers=Y Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm3.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: /usr/sbin/lctl mark trevis-34vm2.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: trevis-34vm2.trevis.whamcloud.com: executing set_default_debug vfstrace rpctrace dlmtrace neterror ha config ioctl super lfsck all Lustre: DEBUG MARKER: lctl get_param -n timeout Lustre: DEBUG MARKER: /usr/sbin/lctl mark Using TIMEOUT=20 Lustre: DEBUG MARKER: Using TIMEOUT=20 Lustre: DEBUG MARKER: [ -f /sys/module/mgc/parameters/mgc_requeue_timeout_min ] && echo 1 > /sys/module/mgc/parameters/mgc_requeue_timeout_min; exit 0 Lustre: DEBUG MARKER: lctl dl | grep ' IN osc ' 2>/dev/null | wc -l Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -P lod.*.mdt_hash=crush Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t layout -r -o -c -d | Link to test |
| sanity-lfsck test 18f: Skip the failed OST(s) when handle orphan OST-objects | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 153929 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2b8/0x510 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 38 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 15 48 98 48 c1 e0 04 48 01 c7 41 8b 46 RSP: 0018:ffffb83887647c50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff8bedaf798000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff8bedbb667000 R08: 0000000000000000 R09: 0000000000000000 R10: ffff8bedcb014620 R11: 0000000000000018 R12: 0000000000004e79 R13: 0000000000000002 R14: ffff8bedbb664300 R15: 0000000000000000 FS: 0000000000000000(0000) GS:ffff8bee3fc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 0000000001bb8003 CR4: 00000000000606f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2b8/0x510 [osd_ldiskfs] osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? lfsck_layout_get_lovea+0x29/0xc0 [lfsck] ? down_write+0xe/0x60 ? lfsck_layout_master_exec_oit+0x464/0x1190 [lfsck] osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? lu_object_put+0x222/0x410 [obdclass] lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common virtio_balloon pcspkr i2c_piix4 joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix libata virtio_net crct10dif_pclmul crc32_pclmul crc32c_intel net_failover virtio_blk failover ghash_clmulni_intel serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x1616 Lustre: *** cfs_fail_loc=1616, val=0*** Lustre: Skipped 1 previous similar message Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0 Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0001.lfsck_layout | Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0003.lfsck_layout | Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdd.lustre-MDT0001.lfsck_layout | Link to test |
| sanity-lfsck test 31h: Repair the corrupted shard's name entry | BUG: kernel NULL pointer dereference, address: 0000000000000000 #PF: supervisor read access in kernel mode #PF: error_code(0x0000) - not-present page PGD 0 P4D 0 Oops: 0000 [#1] PREEMPT SMP PTI CPU: 0 PID: 315721 Comm: lfsck Kdump: loaded Tainted: G OE ------- --- 5.14.0-362.24.1_lustre.el9.x86_64 #1 Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011 RIP: 0010:osd_iit_iget+0x2b8/0x510 [osd_ldiskfs] Code: ff ff fd ff ff ff 48 01 f0 48 a9 fd ff ff ff 75 38 48 8b bb e8 57 00 00 31 c0 eb 07 83 c0 01 39 c8 74 0d 48 63 d0 48 c1 e2 04 <48> 39 34 17 75 ec 39 c1 76 15 48 98 48 c1 e0 04 48 01 c7 41 8b 46 RSP: 0018:ffffaeed0143fc50 EFLAGS: 00010246 RAX: 0000000000000000 RBX: ffff9850bd408000 RCX: 0000000000000003 RDX: 0000000000000000 RSI: 0000000200000003 RDI: 0000000000000000 RBP: ffff9850b8385000 R08: 0000000000000000 R09: 0000000000000000 R10: ffff9850863eb200 R11: 0000000000000018 R12: 0000000000004e47 R13: 0000000000000002 R14: ffff9850b8382788 R15: 0000000000000000 FS: 0000000000000000(0000) GS:ffff98513fc00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 000000009ea10003 CR4: 00000000001706f0 Call Trace: <TASK> ? show_trace_log_lvl+0x1c4/0x2df ? show_trace_log_lvl+0x1c4/0x2df ? osd_preload_next+0x8f/0xa0 [osd_ldiskfs] ? __die_body.cold+0x8/0xd ? page_fault_oops+0x134/0x170 ? kernelmode_fixup_or_oops+0x84/0x110 ? exc_page_fault+0x62/0x150 ? asm_exc_page_fault+0x22/0x30 ? osd_iit_iget+0x2b8/0x510 [osd_ldiskfs] osd_preload_next+0x8f/0xa0 [osd_ldiskfs] osd_inode_iteration+0x4fa/0xc40 [osd_ldiskfs] ? __pfx_osd_preload_next+0x10/0x10 [osd_ldiskfs] ? __pfx_osd_preload_exec+0x10/0x10 [osd_ldiskfs] ? _raw_spin_unlock_irqrestore+0xa/0x30 ? prepare_to_wait_event+0x5d/0x180 osd_otable_it_next+0x1af/0x620 [osd_ldiskfs] ? __pfx_var_wake_function+0x10/0x10 lfsck_master_oit_engine+0x246/0x1140 [lfsck] ? prepare_to_wait_event+0x5d/0x180 lfsck_master_engine+0x1a2/0xb40 [lfsck] ? __schedule+0x212/0x550 ? try_to_wake_up+0x22a/0x5b0 ? __pfx_autoremove_wake_function+0x10/0x10 ? __pfx_lfsck_master_engine+0x10/0x10 [lfsck] kthread+0xe0/0x100 ? __pfx_kthread+0x10/0x10 ret_from_fork+0x2c/0x50 </TASK> Modules linked in: dm_flakey tls osp(OE) mdd(OE) lod(OE) mdt(OE) lfsck(OE) mgs(OE) mgc(OE) osd_ldiskfs(OE) lquota(OE) lustre(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) ldiskfs(OE) libcfs(OE) dm_mod rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common i2c_piix4 virtio_balloon pcspkr joydev drm fuse ext4 mbcache jbd2 ata_generic ata_piix crct10dif_pclmul libata crc32_pclmul crc32c_intel virtio_net ghash_clmulni_intel virtio_blk net_failover failover serio_raw [last unloaded: dm_flakey] CR2: 0000000000000000 | Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x162c fail_val=0 Lustre: 286350:0:(osd_internal.h:1333:osd_trans_exec_op()) lustre-MDT0000: opcode 8: before 260 < left 271, rollback = 9 Lustre: 286350:0:(osd_internal.h:1333:osd_trans_exec_op()) Skipped 58 previous similar messages Lustre: 286350:0:(osd_handler.c:1969:osd_trans_dump_creds()) create: 3/12/0, destroy: 0/0/0 Lustre: 286350:0:(osd_handler.c:1969:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: 286350:0:(osd_handler.c:1976:osd_trans_dump_creds()) attr_set: 1/1/0, xattr_set: 6/403/0 Lustre: 286350:0:(osd_handler.c:1976:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: 286350:0:(osd_handler.c:1983:osd_trans_dump_creds()) write: 5/55/0, punch: 0/0/0, quota 1/3/0 Lustre: 286350:0:(osd_handler.c:1983:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: 286350:0:(osd_handler.c:1993:osd_trans_dump_creds()) insert: 13/271/0, delete: 0/0/0 Lustre: 286350:0:(osd_handler.c:1993:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: 286350:0:(osd_handler.c:2000:osd_trans_dump_creds()) ref_add: 7/7/0, ref_del: 0/0/0 Lustre: 286350:0:(osd_handler.c:2000:osd_trans_dump_creds()) Skipped 58 previous similar messages Lustre: DEBUG MARKER: /usr/sbin/lctl set_param fail_loc=0x0 fail_val=0 Lustre: DEBUG MARKER: /usr/sbin/lctl lfsck_start -M lustre-MDT0000 -t namespace -r -A | Link to test |