Editing crashreport #68437

ReasonCrashing FunctionWhere to cut BacktraceReports Count
BUG: unable to handle kernel paging request__pv_queued_spin_lock_slowpathdo_raw_spin_lock
_raw_spin_lock
_debug_req
brw_interpret
ptlrpc_check_set
ptlrpcd
kthread
9

Added fields:

Match messages in logs
(every line would be required to be present in log output
Copy from "Messages before crash" column below):
Match messages in full crash
(every line would be required to be present in crash log output
Copy from "Full Crash" column below):
Limit to a test:
(Copy from below "Failing text"):
Delete these reports as invalid (real bug in review or some such)
Bug or comment:
Extra info:

Failures list (last 100):

Failing TestFull CrashMessages before crashComment
sanity test 118k: bio alloc -ENOMEM and IO TERM handling
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31beca063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: dm_flakey dm_mod lustre(OE) osp(OE) ofd(OE) lod(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) ext4 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common virtio_console pcspkr virtio_balloon i2c_piix4 ip_tables rpcsec_gss_krb5 drm_kms_helper ttm ata_generic pata_acpi drm ata_piix drm_panel_orientation_quirks floppy serio_raw libata i2c_core virtio_blk [last unloaded: libcfs]
CPU: 1 PID: 3667 Comm: ptlrpcd_00_15 Kdump: loaded Tainted: P OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff8802ec4024f0 ti: ffff8802e3ce0000 task.ti: ffff8802e3ce0000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff8802e3ce3930 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff88023f66e470 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff8802e3ce3970 R08: 0000000000000000 R09: ffff8802e3ce3b38
R10: 0000000000000000 R11: 0000000000000001 R12: ffff880331a5bf00
R13: ffffffff81dbbf10 R14: ffff880331a5bf44 R15: 0000000000090000
FS: 0000000000000000(0000) GS:ffff880331a40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 00000002e6a0e000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa068b52b>] _debug_req+0x8b/0x8f0 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff81068309>] ? kvm_sched_clock_read+0x9/0x20
[<ffffffff81035ec9>] ? sched_clock+0x9/0x10
[<ffffffff8110ae0a>] ? __pv_queued_spin_unlock_slowpath+0x5a/0xa0
[<ffffffff817e3277>] ? _raw_spin_unlock_irqrestore+0x17/0x30
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa06773f6>] ? ptlrpc_set_add_new_req+0xe6/0x160 [ptlrpc]
[<ffffffffa06ac2a4>] ? ptlrpcd_add_req+0x164/0x440 [ptlrpc]
[<ffffffffa0990fb6>] brw_interpret+0xdb6/0xe70 [osc]
[<ffffffffa067ab78>] ptlrpc_check_set+0x3d8/0x2220 [ptlrpc]
[<ffffffffa06ad0f4>] ptlrpcd+0x9c4/0xa80 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa06ac730>] ? ptlrpcd_ctl_init+0x1b0/0x1b0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
Link to test
sanity test 77f: repeat checksum error on write (expect error)
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31be3d063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: dm_flakey dm_mod lustre(OE) osp(OE) ofd(OE) lod(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) ext4 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common virtio_console i2c_piix4 virtio_balloon pcspkr ip_tables rpcsec_gss_krb5 ata_generic drm_kms_helper pata_acpi ttm drm drm_panel_orientation_quirks ata_piix serio_raw libata virtio_blk floppy i2c_core [last unloaded: libcfs]
CPU: 7 PID: 12976 Comm: ptlrpcd_00_02 Kdump: loaded Tainted: P OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff88031d929280 ti: ffff8802d8a5c000 task.ti: ffff8802d8a5c000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff8802d8a5f930 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff8800a2d38570 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff8802d8a5f970 R08: 0000000000000000 R09: ffffffff810baff0
R10: 0000000000000000 R11: 000000000000000f R12: ffff880331bdbf00
R13: ffffffff81dbbf10 R14: ffff880331bdbf44 R15: 0000000000390000
FS: 0000000000000000(0000) GS:ffff880331bc0000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 00000000b9026000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa064452b>] _debug_req+0x8b/0x8f0 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
[<ffffffff810d0100>] ? try_to_wake_up+0x170/0x390
[<ffffffff8110ae0a>] ? __pv_queued_spin_unlock_slowpath+0x5a/0xa0
[<ffffffff810d03f2>] ? default_wake_function+0x12/0x20
[<ffffffff810bb001>] ? woken_wake_function+0x11/0x20
[<ffffffff810c8510>] ? __wake_up_common+0x70/0x100
[<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa06303f6>] ? ptlrpc_set_add_new_req+0xe6/0x160 [ptlrpc]
[<ffffffffa06652a4>] ? ptlrpcd_add_req+0x164/0x440 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa0989fb6>] brw_interpret+0xdb6/0xe70 [osc]
[<ffffffffa0633b78>] ptlrpc_check_set+0x3d8/0x2220 [ptlrpc]
[<ffffffffa06660f4>] ptlrpcd+0x9c4/0xa80 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa0665730>] ? ptlrpcd_ctl_init+0x1b0/0x1b0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
Lustre: DEBUG MARKER: set checksum type to crc32, rc = 0
LustreError: lustre-OST0002: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: lustre-OST0003-osc-ffff8800a9864138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x4342:0x0] object 0x300000401:24285 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: lustre-OST0002: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0002-osc-ffff8800a9864138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0002: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: Skipped 3 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 19 previous similar messages
LustreError: lustre-OST0002-osc-ffff8800a9864138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: Skipped 3 previous similar messages
LustreError: 12987:0:(osc_request.c:2439:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8802e5aac640 x1831258367072768/t12884927795(12884927795) o4->lustre-OST0002-osc-ffff8800a9864138@0@lo:6/4 lens 488/448 e 0 to 0 dl 1746426151 ref 2 fl Interpret:RMQU/600/0 rc 0/0 job:'ptlrpcd_00_11.0' uid:0 gid:0 projid:0
LustreError: 12987:0:(osc_request.c:2439:osc_brw_redo_request()) Skipped 9 previous similar messages
Lustre: lustre-OST0000-osc-ffff8800a9864138: disconnect after 20s idle
Lustre: Skipped 1 previous similar message
LustreError: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x4342:0x0] object 0x300000401:24285 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0002-osc-ffff8800a9864138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x4342:0x0] object 0x300000401:24285 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: Skipped 5 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 19 previous similar messages
LustreError: lustre-OST0002-osc-ffff8800a9864138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: Skipped 5 previous similar messages
LustreError: 12975:0:(osc_request.c:2439:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8800a85bb740 x1831258367087872/t12884927800(12884927800) o4->lustre-OST0002-osc-ffff8800a9864138@0@lo:6/4 lens 488/448 e 0 to 0 dl 1746426186 ref 2 fl Interpret:RMQU/600/0 rc 0/0 job:'ptlrpcd_00_01.0' uid:0 gid:0 projid:0
LustreError: 12975:0:(osc_request.c:2439:osc_brw_redo_request()) Skipped 9 previous similar messages
LustreError: 12978:0:(osc_request.c:2594:brw_interpret()) lustre-OST0002-osc-ffff8800a9864138: too many resent retries for object: 11811161089:24189: rc = -11
LustreError: 12978:0:(osc_request.c:2594:brw_interpret()) Skipped 1 previous similar message
Lustre: DEBUG MARKER: set checksum type to adler, rc = 0
LustreError: lustre-OST0002: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303]: client csum 7d33b9a0, server csum 7d33b99f
LustreError: Skipped 15 previous similar messages
LustreError: lustre-OST0002-osc-ffff8800a9864138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x4342:0x0] object 0x2c0000401:24189 extent [0-4194303], original client csum 7d33b9a0 (type 2), server csum 7d33b99f (type 2), client csum now 7d33b9a0
LustreError: Skipped 15 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 47 previous similar messages
LustreError: 12984:0:(osc_request.c:2594:brw_interpret()) lustre-OST0002-osc-ffff8800a9864138: too many resent retries for object: 11811161089:24189: rc = -11
Lustre: DEBUG MARKER: set checksum type to crc32c, rc = 0
LustreError: 12976:0:(osc_request.c:2439:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8800a2d3ad40 x1831258367126400/t12884927813(12884927813) o4->lustre-OST0002-osc-ffff8800a9864138@0@lo:6/4 lens 488/448 e 0 to 0 dl 1746426254 ref 2 fl Interpret:RMQU/600/0 rc 0/0 job:'directio.0' uid:0 gid:0 projid:0
LustreError: 12976:0:(osc_request.c:2439:osc_brw_redo_request()) Skipped 21 previous similar messages
Link to test
sanity test 77f: repeat checksum error on write (expect error)
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31bf22063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: dm_flakey dm_mod lustre(OE) osp(OE) ofd(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) ext4 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common i2c_piix4 pcspkr virtio_console virtio_balloon ip_tables rpcsec_gss_krb5 ata_generic pata_acpi drm_kms_helper ttm drm ata_piix drm_panel_orientation_quirks serio_raw virtio_blk libata i2c_core floppy [last unloaded: libcfs]
CPU: 4 PID: 13082 Comm: ptlrpcd_00_00 Kdump: loaded Tainted: P OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff88031b318010 ti: ffff8800a7e58000 task.ti: ffff8800a7e58000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff8800a7e5b940 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff8802e77dda70 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff8800a7e5b980 R08: ffffffff81813740 R09: ffffffff810baff0
R10: 0000000000000000 R11: 0000000000000400 R12: ffff880331b1bf00
R13: ffffffff81dbbf10 R14: ffff880331b1bf44 R15: 0000000000210000
FS: 0000000000000000(0000) GS:ffff880331b00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 0000000001c10000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa06a0720>] _debug_req+0x80/0x8b0 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
[<ffffffff810d0100>] ? try_to_wake_up+0x170/0x390
[<ffffffff810d03f2>] ? default_wake_function+0x12/0x20
[<ffffffff810bb001>] ? woken_wake_function+0x11/0x20
[<ffffffff810c8510>] ? __wake_up_common+0x70/0x100
[<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa06c0b40>] ? ptlrpcd_add_req+0x160/0x3f0 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa0942546>] brw_interpret+0xdb6/0xe70 [osc]
[<ffffffffa0690bf8>] ptlrpc_check_set+0x3d8/0x2220 [ptlrpc]
[<ffffffffa06c1a24>] ptlrpcd+0xaa4/0xb80 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa06c0f80>] ? ptlrpcd_ctl_init+0x1b0/0x1b0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
Lustre: DEBUG MARKER: set checksum type to crc32, rc = 0
LustreError: lustre-OST0000: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x44f6:0x0] object 0x240000bd0:24978 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: lustre-OST0000-osc-ffff8802b74a8958: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x44f6:0x0] object 0x240000bd0:24978 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: lustre-OST0001: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x44f6:0x0] object 0x280000401:24659 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: Skipped 3 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 12 previous similar messages
LustreError: lustre-OST0001-osc-ffff8802b74a8958: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x44f6:0x0] object 0x280000401:24659 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: Skipped 3 previous similar messages
LustreError: 13086:0:(osc_request.c:2450:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8802e1fac640 x1816933878022784/t12884928466(12884928466) o4->lustre-OST0001-osc-ffff8802b74a8958@0@lo:6/4 lens 488/448 e 0 to 0 dl 1732764972 ref 2 fl Interpret:RMQU/200/0 rc 0/0 job:'ptlrpcd_00_04.0' uid:0 gid:0
LustreError: 13086:0:(osc_request.c:2450:osc_brw_redo_request()) Skipped 6 previous similar messages
LustreError: lustre-OST0001: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x44f6:0x0] object 0x280000401:24659 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0001-osc-ffff8802b74a8958: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x44f6:0x0] object 0x280000401:24659 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: Skipped 3 previous similar messages
Lustre: lustre-OST0002-osc-ffff8802b74a8958: disconnect after 20s idle
Lustre: Skipped 3 previous similar messages
LustreError: lustre-OST0001: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x44f6:0x0] object 0x280000401:24659 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0001-osc-ffff8802b74a8958: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x44f6:0x0] object 0x280000401:24659 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0001: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x44f6:0x0] object 0x280000401:24659 extent [0-4194303]: client csum 8b19b060, server csum 8b19b05f
LustreError: Skipped 3 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 23 previous similar messages
LustreError: lustre-OST0000-osc-ffff8802b74a8958: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x44f6:0x0] object 0x240000bd0:24978 extent [0-4194303], original client csum 8b19b060 (type 1), server csum 8b19b05f (type 1), client csum now 8b19b060
LustreError: Skipped 3 previous similar messages
LustreError: 13097:0:(osc_request.c:2450:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8802e898e940 x1816933878036608/t21474863022(21474863022) o4->lustre-OST0000-osc-ffff8802b74a8958@0@lo:6/4 lens 488/448 e 0 to 0 dl 1732765006 ref 2 fl Interpret:RMQU/200/0 rc 0/0 job:'ptlrpcd_00_13.0' uid:0 gid:0
LustreError: 13097:0:(osc_request.c:2450:osc_brw_redo_request()) Skipped 11 previous similar messages
LustreError: 13087:0:(osc_request.c:2605:brw_interpret()) lustre-OST0001-osc-ffff8802b74a8958: too many resent retries for object: 10737419265:24659: rc = -11
Lustre: DEBUG MARKER: set checksum type to adler, rc = 0
LustreError: lustre-OST0000: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x44f6:0x0] object 0x240000bd0:24978 extent [0-4194303]: client csum 7d33b9a0, server csum 7d33b99f
LustreError: Skipped 16 previous similar messages
LustreError: lustre-OST0001-osc-ffff8802b74a8958: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200001b71:0x44f6:0x0] object 0x280000401:24659 extent [0-4194303], original client csum 7d33b9a0 (type 2), server csum 7d33b99f (type 2), client csum now 7d33b9a0
LustreError: Skipped 15 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 47 previous similar messages
LustreError: 13087:0:(osc_request.c:2450:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8802f6d40540 x1816933878068736/t21474863034(21474863034) o4->lustre-OST0000-osc-ffff8802b74a8958@0@lo:6/4 lens 488/448 e 0 to 0 dl 1732765071 ref 2 fl Interpret:RMQU/200/0 rc 0/0 job:'ptlrpcd_00_03.0' uid:0 gid:0
LustreError: 13087:0:(osc_request.c:2450:osc_brw_redo_request()) Skipped 21 previous similar messages
LustreError: 13091:0:(osc_request.c:2605:brw_interpret()) lustre-OST0001-osc-ffff8802b74a8958: too many resent retries for object: 10737419265:24659: rc = -11
LustreError: 13091:0:(osc_request.c:2605:brw_interpret()) Skipped 1 previous similar message
Lustre: DEBUG MARKER: set checksum type to crc32c, rc = 0
Link to test
sanity test 77f: repeat checksum error on write (expect error)
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31bfbf063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: dm_flakey dm_mod lustre(OE) osp(OE) ofd(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) nfsd ext4 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common virtio_balloon i2c_piix4 virtio_console pcspkr ip_tables rpcsec_gss_krb5 drm_kms_helper ata_generic pata_acpi ttm drm ata_piix drm_panel_orientation_quirks serio_raw floppy virtio_blk libata i2c_core [last unloaded: libcfs]
CPU: 9 PID: 16847 Comm: ptlrpcd_04_00 Kdump: loaded Tainted: P W OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff880324d45c40 ti: ffff880325408000 task.ti: ffff880325408000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff88032540b940 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff88009e241e70 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff88032540b980 R08: ffffffff818148c0 R09: ffffffff821c2c00
R10: 0000000000000000 R11: 0000000000000400 R12: ffff880331c5bf00
R13: ffffffff81dbbf10 R14: ffff880331c5bf44 R15: 0000000000490000
FS: 0000000000000000(0000) GS:ffff880331c40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 00000003205b2000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa0623960>] _debug_req+0x80/0x8b0 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff81068309>] ? kvm_sched_clock_read+0x9/0x20
[<ffffffff81035ec9>] ? sched_clock+0x9/0x10
[<ffffffff8110ae0a>] ? __pv_queued_spin_unlock_slowpath+0x5a/0xa0
[<ffffffff817e3277>] ? _raw_spin_unlock_irqrestore+0x17/0x30
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa0643d60>] ? ptlrpcd_add_req+0x160/0x3f0 [ptlrpc]
[<ffffffffa0960546>] brw_interpret+0xdb6/0xe70 [osc]
[<ffffffffa0613d28>] ptlrpc_check_set+0x3d8/0x2220 [ptlrpc]
[<ffffffffa0644c44>] ptlrpcd+0xaa4/0xb80 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa06441a0>] ? ptlrpcd_ctl_init+0x1b0/0x1b0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
Lustre: DEBUG MARKER: set checksum type to adler, rc = 0
LustreError: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000032e1:0x3767:0x0] object 0x380000405:19387 extent [0-4194303]: client csum 7d33b9a0, server csum 7d33b99f
LustreError: lustre-OST0003-osc-ffff8801d41bc138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x2000032e1:0x3767:0x0] object 0x380000405:19387 extent [0-4194303], original client csum 7d33b9a0 (type 2), server csum 7d33b99f (type 2), client csum now 7d33b9a0
LustreError: lustre-OST0000: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000032e1:0x3767:0x0] object 0x2c0000bd0:19772 extent [0-4194303]: client csum 7d33b9a0, server csum 7d33b99f
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0000-osc-ffff8801d41bc138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x2000032e1:0x3767:0x0] object 0x2c0000bd0:19772 extent [0-4194303], original client csum 7d33b9a0 (type 2), server csum 7d33b99f (type 2), client csum now 7d33b9a0
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0000: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000032e1:0x3767:0x0] object 0x2c0000bd0:19772 extent [0-4194303]: client csum 7d33b9a0, server csum 7d33b99f
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0000-osc-ffff8801d41bc138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x2000032e1:0x3767:0x0] object 0x2c0000bd0:19772 extent [0-4194303], original client csum 7d33b9a0 (type 2), server csum 7d33b99f (type 2), client csum now 7d33b9a0
LustreError: Skipped 3 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 21 previous similar messages
LustreError: 16844:0:(osc_request.c:2449:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff880085a94b40 x1816165379868416/t12884929587(12884929587) o4->lustre-OST0003-osc-ffff8801d41bc138@0@lo:6/4 lens 488/448 e 0 to 0 dl 1732032384 ref 2 fl Interpret:RMQU/200/0 rc 0/0 job:'ptlrpcd_02_01.0' uid:0 gid:0
LustreError: 16844:0:(osc_request.c:2449:osc_brw_redo_request()) Skipped 10 previous similar messages
LustreError: lustre-OST0000: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000032e1:0x3767:0x0] object 0x2c0000bd0:19772 extent [0-4194303]: client csum 7d33b9a0, server csum 7d33b99f
LustreError: Skipped 3 previous similar messages
LustreError: lustre-OST0000-osc-ffff8801d41bc138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x2000032e1:0x3767:0x0] object 0x2c0000bd0:19772 extent [0-4194303], original client csum 7d33b9a0 (type 2), server csum 7d33b99f (type 2), client csum now 7d33b9a0
LustreError: Skipped 3 previous similar messages
Lustre: lustre-OST0001-osc-ffff8801d41bc138: disconnect after 24s idle
Lustre: Skipped 3 previous similar messages
LustreError: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000032e1:0x3767:0x0] object 0x380000405:19387 extent [0-4194303]: client csum 7d33b9a0, server csum 7d33b99f
LustreError: Skipped 5 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 17 previous similar messages
LustreError: lustre-OST0003-osc-ffff8801d41bc138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x2000032e1:0x3767:0x0] object 0x380000405:19387 extent [0-4194303], original client csum 7d33b9a0 (type 2), server csum 7d33b99f (type 2), client csum now 7d33b9a0
LustreError: Skipped 5 previous similar messages
LustreError: 16843:0:(osc_request.c:2449:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff88004dcfc640 x1816165379914368/t12884929592(12884929592) o4->lustre-OST0003-osc-ffff8801d41bc138@0@lo:6/4 lens 488/448 e 0 to 0 dl 1732032419 ref 2 fl Interpret:RMQU/200/0 rc 0/0 job:'ptlrpcd_02_00.0' uid:0 gid:0
LustreError: 16843:0:(osc_request.c:2449:osc_brw_redo_request()) Skipped 8 previous similar messages
LustreError: 16843:0:(osc_request.c:2604:brw_interpret()) lustre-OST0003-osc-ffff8801d41bc138: too many resent retries for object: 15032386565:19387: rc = -11
LustreError: 16849:0:(osc_request.c:2604:brw_interpret()) lustre-OST0000-osc-ffff8801d41bc138: too many resent retries for object: 11811163088:19772: rc = -11
Lustre: DEBUG MARKER: set checksum type to crc32c, rc = 0
LustreError: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000032e1:0x3767:0x0] object 0x380000405:19387 extent [0-4194303]: client csum de2bf73f, server csum de2bf73e
LustreError: Skipped 16 previous similar messages
LustreError: lustre-OST0003-osc-ffff8801d41bc138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x2000032e1:0x3767:0x0] object 0x380000405:19387 extent [0-4194303], original client csum de2bf73f (type 4), server csum de2bf73e (type 4), client csum now de2bf73f
LustreError: Skipped 16 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 47 previous similar messages
LustreError: 16848:0:(osc_request.c:2604:brw_interpret()) lustre-OST0000-osc-ffff8801d41bc138: too many resent retries for object: 11811163088:19772: rc = -11
Lustre: DEBUG MARKER: set checksum type to t10ip512, rc = 0
LustreError: 16848:0:(osc_request.c:2449:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff88022b6d8040 x1816165380026368/t12884929605(12884929605) o4->lustre-OST0003-osc-ffff8801d41bc138@0@lo:6/4 lens 488/448 e 0 to 0 dl 1732032487 ref 2 fl Interpret:RMQU/200/0 rc 0/0 job:'directio.0' uid:0 gid:0
LustreError: 16848:0:(osc_request.c:2449:osc_brw_redo_request()) Skipped 21 previous similar messages
LustreError: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000032e1:0x3767:0x0] object 0x380000405:19387 extent [0-4194303]: client csum 3a674101, server csum 3a674100
LustreError: Skipped 24 previous similar messages
LustreError: lustre-OST0003-osc-ffff8801d41bc138: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x2000032e1:0x3767:0x0] object 0x380000405:19387 extent [0-4194303], original client csum 3a674101 (type 10), server csum 3a674100 (type 10), client csum now 3a674101
LustreError: Skipped 24 previous similar messages
LustreError: 16848:0:(osc_request.c:2604:brw_interpret()) lustre-OST0003-osc-ffff8801d41bc138: too many resent retries for object: 15032386565:19387: rc = -11
LustreError: 16848:0:(osc_request.c:2604:brw_interpret()) Skipped 1 previous similar message
Lustre: DEBUG MARKER: set checksum type to t10ip4K, rc = 0
Link to test
sanity test 77b: checksum error on client write, read
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31bf7a063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: lustre(OE) osp(OE) ofd(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_zfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) ext4 mbcache jbd2 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) crc32_generic crc_t10dif crct10dif_generic crct10dif_common virtio_balloon virtio_console i2c_piix4 pcspkr ip_tables rpcsec_gss_krb5 ata_generic pata_acpi drm_kms_helper ttm drm drm_panel_orientation_quirks ata_piix serio_raw libata i2c_core floppy virtio_blk [last unloaded: libcfs]
CPU: 9 PID: 21267 Comm: ptlrpcd_04_00 Kdump: loaded Tainted: P OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff88029946c9d0 ti: ffff88027c470000 task.ti: ffff88027c470000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff88027c473948 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff88028c986970 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff88027c473988 R08: 0000000000000000 R09: ffffffff810baff0
R10: 0000000000000000 R11: 000000000000000f R12: ffff880331c5bf00
R13: ffffffff81dbbf10 R14: ffff880331c5bf44 R15: 0000000000490000
FS: 0000000000000000(0000) GS:ffff880331c40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 0000000292398000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa0623760>] _debug_req+0x80/0x8b0 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
[<ffffffff810d0100>] ? try_to_wake_up+0x170/0x390
[<ffffffff8110ae0a>] ? __pv_queued_spin_unlock_slowpath+0x5a/0xa0
[<ffffffff810d03f2>] ? default_wake_function+0x12/0x20
[<ffffffff810bb001>] ? woken_wake_function+0x11/0x20
[<ffffffff810c8510>] ? __wake_up_common+0x70/0x100
[<ffffffff814119f9>] ? do_raw_spin_unlock+0x49/0x90
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa0643a80>] ? ptlrpcd_add_req+0x160/0x3f0 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa09684f6>] brw_interpret+0xd66/0xe20 [osc]
[<ffffffffa0613d28>] ptlrpc_check_set+0x3d8/0x2220 [ptlrpc]
[<ffffffffa0644964>] ptlrpcd+0xaa4/0xb80 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa0643ec0>] ? ptlrpcd_ctl_init+0x1b0/0x1b0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
Lustre: *** cfs_fail_loc=409, val=0***
LustreError: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000013a1:0x5190:0x0] object 0x300000bd0:27632 extent [0-1048575]: client csum 896c0727, server csum 896c0726
LustreError: lustre-OST0003-osc-ffff8803240a2e98: BAD WRITE CHECKSUM: changed on the client after we checksummed it - likely false positive due to mmap IO (bug 11742): from 0@lo inode [0x2000013a1:0x5190:0x0] object 0x300000bd0:27632 extent [0-1048575], original client csum 896c0727 (type 20), server csum 896c0726 (type 20), client csum now 896c0726
LustreError: 21268:0:(osc_request.c:2449:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8802c9b5ee40 x1814444697697280/t17179900914(17179900914) o4->lustre-OST0003-osc-ffff8803240a2e98@0@lo:6/4 lens 488/448 e 0 to 0 dl 1730390611 ref 3 fl Interpret:RQU/204/0 rc 0/0 job:'dd.0' uid:0 gid:0
Lustre: DEBUG MARKER: set checksum type to crc32, rc = 0
Lustre: *** cfs_fail_loc=408, val=0***
LustreError: lustre-OST0003-osc-ffff8803240a2e98: BAD READ CHECKSUM: from 0@lo inode [0x2000013a1:0x5190:0x0] object 0x300000bd0:27632 extent [0-1048575], client 4c063186/4c063186, server 49c97eab, cksum_type 1
LustreError: 21267:0:(osc_request.c:2449:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff880260488f40 x1814444697701504/t0(0) o3->lustre-OST0003-osc-ffff8803240a2e98@0@lo:6/4 lens 488/440 e 0 to 0 dl 1730390612 ref 2 fl Interpret:RMQU/200/0 rc 1048576/1048576 job:'cmp.0' uid:0 gid:0
LustreError: lustre-OST0003: BAD READ CHECKSUM: should have changed on the client or in transit: from 0@lo inode [0x2000013a1:0x5190:0x0] object 0x300000bd0:27632 extent [0-1048575], client returned csum 4c063186 (type 1), server csum 49c97eab (type 1)
Link to test
sanity test 77d: checksum error on OST direct write, read
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31b81d063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: lustre(OE) osp(OE) ofd(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_zfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) nfsd ext4 mbcache jbd2 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) crc32_generic crc_t10dif crct10dif_generic crct10dif_common pcspkr virtio_balloon virtio_console i2c_piix4 ip_tables rpcsec_gss_krb5 ata_generic drm_kms_helper pata_acpi ttm drm drm_panel_orientation_quirks ata_piix i2c_core virtio_blk serio_raw libata floppy [last unloaded: libcfs]
CPU: 13 PID: 12272 Comm: ptlrpcd_06_01 Kdump: loaded Tainted: P W OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff880052a949d0 ti: ffff88009a330000 task.ti: ffff88009a330000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff88009a333948 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff88028be05070 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff88009a333988 R08: ffffffff818152c0 R09: ffffffff810baff0
R10: 0000000000000000 R11: 000000000000000f R12: ffff880331d5bf00
R13: ffffffff81dbbf10 R14: ffff880331d5bf44 R15: 0000000000690000
FS: 0000000000000000(0000) GS:ffff880331d40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 00000001dad94000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa06007e0>] _debug_req+0x80/0x8b0 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff81068309>] ? kvm_sched_clock_read+0x9/0x20
[<ffffffff81035ec9>] ? sched_clock+0x9/0x10
[<ffffffff8110ae0a>] ? __pv_queued_spin_unlock_slowpath+0x5a/0xa0
[<ffffffff8110a245>] ? __raw_callee_save___pv_queued_spin_unlock_slowpath+0x15/0x24
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffff817e3277>] ? _raw_spin_unlock_irqrestore+0x17/0x30
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa0620b00>] ? ptlrpcd_add_req+0x160/0x3f0 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa08c44f6>] brw_interpret+0xd66/0xe20 [osc]
[<ffffffffa05f0d98>] ptlrpc_check_set+0x3d8/0x2220 [ptlrpc]
[<ffffffffa06219e4>] ptlrpcd+0xaa4/0xb80 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa0620f40>] ? ptlrpcd_ctl_init+0x1b0/0x1b0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
LustreError: lustre-OST0000: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x2000013a1:0x4cfa:0x0] object 0x240000bd0:26586 extent [0-1048575]: client csum 30ec5402, server csum 30ec5401
LustreError: lustre-OST0000-osc-ffff8801bd47e678: BAD WRITE CHECKSUM: changed on the client after we checksummed it - likely false positive due to mmap IO (bug 11742): from 0@lo inode [0x2000013a1:0x4cfa:0x0] object 0x240000bd0:26586 extent [0-1048575], original client csum 30ec5402 (type 20), server csum 30ec5401 (type 20), client csum now 30ec5401
Lustre: *** cfs_fail_loc=408, val=0***
Lustre: Skipped 1 previous similar message
LustreError: lustre-OST0000-osc-ffff8801bd47e678: BAD READ CHECKSUM: from 0@lo inode [0x2000013a1:0x4cfa:0x0] object 0x240000bd0:26586 extent [3145728-4194303], client 53195492/53195492, server 30ec5401, cksum_type 20
LustreError: Skipped 1 previous similar message
LustreError: lustre-OST0000: BAD READ CHECKSUM: should have changed on the client or in transit: from 0@lo inode [0x2000013a1:0x4cfa:0x0] object 0x240000bd0:26586 extent [3145728-4194303], client returned csum 53195492 (type 20), server csum 30ec5401 (type 20)
LustreError: Skipped 1 previous similar message
Link to test
sanity test 77d: checksum error on OST direct write, read
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31be54063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: dm_flakey dm_mod lustre(OE) ofd(OE) osp(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) nfsd ext4 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common pcspkr virtio_console virtio_balloon i2c_piix4 ip_tables rpcsec_gss_krb5 ata_generic pata_acpi drm_kms_helper ttm drm ata_piix drm_panel_orientation_quirks serio_raw virtio_blk floppy libata i2c_core [last unloaded: libcfs]
CPU: 15 PID: 21706 Comm: ptlrpcd_07_00 Kdump: loaded Tainted: P W OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff8800b71349d0 ti: ffff880092b60000 task.ti: ffff880092b60000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff880092b63950 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff8802c7f86ef0 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff880092b63990 R08: ffffffff818157c0 R09: ffffffff810baff0
R10: 0000000000000000 R11: 000000000000000f R12: ffff880331ddbf00
R13: ffffffff81dbbf10 R14: ffff880331ddbf44 R15: 0000000000790000
FS: 0000000000000000(0000) GS:ffff880331dc0000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 00000002e6e36000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa0685e20>] _debug_req+0x80/0x890 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff81068309>] ? kvm_sched_clock_read+0x9/0x20
[<ffffffff81035ec9>] ? sched_clock+0x9/0x10
[<ffffffff8110ae0a>] ? __pv_queued_spin_unlock_slowpath+0x5a/0xa0
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffff817e3277>] ? _raw_spin_unlock_irqrestore+0x17/0x30
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa06a5770>] ? ptlrpcd_add_req+0x170/0x400 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa09182da>] brw_interpret+0xcca/0xd80 [osc]
[<ffffffffa0676b58>] ptlrpc_check_set+0x428/0x2180 [ptlrpc]
[<ffffffffa06a69e4>] ptlrpcd+0xa94/0xb70 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa06a5f50>] ? ptlrpcd_partners+0x3a0/0x3a0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
Lustre: *** cfs_fail_loc=409, val=0***
LustreError: 168-f: lustre-OST0000: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200001b71:0x6125:0x0] object 0x280000bd0:32000 extent [0-1048575]: client csum 30ec5402, server csum 30ec5401
LustreError: 132-0: lustre-OST0000-osc-ffff8802afb95d28: BAD WRITE CHECKSUM: changed on the client after we checksummed it - likely false positive due to mmap IO (bug 11742): from 0@lo inode [0x200001b71:0x6125:0x0] object 0x280000bd0:32000 extent [0-1048575], original client csum 30ec5402 (type 20), server csum 30ec5401 (type 20), client csum now 30ec5401
LustreError: 21706:0:(osc_request.c:2409:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff88025eb6d4c0 x1772600018895936/t21474874798(21474874798) o4->lustre-OST0000-osc-ffff8802afb95d28@0@lo:6/4 lens 488/448 e 0 to 0 dl 1690484429 ref 2 fl Interpret:RMQU/200/0 rc 0/0 uid:0 gid:0 job:'directio.0'
LustreError: 133-1: lustre-OST0000-osc-ffff8802afb95d28: BAD READ CHECKSUM: from 0@lo inode [0x200001b71:0x6125:0x0] object 0x280000bd0:32000 extent [0-1048575], client 53195492/53195492, server 30ec5401, cksum_type 20
LustreError: 132-0: lustre-OST0000: BAD READ CHECKSUM: should have changed on the client or in transit: from 0@lo inode [0x200001b71:0x6125:0x0] object 0x280000bd0:32000 extent [0-1048575], client returned csum 53195492 (type 20), server csum 30ec5401 (type 20)
Link to test
sanity test 77b: checksum error on client write, read
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31bf60063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: dm_flakey dm_mod lustre(OE) ofd(OE) osp(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) nfsd ext4 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common i2c_piix4 virtio_balloon pcspkr virtio_console ip_tables rpcsec_gss_krb5 ata_generic drm_kms_helper pata_acpi ttm drm ata_piix drm_panel_orientation_quirks virtio_blk serio_raw i2c_core libata floppy [last unloaded: libcfs]
CPU: 2 PID: 31923 Comm: ptlrpcd_01_01 Kdump: loaded Tainted: P W OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff8802c8a35c40 ti: ffff8801b7708000 task.ti: ffff8801b7708000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff8801b770b950 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff880323f4a770 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff8801b770b990 R08: ffffffff81813c40 R09: ffffffff810baff0
R10: 0000000000000000 R11: 000000000000000f R12: ffff880331a9bf00
R13: ffffffff81dbbf10 R14: ffff880331a9bf44 R15: 0000000000110000
FS: 0000000000000000(0000) GS:ffff880331a80000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 00000002c655c000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa0613de0>] _debug_req+0x80/0x890 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff81068309>] ? kvm_sched_clock_read+0x9/0x20
[<ffffffff81035ec9>] ? sched_clock+0x9/0x10
[<ffffffff8110ae0a>] ? __pv_queued_spin_unlock_slowpath+0x5a/0xa0
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffff817e3277>] ? _raw_spin_unlock_irqrestore+0x17/0x30
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa0633720>] ? ptlrpcd_add_req+0x170/0x400 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa08f82da>] brw_interpret+0xcca/0xd80 [osc]
[<ffffffffa0604b58>] ptlrpc_check_set+0x428/0x2180 [ptlrpc]
[<ffffffffa0634994>] ptlrpcd+0xa94/0xb70 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa0633f00>] ? ptlrpcd_partners+0x3a0/0x3a0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
Lustre: DEBUG MARKER: set checksum type to crc32, rc = 0
Lustre: *** cfs_fail_loc=408, val=0***
LustreError: 133-1: lustre-OST0002-osc-ffff8802d3192548: BAD READ CHECKSUM: from 0@lo inode [0x200001b71:0x5ff7:0x0] object 0x2c0000401:31379 extent [0-1048575], client 794146d1/794146d1, server 80a1047b, cksum_type 1
LustreError: 31921:0:(osc_request.c:2409:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff880320ea20c0 x1771640881916544/t0(0) o3->lustre-OST0002-osc-ffff8802d3192548@0@lo:6/4 lens 488/440 e 0 to 0 dl 1689569701 ref 2 fl Interpret:RMQU/200/0 rc 1048576/1048576 uid:0 gid:0 job:'cmp.0'
LustreError: 132-0: lustre-OST0002: BAD READ CHECKSUM: should have changed on the client or in transit: from 0@lo inode [0x200001b71:0x5ff7:0x0] object 0x2c0000401:31379 extent [0-1048575], client returned csum 794146d1 (type 1), server csum 80a1047b (type 1)
Lustre: DEBUG MARKER: set checksum type to adler, rc = 0
Lustre: *** cfs_fail_loc=408, val=0***
LustreError: 133-1: lustre-OST0002-osc-ffff8802d3192548: BAD READ CHECKSUM: from 0@lo inode [0x200001b71:0x5ff7:0x0] object 0x2c0000401:31379 extent [0-1048575], client f2ad1591/f2ad1591, server b64e1551, cksum_type 2
LustreError: 31920:0:(osc_request.c:2409:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8801abc8d4c0 x1771640881918528/t0(0) o3->lustre-OST0002-osc-ffff8802d3192548@0@lo:6/4 lens 488/440 e 0 to 0 dl 1689569702 ref 2 fl Interpret:RMQU/200/0 rc 1048576/1048576 uid:0 gid:0 job:'cmp.0'
LustreError: 132-0: lustre-OST0002: BAD READ CHECKSUM: should have changed on the client or in transit: from 0@lo inode [0x200001b71:0x5ff7:0x0] object 0x2c0000401:31379 extent [0-1048575], client returned csum f2ad1591 (type 2), server csum b64e1551 (type 2)
Lustre: DEBUG MARKER: set checksum type to crc32c, rc = 0
Lustre: *** cfs_fail_loc=408, val=0***
LustreError: 133-1: lustre-OST0003-osc-ffff8802d3192548: BAD READ CHECKSUM: from 0@lo inode [0x200001b71:0x5ff7:0x0] object 0x300000401:31344 extent [0-1048575], client 15504bf0/15504bf0, server d0b82c03, cksum_type 4
LustreError: 31923:0:(osc_request.c:2409:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff880323f48d40 x1771640881921920/t0(0) o3->lustre-OST0003-osc-ffff8802d3192548@0@lo:6/4 lens 488/440 e 0 to 0 dl 1689569704 ref 2 fl Interpret:RMQU/200/0 rc 1048576/1048576 uid:0 gid:0 job:'cmp.0'
LustreError: 132-0: lustre-OST0003: BAD READ CHECKSUM: should have changed on the client or in transit: from 0@lo inode [0x200001b71:0x5ff7:0x0] object 0x300000401:31344 extent [0-1048575], client returned csum 15504bf0 (type 4), server csum d0b82c03 (type 4)
Link to test
sanity test 77f: repeat checksum error on write (expect error)
BUG: unable to handle kernel paging request at ffffffff81dbbf10
IP: [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
PGD 1c14067 PUD 1c15063 PMD 31bf32063 PTE 8000000001dbb062
Oops: 0002 [#1] SMP DEBUG_PAGEALLOC
Modules linked in: dm_flakey dm_mod lustre(OE) ofd(OE) osp(OE) lod(OE) ost(OE) mdt(OE) mdd(OE) mgs(OE) osd_ldiskfs(OE) ldiskfs(OE) lquota(OE) lfsck(OE) obdecho(OE) mgc(OE) mdc(OE) lov(OE) osc(OE) lmv(OE) fid(OE) fld(OE) ptlrpc_gss(OE) ptlrpc(OE) obdclass(OE) ksocklnd(OE) lnet(OE) libcfs(OE) nfsd ext4 loop zfs(PO) zunicode(PO) zlua(PO) zcommon(PO) znvpair(PO) zavl(PO) icp(PO) spl(O) jbd2 mbcache crc32_generic crc_t10dif crct10dif_generic crct10dif_common i2c_piix4 virtio_console pcspkr virtio_balloon ip_tables rpcsec_gss_krb5 ata_generic pata_acpi drm_kms_helper ttm drm drm_panel_orientation_quirks serio_raw virtio_blk floppy ata_piix i2c_core libata [last unloaded: libcfs]
CPU: 13 PID: 3502 Comm: ptlrpcd_06_01 Kdump: loaded Tainted: P W OE ------------ 3.10.0-7.9-debug #2
Hardware name: Red Hat KVM, BIOS 1.16.0-3.module_el8.7.0+1218+f626c2ff 04/01/2014
task: ffff8802b4ec3760 ti: ffff88031af88000 task.ti: ffff88031af88000
RIP: 0010:[<ffffffff8110a9b2>] [<ffffffff8110a9b2>] __pv_queued_spin_lock_slowpath+0x1f2/0x3d0
RSP: 0018:ffff88031af8b950 EFLAGS: 00010282
RAX: 0000000000008000 RBX: ffff8802001c61f0 RCX: 0000000000000001
RDX: 0000000000000010 RSI: 0000000000000000 RDI: ffffffff81dbbf10
RBP: ffff88031af8b990 R08: ffffffff818152c0 R09: ffffffff810baff0
R10: 0000000000000000 R11: 000000000000000f R12: ffff880331d5bf00
R13: ffffffff81dbbf10 R14: ffff880331d5bf44 R15: 0000000000690000
FS: 0000000000000000(0000) GS:ffff880331d40000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: ffffffff81dbbf10 CR3: 00000001a1bc4000 CR4: 00000000000007e0
Call Trace:
[<ffffffff8141192d>] do_raw_spin_lock+0x6d/0xa0
[<ffffffff817e320e>] _raw_spin_lock+0x1e/0x20
[<ffffffffa05f03b0>] _debug_req+0x80/0x890 [ptlrpc]
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff810682e3>] ? kvm_clock_read+0x33/0x40
[<ffffffff81068309>] ? kvm_sched_clock_read+0x9/0x20
[<ffffffff81035ec9>] ? sched_clock+0x9/0x10
[<ffffffff8110ae0a>] ? __pv_queued_spin_unlock_slowpath+0x5a/0xa0
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffff817e3277>] ? _raw_spin_unlock_irqrestore+0x17/0x30
[<ffffffff810c8631>] ? __wake_up_common_lock+0x91/0xc0
[<ffffffff810c7f10>] ? sched_feat_set+0xf0/0xf0
[<ffffffff810c8673>] ? __wake_up+0x13/0x20
[<ffffffffa0610a00>] ? ptlrpcd_add_req+0x170/0x400 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa09091ab>] brw_interpret+0xcdb/0xdc0 [osc]
[<ffffffffa05e1b28>] ptlrpc_check_set+0x408/0x2280 [ptlrpc]
[<ffffffffa0611c74>] ptlrpcd+0xa94/0xb70 [ptlrpc]
[<ffffffff810baff0>] ? abort_exclusive_wait+0xa0/0xa0
[<ffffffffa06111e0>] ? ptlrpcd_partners+0x3a0/0x3a0 [ptlrpc]
[<ffffffff810ba114>] kthread+0xe4/0xf0
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
[<ffffffff817ede5d>] ret_from_fork_nospec_begin+0x7/0x21
[<ffffffff810ba030>] ? kthread_create_on_node+0x140/0x140
Lustre: DEBUG MARKER: set checksum type to crc32, rc = 0
LustreError: 168-f: lustre-OST0000: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200002b11:0x3fd1:0x0] object 0x2c0000bd0:6502 extent [0-1048575]: client csum b2f1b12, server csum b2f1b11
LustreError: 132-0: lustre-OST0000-osc-ffff88028bcf8008: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200002b11:0x3fd1:0x0] object 0x2c0000bd0:6502 extent [0-1048575], original client csum b2f1b12 (type 1), server csum b2f1b11 (type 1), client csum now b2f1b12
LustreError: 168-f: lustre-OST0002: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200002b11:0x3fd1:0x0] object 0x340000bd0:6565 extent [0-1048575]: client csum b2f1b12, server csum b2f1b11
LustreError: Skipped 8 previous similar messages
LustreError: 132-0: lustre-OST0000-osc-ffff88028bcf8008: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200002b11:0x3fd1:0x0] object 0x2c0000bd0:6502 extent [0-1048575], original client csum b2f1b12 (type 1), server csum b2f1b11 (type 1), client csum now b2f1b12
LustreError: Skipped 8 previous similar messages
LustreError: 168-f: lustre-OST0002: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200002b11:0x3fd1:0x0] object 0x340000bd0:6565 extent [0-1048575]: client csum b2f1b12, server csum b2f1b11
LustreError: Skipped 7 previous similar messages
LustreError: 132-0: lustre-OST0002-osc-ffff88028bcf8008: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200002b11:0x3fd1:0x0] object 0x340000bd0:6565 extent [0-1048575], original client csum b2f1b12 (type 1), server csum b2f1b11 (type 1), client csum now b2f1b12
LustreError: Skipped 7 previous similar messages
LustreError: 168-f: lustre-OST0001: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200002b11:0x3fd1:0x0] object 0x300000bd0:6853 extent [0-1048575]: client csum b2f1b12, server csum b2f1b11
LustreError: Skipped 14 previous similar messages
LustreError: 132-0: lustre-OST0001-osc-ffff88028bcf8008: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200002b11:0x3fd1:0x0] object 0x300000bd0:6853 extent [0-1048575], original client csum b2f1b12 (type 1), server csum b2f1b11 (type 1), client csum now b2f1b12
LustreError: Skipped 14 previous similar messages
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 89 previous similar messages
LustreError: 3503:0:(osc_request.c:2403:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8802e9abc7c0 x1763319366623936/t12884941735(12884941735) o4->lustre-OST0001-osc-ffff88028bcf8008@0@lo:6/4 lens 488/448 e 0 to 0 dl 1681633815 ref 2 fl Interpret:RMQU/200/0 rc 0/0 uid:0 gid:0 job:'ptlrpcd_07_00.0'
LustreError: 3503:0:(osc_request.c:2403:osc_brw_redo_request()) Skipped 41 previous similar messages
LustreError: 168-f: lustre-OST0001: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200002b11:0x3fd1:0x0] object 0x300000bd0:6853 extent [0-1048575]: client csum b2f1b12, server csum b2f1b11
LustreError: Skipped 15 previous similar messages
LustreError: 132-0: lustre-OST0001-osc-ffff88028bcf8008: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200002b11:0x3fd1:0x0] object 0x300000bd0:6853 extent [0-1048575], original client csum b2f1b12 (type 1), server csum b2f1b11 (type 1), client csum now b2f1b12
LustreError: Skipped 15 previous similar messages
LustreError: 168-f: lustre-OST0001: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200002b11:0x3fd1:0x0] object 0x300000bd0:6853 extent [0-1048575]: client csum b2f1b12, server csum b2f1b11
LustreError: Skipped 23 previous similar messages
LustreError: 132-0: lustre-OST0001-osc-ffff88028bcf8008: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200002b11:0x3fd1:0x0] object 0x300000bd0:6853 extent [0-1048575], original client csum b2f1b12 (type 1), server csum b2f1b11 (type 1), client csum now b2f1b12
LustreError: Skipped 23 previous similar messages
LustreError: 3503:0:(osc_request.c:2533:brw_interpret()) lustre-OST0001-osc-ffff88028bcf8008: too many resent retries for object: 12884904912:6853, rc = -11.
LustreError: 3492:0:(osc_request.c:2533:brw_interpret()) lustre-OST0002-osc-ffff88028bcf8008: too many resent retries for object: 13958646736:6565, rc = -11.
LustreError: 3492:0:(osc_request.c:2533:brw_interpret()) Skipped 4 previous similar messages
Lustre: DEBUG MARKER: set checksum type to adler, rc = 0
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 111 previous similar messages
LustreError: 3503:0:(osc_request.c:2403:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff8802cdcd86c0 x1763319366654272/t21474875653(21474875653) o4->lustre-OST0000-osc-ffff88028bcf8008@0@lo:6/4 lens 488/448 e 0 to 0 dl 1681633848 ref 2 fl Interpret:RMQU/200/0 rc 0/0 uid:0 gid:0 job:'ptlrpcd_07_00.0'
LustreError: 3503:0:(osc_request.c:2403:osc_brw_redo_request()) Skipped 47 previous similar messages
LustreError: 168-f: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200002b11:0x3fd1:0x0] object 0x380000bd0:6533 extent [1048576-2097151]: client csum 19eeae62, server csum 19eeae61
LustreError: Skipped 71 previous similar messages
LustreError: 132-0: lustre-OST0002-osc-ffff88028bcf8008: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200002b11:0x3fd1:0x0] object 0x340000bd0:6565 extent [1048576-2097151], original client csum 19eeae62 (type 2), server csum 19eeae61 (type 2), client csum now 19eeae62
LustreError: Skipped 71 previous similar messages
LustreError: 3496:0:(osc_request.c:2533:brw_interpret()) lustre-OST0002-osc-ffff88028bcf8008: too many resent retries for object: 13958646736:6565, rc = -11.
LustreError: 3496:0:(osc_request.c:2533:brw_interpret()) Skipped 2 previous similar messages
Lustre: DEBUG MARKER: set checksum type to crc32c, rc = 0
Lustre: *** cfs_fail_loc=409, val=0***
Lustre: Skipped 255 previous similar messages
LustreError: 3501:0:(osc_request.c:2403:osc_brw_redo_request()) @@@ redo for recoverable error -11 req@ffff880224ddc7c0 x1763319366708736/t12884941579(12884941579) o4->lustre-OST0003-osc-ffff88028bcf8008@0@lo:6/4 lens 488/448 e 0 to 0 dl 1681633914 ref 2 fl Interpret:RMQU/200/0 rc 0/0 uid:0 gid:0 job:'ptlrpcd_06_00.0'
LustreError: 3501:0:(osc_request.c:2403:osc_brw_redo_request()) Skipped 119 previous similar messages
LustreError: 3501:0:(osc_request.c:2533:brw_interpret()) lustre-OST0002-osc-ffff88028bcf8008: too many resent retries for object: 13958646736:6565, rc = -11.
LustreError: 3501:0:(osc_request.c:2533:brw_interpret()) Skipped 7 previous similar messages
LustreError: 168-f: lustre-OST0003: BAD WRITE CHECKSUM: from 12345-0@lo inode [0x200002b11:0x3fd1:0x0] object 0x380000bd0:6533 extent [1048576-2097151]: client csum b5ea7f3c, server csum b5ea7f3b
LustreError: Skipped 118 previous similar messages
LustreError: 132-0: lustre-OST0001-osc-ffff88028bcf8008: BAD WRITE CHECKSUM: changed in transit before arrival at OST: from 0@lo inode [0x200002b11:0x3fd1:0x0] object 0x300000bd0:6853 extent [1048576-2097151], original client csum b5ea7f3c (type 4), server csum b5ea7f3b (type 4), client csum now b5ea7f3c
LustreError: Skipped 116 previous similar messages
Lustre: DEBUG MARKER: set checksum type to t10ip512, rc = 0
LustreError: 3501:0:(osc_request.c:2533:brw_interpret()) lustre-OST0000-osc-ffff88028bcf8008: too many resent retries for object: 11811163088:6502, rc = -11.
LustreError: 3501:0:(osc_request.c:2533:brw_interpret()) Skipped 7 previous similar messages
Lustre: DEBUG MARKER: set checksum type to t10ip4K, rc = 0
LustreError: 3502:0:(osc_request.c:2533:brw_interpret()) lustre-OST0002-osc-ffff88028bcf8008: too many resent retries for object: 13958646736:6565, rc = -11.
LustreError: 3502:0:(osc_request.c:2533:brw_interpret()) Skipped 7 previous similar messages
Lustre: DEBUG MARKER: set checksum type to t10crc512, rc = 0
Link to test
Return to new crashes list