[2.4.18] oops in lvm or raid

From: Roger Gammans
Date: Mon Dec 29 2003 - 10:00:50 EST


Hi

The system in question runs:-
ext3-> lvm->raid1(hdc,hde) -> ide( piix , pdc202xx )

We had a power failure here which caused severe corruption
on our system. After coming back up, one of the ext3 partitions
wouldn't recover its journal, and a manual fsck
gave warnings of severe data loss. Before fscking I created an
lvm snapshot of the partition, desperately hoping that the lvm
metadata was intact, and used this to write the raw pre-fsck
filesystem image to a tape.

Anyway, to cut to the chase, I forgot to delete this snapshot partition
before recovering the filesystem from a backup, and at about the time
the snapshot got full I got the following kernel messages:


Dec 29 13:15:23 turin kernel: lvm -- giving up to snapshot /dev/rootvg/data_root on /dev/rootvg/data_20031218: out of spa
Dec 29 13:15:23 turin kernel: Unable to handle kernel paging request at virtual address 00015618
Dec 29 13:15:23 turin kernel: printing eip:
Dec 29 13:15:23 turin kernel: c4847a7c
c4847a7c -> lvm_snapshot_remap_block (c4847a0c)
Dec 29 13:15:23 turin kernel: *pde = 00000000
Dec 29 13:15:23 turin kernel: Oops: 0000
Dec 29 13:15:23 turin kernel: CPU: 0
Dec 29 13:15:23 turin kernel: EIP: 0010:[raid1:__insmod_raid1_O/lib/modules/2.4.18-bf2.4/kernel/drivers/md+-169348/96]
Dec 29 13:15:23 turin kernel: EFLAGS: 00010246
Dec 29 13:15:23 turin kernel: eax: 0000ffff ebx: 00015618 ecx: c2741600 edx: 00000000
Dec 29 13:15:23 turin kernel: esi: 0111e1f8 edi: 00000900 ebp: 00000000 esp: c1173e58
Dec 29 13:15:23 turin kernel: ds: 0018 es: 0018 ss: 0018
Dec 29 13:15:23 turin kernel: Process kswapd (pid: 4, stackpage=c1173000)
Dec 29 13:15:23 turin kernel: Stack: c2741600 0111e178 c2741770 c2741170 c0392520 00002ac3 00000038 00000000
Dec 29 13:15:23 turin kernel: c4844a5d c1173eb6 c1173eb8 0111e178 c2741600 00003a00 c3abc500 0111c0b8
Dec 29 13:15:23 turin kernel: 01680000 011240b8 01680000 c2741000 c3b9f000 0111e178 000088e0 09005920
Dec 29 13:15:23 turin kernel: Call Trace: [raid1:__insmod_raid1_O/lib/modules/2.4.18-bf2.4/kernel/drivers/md+-181667/96]
Dec 29 13:15:23 turin kernel: [try_to_free_buffers+174/212] [try_to_release_page+63/72] [shrink_cache+458/728] [shrink
Dec 29 13:15:23 turin kernel: [kswapd_balance+18/40] [kswapd+153/188] [kernel_thread+40/56]
c0392520 -> tasklist_lock (c0392510)
c4844a5d -> lvm_init (c4843060)
Dec 29 13:15:23 turin kernel:
Dec 29 13:15:23 turin kernel: Code: 8b 0b eb 03 45 8b 09 39 d9 74 27 39 71 08 75 f4 66 39 79 0c


The lines without a date stamp are ones I added by hand from my own oops
decoding, which doesn't seem to match the ksymoops handling.
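
For anyone wanting to repeat the hand decoding, here is a minimal sketch of
the lookup: map each address to the nearest symbol that starts at or below it.
The three symbol addresses are the ones from the undated lines in the trace;
the `resolve` helper and the `SYMBOLS` table are only illustrative names, and a
real lookup would need the full symbol list (System.map or /proc/ksyms).

```python
import bisect

# Symbol start addresses copied from the hand-decoded lines in the trace.
# This table is deliberately incomplete; it is an illustration only.
SYMBOLS = {
    0xC0392510: "tasklist_lock",
    0xC4843060: "lvm_init",
    0xC4847A0C: "lvm_snapshot_remap_block",
}


def resolve(addr, symbols=SYMBOLS):
    """Return (name, offset) for the nearest symbol at or below addr.

    Returns None when addr lies below every known symbol.
    """
    starts = sorted(symbols)
    # Index of the last symbol start that is <= addr.
    i = bisect.bisect_right(starts, addr) - 1
    if i < 0:
        return None
    base = starts[i]
    return symbols[base], addr - base


if __name__ == "__main__":
    for addr in (0xC4847A7C, 0xC0392520, 0xC4844A5D):
        name, offset = resolve(addr)
        print(f"{addr:08x} -> {name}+0x{offset:x}")
```

With a table this sparse, a large offset (such as the one reported against
lvm_init) may well mean the address belongs to a function that is not in the
table at all, so treat such matches with suspicion.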

This is a stock kernel (bf2.4) from debian stable (Version: 2.4.18-5)

More information is available on request. I'm going to try to find time
to set up a similar system to try to reproduce this, but that might not be
this week.

TTFN
--
Roger. Home| http://www.sandman.uklinux.net/
Master of Peng Shui. (Ancient oriental art of Penguin Arranging)
Work|Independent Sys Consultant | http://www.computer-surgery.co.uk/
So what are the eigenvalues and eigenvectors of 'The Matrix'? --anon
