[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RAIDFrame reconstruction panics



Running OpenBSD 3.6. The kernel is a .MP with RAIDFrame compiled in and 
maxuser changed to 64.

The short of what I do to recreate the panic:

# raidctl -f /dev/wd0d raid0
*output about failing component*
# raidctl -s raid0
raid0 Components:
           /dev/wd0d: failed
           /dev/wd1d: optimal
No spares.
Parity status: clean
Reconstruction is 100% complete.
Parity Re-write is 100% complete.
Copyback is 100% complete.
# raidctl -R /dev/wd0d raid0
Closing the opened device: /dev/wd0d
uvm_fault(0xd064bb20, 0x0, 0, 1) -> e
kernel: page fault trap, code=0
Stopped at      lockmgr+0x76:   movl    0x34(%eax),%eax
ddb{0}> trace
lockmgr(d3c9ec98,30042,d3d64144,0,0) at lockmgr+0x76
ufs_lock(dac9abec,d3c4e3e0,dac9abfc,d0368595,20042) at ufs_lock+0x28
VOP_LOCK(d3d640c4,30042,0,1) at VOP_LOCK+0x35
vn_lock(d3d640c4,20002,0,0,d3c9ec44,dac9acb8,d3c4e3e0,d036027b,a0,3,dac9ac8c,d0
31aea9,dac9acb8,dac9ac70,dac9ac70,26,d3d640c4,d3d640c4,0,10,d3d40010,d3c4e3e0,d
ac9accc,d025f5af,dac9acb8,4,d0606224,0,d3d63000,0,d01020c6,8,203,d0606bc0,d3d64
0c4,3,ffffffff,0,dac9acfc,d01fee23,d3d640c4,3,ffffffff,0,0,0,dac9ad0c,d023cfea,
0,0,dac9af4c,d0215955,d0b2a000,d3d640c4,1,d0215923,d0515f60,d0b48800,0,0,d0bb18
00,d0b2a000) at vn_lock+0x3a
spec_close(dac9acb8,dac9ac70,dac9ac70,26,d3d640c4) at spec_close+0xbc
ufsspec_close(dac9acb8,4,d0606224,0,d3d63000) at ufsspec_close+0xdd
VOP_CLOSE(d3d640c4,3,ffffffff,0,0,0,dac9ad0c,d023cfea,0,0,dac9af4c,d0215955,d0b
2a000,d3d640c4,1,d0215923,d0515f60,d0b48800,0,0,d0bb1800,d0b2a000,0,0,0,0,0,0,0
,0,0,0,0,0,d3d640c4,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0) 
at VOP
_CLOSE+0x3b
rf_close_component(d0b2a000,d3d640c4,1,d0215923,d0515f60) at 
rf_close_component
+0x43
rf_ReconstructInPlace(d0b2a000,0,0,d01ff126,40,0) at 
rf_ReconstructInPlace+0x1a
9
rf_ReconstructInPlaceThread(d0bb1800) at 
rf_ReconstructInPlaceThread+0x33
Bad frame pointer: 0xd076fef0
ddb{0}> ps
   PID   PPID   PGRP    UID  S       FLAGS  WAIT       COMMAND
*28928      0      0      0  7   0x2100204             raid_reconip
 20484   3294  20484      0  2   0x2004086             ksh
  3294      1   3294      0  3   0x2004086  pause      csh
   623      1    623      0  3   0x2004086  ttyin      getty
  5869      1   5869      0  3   0x2004086  ttyin      getty
 22276      1  22276      0  3   0x2004086  ttyin      getty
  2047      1   2047      0  3   0x2004086  ttyin      getty
  1573      1   1573      0  3   0x2004086  ttyin      getty
  1992      1   1992      0  3   0x2040184  select     sendmail
 16582      1  16582      0  3   0x2000084  select     cron
 16116      1  16116      0  3   0x2000084  select     sshd
 31771      1  31771      0  3   0x2000184  select     inetd
  4490  30183  30183     73  2   0x2000184             syslogd
 30183      1  30183      0  3   0x2000084  netio      syslogd
 29848      1  29848     77  3   0x2000184  poll       dhclient
 25857      1  31597      0  3   0x2000086  poll       dhclient
    11      0      0      0  3   0x2100204  usbtsk     usbtask
     9      0      0      0  3   0x2100204  usbevt     usb0
     8      0      0      0  3   0x2100204  kmalloc    kmthread
     7      0      0      0  3   0x2100204  crypto_wa  crypto
     6      0      0      0  3   0x2100204  aiodoned   aiodoned
     5      0      0      0  3   0x2100204  syncer     update
     4      0      0      0  3   0x2100204  cleaner    cleaner
     3      0      0      0  2    0x100204             reaper
     2      0      0      0  3   0x2100204  pgdaemon   pagedaemon
     1      0      1      0  3   0x2004084  wait       init
    10      0      0      0  3   0x2100204  rfwcond    raid0
     0     -1      0      0  3   0x2080204  scheduler  swapper
 15971  20484  15971      0  6      0x6006             raidctl
ddb{0}>


I've setup RAIDFrame before, using these same drives in 3.5. I thought 
maybe there could be some left over RAIDFrame cruft, so I ran "dd 
if=/dev/zero of=/dev/wd[0,1]c" and did it over with the exact same 
results.

It -could- be specific to the hardware I'm using. I haven't setup 
RAIDFrame on this certain PC before; it was just something I grabbed 
off an empty desk.

Any help on this would be appreciated.

Chris


dmesg:
OpenBSD 3.6 (GAK.MP) #0: Mon Nov  8 02:37:46 MST 2004
    root_(_at_)_gak_(_dot_)_upnix_(_dot_)_net:/usr/src/arch/i386/compile/GAK.MP
cpu0: Intel Pentium II ("GenuineIntel" 686-class, 512KB L2 cache) 398 
MHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR
real mem  = 133799936 (130664K)
avail mem = 114966528 (112272K)
using 1658 buffers containing 6791168 bytes (6632K) of memory
mainbus0 (root)
bios0 at mainbus0: AT/286+(0e) BIOS, date 02/08/99, BIOS32 rev. 0 @ 
0xec700
pcibios0 at bios0: rev 2.1 @ 0xec700/0x3900
pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xf7280/128 (6 entries)
pcibios0: PCI Interrupt Router at 000:20:0 ("Intel 82371AB PIIX4 ISA" 
rev 0x00)
pcibios0: PCI bus #1 is the last bus
bios0: ROM list: 0xc0000/0x8000 0xe0000/0x8000!
cpu0 at mainbus0: (uniprocessor)
cpu0: Intel Pentium II ("GenuineIntel" 686-class, 512KB L2 cache) 398 
MHz
cpu0: 
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR
pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
pchb0 at pci0 dev 0 function 0 "Intel 82443BX AGP" rev 0x03
ppb0 at pci0 dev 1 function 0 "Intel 82443BX AGP" rev 0x03
pci1 at ppb0 bus 1
vga1 at pci1 dev 0 function 0 "Matrox MGA G200 AGP" rev 0x01
wsdisplay0 at vga1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
xl0 at pci0 dev 13 function 0 "3Com 3c905 100Base-TX" rev 0x00: irq 11, 
address 00:60:97:75:ac:c6
nsphy0 at xl0 phy 24: DP83840 10/100 media interface, rev. 1
fxp0 at pci0 dev 16 function 0 "Intel 82557" rev 0x08: irq 11, address 
ff:ff:ff:ff:ff:ff
fxp0: Disabling dynamic standby mode in EEPROM, New ID 0xfffd, cksum @ 
0x1ff: 0xffff -> 0xbcb9
fxp0: no phy found, using manual mode
pcib0 at pci0 dev 20 function 0 "Intel 82371AB PIIX4 ISA" rev 0x02
pciide0 at pci0 dev 20 function 1 "Intel 82371AB IDE" rev 0x01: DMA, 
channel 0 wired to compatibility, channel 1 wired to compatibility
wd0 at pciide0 channel 0 drive 0: <Maxtor 6Y080P0>
wd0: 16-sector PIO, LBA, 78167MB, 160086528 sectors
wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 2
wd1 at pciide0 channel 1 drive 0: <Maxtor 6Y080P0>
wd1: 16-sector PIO, LBA, 78167MB, 160086528 sectors
wd1(pciide0:1:0): using PIO mode 4, Ultra-DMA mode 2
uhci0 at pci0 dev 20 function 2 "Intel 82371AB USB" rev 0x01: irq 11
usb0 at uhci0: USB revision 1.0
uhub0 at usb0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
"Intel 82371AB Power Mgmt" rev 0x02 at pci0 dev 20 function 3 not 
configured
isa0 at pcib0
isadma0 at isa0
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pms0 at pckbc0 (aux slot)
pckbc0: using irq 12 for aux slot
wsmouse0 at pms0 mux 0
pcppi0 at isa0 port 0x61
midi0 at pcppi0: <PC speaker>
sysbeep0 at pcppi0
lpt0 at isa0 port 0x378/4 irq 7
npx0 at isa0 port 0xf0/16: using exception 16
pccom0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
pccom1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
biomask ef65 netmask ef65 ttymask ffe7
pctr: 686-class user-level performance counters enabled
mtrr: Pentium Pro MTRR support
Kernelized RAIDframe activated
raid0 (root): (RAID Level 1) total number of sectors is 159096576 (77683 
MB) as root
dkcsum: wd0 matched BIOS disk 80
dkcsum: wd1 matched BIOS disk 81
rootdev=0x1300 rrootdev=0x3600 rawdev=0x3602
WARNING: / was not properly unmounted
raid0: Error re-writing parity!



Visit your host, monkey.org