Mailing List Archive


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[tlug] help! raid failure



This is one of those seagate drives driving me insane. I have to identical
(except for the firmware) in a raid array with raid1 and raid0 devices.
I was getting dma timer expiries earlier also a few times, the reset
resulted in a success but this time it is different:

---------kern.log----------------------
hda: dma_timer_expiry: dma status == 0x20
hda: timeout waiting for DMA
hda: timeout waiting for DMA
hda: (__ide_dma_test_irq) called while not waiting
hda: status timeout: status=0xd0 { Busy }
 
hda: drive not ready for command
ide0: reset timed-out, status=0x80
hda: status timeout: status=0x80 { Busy }
 
hda: drive not ready for command
ide0: reset timed-out, status=0x80
end_request: I/O error, dev 03:07 (hda), sector 52720
raid1: Disk failure on hda7, disabling device.
        Operation continuing on 1 devices
end_request: I/O error, dev 03:07 (hda), sector 52728
end_request: I/O error, dev 03:07 (hda), sector 52736
...
md: updating md7 RAID superblock on device
md: hdc7 [events: 00000012]<6>(write) hdc7's sb offset: 2666688
md: recovery thread got woken up ...
md7: no spare disk to reconstruct array! -- continuing in degraded mode
md: updating md10 RAID superblock on device
md: hdc10 [events: 0000000a]<6>(write) hdc10's sb offset: 20482752
md: (skipping faulty hda7 )
md: (skipping faulty hda10 )
md10: no spare disk to reconstruct array! -- continuing in degraded mode
md: recovery thread finished ...
---------------------------------------

The box was doing okay, but I was stupid enough to access the partition
(with reiser) on the raid0 device, and that resulted in a kernel oops in
kupdated:

---------kern.log----------------------
journal-601, buffer write failed
 (device ide0(3,11))
kernel BUG at prints.c:341!
invalid operand: 0000
CPU:    0
EIP:    0010:[<c017580a>]    Not tainted
EFLAGS: 00010282
eax: 00000039   ebx: cc052000   ecx: cd5e2000   edx: ce18a014
esi: 00000000   edi: c6ec65a0   ebp: cc052000   esp: c12a1e2c
ds: 0018   es: 0018   ss: 0018
Process kupdated (pid: 7, stackpage=c12a1000)
---------------------------------------

I tried doing a reboot, but the process hangs and it doesn't do anything:
root    pts/9    Z    06:44   0:00 [shutdown] <defunct>

The servicies (ssh, smtp, etc) still work though.

My biggest problem now is that I'm on the other side of the planet. 
I guess I'm fucked.

Any suggestions?


Home | Main Index | Thread Index

Home Page Mailing List Linux and Japan TLUG Members Links