Argh Again!
If it’s not one thing it’s another. Logged in and the load was up to 160+, couldn’t log in as a user, couldn’t write to /home, nothing. Based on the dmesg output (read more for that) it looks like some sort of error in either the scsi drive or the scsi drive itself. Funny (not in a funny way) as I was saying not that long ago how the SCSI drive hasn’t had a blip while the IDE drives were dying off left right and center, and Fred was talking about getting a drive to mirror the /home drive onto. Fsck.
I rebooted with the options to reboot immediately and it went offline, so it’s rebooted, but it’s not coming up now. I have a call into the data center to see what is up… it might need a root password to fsck or something. Will update when they get back to me (dude has to finish dinner or something).
sigh Hope everyone had /home backed up 🙁 🙁 🙁 Hope it’s not that bad though <fingers crossed>l;
Update Looks like the disk might be toast. Hopefully fred can convince it to come back to life tomorrow morning though. More on ufies.org.
SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 10000
I/O error: dev 08:02, sector 57344
journal-601, buffer write failed
(device sd(8,2))
kernel BUG at prints.c:341!
invalid operand: 0000
CPU: 1
EIP: 0010:[] Not tainted
EFLAGS: 00010282
eax: 00000036 ebx: f7665000 ecx: 00000000 edx: f6b6bf7c
esi: 00000000 edi: 00000012 ebp: f7665000 esp: c284be14
ds: 0018 es: 0018 ss: 0018
Process kupdated (pid: 7, stackpage=c284b000)
Stack: f897c139 f897ca00 c033b7e0 f89c2b48 f896f4eb f7665000 f8979f20 c284be4c
c014470a 00000029 00001000 0000002c 0000002a 00000000 d41fdb60 f7665000
00000023 00000001 00000004 f8973d5b f7665000 f89c2b48 00000001 00000006
Call Trace: [] [ ] [ ] [ ] [ ]
[] [ ] [ ] [ ] [ ] [ ]
[] [ ] [ ] [ ] [ ] [ ]
Code: 0f 0b 55 01 4c c1 97 f8 85 db b8 55 c1 97 f8 74 0c 0f b7 43
SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 10000
I/O error: dev 08:02, sector 57352
SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 10000
I/O error: dev 08:02, sector 57360
[snip]
SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 10000
I/O error: dev 08:02, sector 57472
scsi0:A:0:0: DV failed to configure device. Please file a bug report against this driver.
SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 10000
I/O error: dev 08:02, sector 57480
[snip]
(scsi0:A:0): 40.000MB/s transfers (20.000MHz, offset 63, 16bit)
(scsi0:A:0:0): Unexpected busfree in Data-in phase
SEQADDR == 0x9a
SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 10000
I/O error: dev 08:02, sector 65960
sd(8,2):vs-13070: reiserfs_read_inode2: i/o failure occurred trying to find stat data of [14 149 0x0 SD]
SCSI disk error : host 0 channel 0 id 0 lun 0 return code = 10000
I/O error: dev 08:02, sector 124176
sd(8,2):vs-13070: reiserfs_read_inode2: i/o failure occurred trying to find stat data of [14 434102 0x0 SD]
(scsi0:A:0): 40.000MB/s transfers (20.000MHz, offset 63, 16bit)
Bad cable or connection to the SCSI drive? You mentioned that you had a “minor heart attack” in your previous note with the SCSI drive.
Also ask what time they open at when they call back. I’ll take my lunch at 8am if there open then to get a drive in or do some work on it.
PS, I think it’s time for SysAdmin appreciation day!
Appreciate it. Hopefully the other 163(ish) users will be as forgiving 🙁
“Backups: Your Responsibility!!” has been in the motd for ages, or not?
(I had nothing important in my home dir, no worries. I’m pretty forgiving for something I’m not using/paying for 🙂 )
Debian has some scsitools… maybe there’s some way to do run some diagnostics or readonly mount?
I’ve ordered a couple of SATA drives to do a mirror RAID for backup, but I wonder if even that’s enough?
I’ll keep you guys posted what happens this morning. At this point, I don’t care about the $$. I’m going to get another 2x15k RPM drives for a SCSI array and if the PS will hold it a 120GB IDE backup drive just for backups. Arc, perhaps tar everything up weekly.;)
Thankfully Arc has made that clear from the begining (Backups). Everything really important I have backedup, but stull sucks. Had that not been the case I’d have a few thousand TDIClubers chasing me, some of whom know where I live and work!;)
No worries on the bad hardware mojo 🙂 Nothing important lost.
I needed a reason to redesign anyway 😉