Josh-D. S. Davis

Xaminmo / Omnimax / Max Omni / Mad Scientist / Midnight Shadow / Radiation Master

RAID Failures
joshdavis
Enterprise disk vendors who don't offer RAID6 will assure you that it's statistically impossible for two disks in the same RAIDset to fail at the same time. Considering how often it happens in reality, I think their statistics are based on flawed logic.
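For what it's worth, here is a rough sketch of the kind of arithmetic that produces those reassuring numbers. It assumes disk failures are independent and exponentially distributed, which is an assumption made for illustration and not something taken from any vendor's documentation. The disk count, exposure window, and MTBF below are hypothetical values, not measurements from this array.

```python
import math

def p_second_failure(remaining_disks, window_hours, mtbf_hours):
    """Chance that at least one remaining disk fails during the window,
    ASSUMING independent, exponentially distributed failures."""
    combined_rate = remaining_disks / mtbf_hours  # failures per hour
    return 1.0 - math.exp(-combined_rate * window_hours)

# Hypothetical inputs: 4 surviving disks, 72 hours until the
# replacement is installed and rebuilt, 1,000,000 hour rated MTBF.
p = p_second_failure(remaining_disks=4, window_hours=72, mtbf_hours=1_000_000)
print(f"Modelled chance of a second failure: {p:.4%}")
```

The result is tiny only because of the independence assumption. Disks in one RAIDset usually share age, batch, power, controller, and temperature, so their failures tend to be correlated, and the real odds can be far worse than this model suggests.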

Before I could get a replacement installed, a second disk failed. This killed the OS, my driver downloads, work downloads, kids' dropbox, and anything else I've grabbed other than music and movies.

I have backups of the OS and of /backups, though I don't have a bootable OS backup.

I'm going to see if I can boot from a USB stick, force the array online, errors and all, and harvest whatever data I can. Not sure how well this will work, because a lot of the drives are failing to become ready. :(
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   081   081   006    Pre-fail  Always       -       13306263
  3 Spin_Up_Time            0x0003   096   093   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       74
  5 Reallocated_Sector_Ct   0x0033   029   029   036    Pre-fail  Always   FAILING_NOW 1466
  7 Seek_Error_Rate         0x000f   082   060   030    Pre-fail  Always       -       4466624142
  9 Power_On_Hours          0x0032   072   072   000    Old_age   Always       -       24954
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       74
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       151
188 Command_Timeout         0x0032   100   099   000    Old_age   Always       -       12885098500
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   066   046   045    Old_age   Always       -       34 (Lifetime Min/Max 30/34)
194 Temperature_Celsius     0x0022   034   054   000    Old_age   Always       -       34 (0 25 0 0)
195 Hardware_ECC_Recovered  0x001a   046   019   000    Old_age   Always       -       13306263
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       527
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       527
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
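
The table above is in the format that `smartctl -A` prints. As a minimal sketch of how to pick out the bad rows automatically, the snippet below flags any attribute whose normalized VALUE is at or below its THRESH. The function name and the three-row sample are illustrative only; the rows are copied from the table above.

```python
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   081   081   006    Pre-fail  Always       -       13306263
  5 Reallocated_Sector_Ct   0x0033   029   029   036    Pre-fail  Always   FAILING_NOW 1466
187 Reported_Uncorrect      0x0032   001   001   000    Old_age   Always       -       151
"""

def failing_attributes(smart_output):
    """Return (id, name, value, thresh) for attributes at or below threshold."""
    failing = []
    for line in smart_output.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns;
        # this skips the header and any blank lines.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        attr_id, name = int(fields[0]), fields[1]
        value, thresh = int(fields[3]), int(fields[5])
        # A threshold of 0 means the attribute can never trip.
        if thresh > 0 and value <= thresh:
            failing.append((attr_id, name, value, thresh))
    return failing

for attr_id, name, value, thresh in failing_attributes(SAMPLE):
    print(f"{attr_id:>3} {name}: value {value} <= threshold {thresh}")
```

On this drive only Reallocated_Sector_Ct trips its threshold. Reported_Uncorrect and Current_Pending_Sector have a threshold of 0, so they are never flagged this way, but their raw counts (151 and 527) are worth reading by eye.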

Do you have another copy of your data somewhere?

The critical data is backed up on 2 different sets of disks in my house, and that backup is also copied to a remote Internet backup service.

I had lots of non-critical but pain-to-replace data. Luckily, "failed" disk 2 was just marked dirty, because either the SATA controller or the Linux kernel disabled 2 ports when the first drive went not_ready.

I have the array up and degraded now, and replacement disks are en route.
