Server 14 Drive Failure

  • Sunday, 12th August, 2012
  • 12:57pm

=============================================

12 August - 7.03pm

Please note that the server is up; this is an informational notice only.

The rebuild is currently 18% complete.

root@server14 [~]# arcconf getstatus 1
Controllers found: 1
Logical device Task:
Logical device                 : 0
Task ID                        : 101
Current operation              : Rebuild
Status                         : In Progress
Priority                       : Low
Percentage complete            : 18
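
For anyone who wants a rough idea of when the rebuild will finish, the two readings posted so far (1% at 2.42pm and 18% at 7.03pm) can be extrapolated. The following is only a sketch, not part of our monitoring: the function names are made up for illustration, and it assumes the rebuild continues at a constant rate, which may not hold because the task runs at Low priority and its speed depends on server load.

```python
import re
from datetime import datetime


def parse_percent(output: str) -> int:
    """Extract the 'Percentage complete' value from arcconf getstatus text."""
    match = re.search(r"Percentage complete\s*:\s*(\d+)", output)
    if match is None:
        raise ValueError("no 'Percentage complete' line found")
    return int(match.group(1))


def estimate_remaining_minutes(t1: datetime, pct1: int,
                               t2: datetime, pct2: int) -> float:
    """Linearly extrapolate the minutes left from two progress readings."""
    elapsed_minutes = (t2 - t1).total_seconds() / 60
    rate = (pct2 - pct1) / elapsed_minutes  # percent per minute
    if rate <= 0:
        raise ValueError("no progress between the two readings")
    return (100 - pct2) / rate


# Status text copied from the 7.03pm update.
sample = """\
Logical device                 : 0
Task ID                        : 101
Current operation              : Rebuild
Status                         : In Progress
Priority                       : Low
Percentage complete            : 18
"""

earlier = datetime(2012, 8, 12, 14, 42)  # 1% reported at 2.42pm
later = datetime(2012, 8, 12, 19, 3)     # 18% reported at 7.03pm
remaining = estimate_remaining_minutes(earlier, 1, later, parse_percent(sample))
print(f"Estimated time remaining: {remaining / 60:.1f} hours")
```

On these two readings the rebuild covered 17% in 261 minutes, so the remaining 82% works out to roughly 21 hours if the rate stays the same. Treat that as a ballpark figure rather than a commitment.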

=============================================

12 August - 2.42pm

Please note that the server is up; this is an informational notice only.

The drive has been replaced and the RAID is now rebuilding.

root@server14 [~]# arcconf getstatus 1
Controllers found: 1
Logical device Task:
Logical device                 : 0
Task ID                        : 101
Current operation              : Rebuild
Status                         : In Progress
Priority                       : Low
Percentage complete            : 1

=============================================

Our routine monitoring has detected a failing drive in the Server 14 array, as shown by its reallocated sector count:

root@server14 [~]# smartctl -ad sat /dev/sg3 | grep -i reallocated_sector
5 Reallocated_Sector_Ct   0x0033   099   099   036    Pre-fail  Always       -       2144

As a precaution, we are replacing this drive. There will be no downtime, but the server load may increase slightly. This is not a concern, as Server 14's load is typically low. If you notice any issues, please open a helpdesk ticket, but rest assured we are tracking this.
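
For readers curious how to read the smartctl line above, here is a minimal sketch that splits it into its standard columns (ID, name, flag, normalized value, worst, threshold, type, updated, when failed, raw value). The class and function names are hypothetical, and the "any non-zero raw count is worth a look" rule is an illustrative assumption, not the exact policy our monitoring applies.

```python
from typing import NamedTuple


class SmartAttribute(NamedTuple):
    attr_id: int
    name: str
    value: int      # normalized value reported by the drive
    worst: int      # lowest normalized value seen so far
    threshold: int  # the drive reports failure once value drops to this
    raw_value: int  # for attribute 5, the count of reallocated sectors


def parse_smart_line(line: str) -> SmartAttribute:
    """Parse one attribute row from the smartctl attribute table."""
    fields = line.split()
    if len(fields) < 10:
        raise ValueError("expected at least 10 columns in a SMART attribute row")
    return SmartAttribute(
        attr_id=int(fields[0]),
        name=fields[1],
        value=int(fields[3]),
        worst=int(fields[4]),
        threshold=int(fields[5]),
        raw_value=int(fields[9]),
    )


# Row copied from the smartctl output in this notice.
line = ("5 Reallocated_Sector_Ct   0x0033   099   099   036    "
        "Pre-fail  Always       -       2144")
attr = parse_smart_line(line)

# Assumed rule for illustration only: flag any reallocated sectors at all.
needs_attention = attr.raw_value > 0
below_threshold = attr.value <= attr.threshold
print(attr.name, attr.raw_value, needs_attention, below_threshold)
```

Here the normalized value (99) is still well above the threshold (36), so the drive has not yet tripped its own failure threshold, but the raw count of 2144 reallocated sectors is what prompted the precautionary replacement.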
