Failed disk Adaptec Raid Repair on a Rackmount server running Linux

If you have a failed disk on your Hardware Raid using Adaptec raid controller on your Linux server you can use these commands to get the RAID info so you can replace the correct disk. Here is a video showing this post in action. https://youtu.be/w6Qu4CUo7gI

First check which RAID card you have

lspci | grep -i raid

04:00.0 RAID bus controller: Adaptec Series 7 6G SAS/PCIe 3 (rev 01)

The lspci command shows the details of your PCI bus, and we are just checking for raid. This shows us we have an Adaptec raid controller.

Now we can use this command to check the configuration of the RAID.

/usr/StorMan/arcconf getconfig 1

The getconfig option with the arcconf command shows us the Raid level, the disks, their status, and their serial numbers. We need the serial numbers to identify which disk to replace. Since this is a rack mount server there is no need to take the server offline, we can simply slide out the drive bay, and replace it with the same size disk. Once replaced the new disk should be labeled with the new serial number. The RAID controller should automatically detect the new disk and start rebuilding the array.

We can also check the current status of the rebuild using this command.

/usr/StorMan/arcconf getstatus 1

Once the rebuild is completed the RAID will be in an optimal state.

If you needed to confirm which disk was bad before replacing it you can check the RAID controllers log to see which disk is causing the issue.

/usr/StorMan/arcconf getlogs 1 device tabular

If this helped you out consider subscribing on youtube, or following me on https://twitch.tv/djrunkie

Leave a Reply

Your email address will not be published. Required fields are marked *