Data Storage Management
RAID arrays
The same configuration is used on RTC and ICC. (AOC has a smaller
/data
array due to reduced storage needs.)
/boot
,/
, andswap
— XFS formatted, replicated as RAID1 on two 500 GB SSDs./data
— XFS formatted, replicated as RAID5 on four 2 TB SSDs (effective capacity: 6 TB).
Restoring failed drives
If a drive has been removed and reinserted, you should check to make sure it’s reassembled after reboot.
Check
/proc/mdstat
for missing drives$ cat /proc/mdstat Personalities : [raid1] [raid6] [raid5] [raid4] md124 : active raid5 sde1[2] sdd1[1] sdc1[0] 5860144128 blocks super 1.2 level 5, 512k chunk, algorithm 2 [4/3] [UUU_] bitmap: 2/15 pages [8KB], 65536KB chunk md125 : active raid1 sda1[0] sdb1[1] 470946816 blocks super 1.2 [2/2] [UU] bitmap: 2/4 pages [8KB], 65536KB chunk md126 : active raid1 sdb3[1] sda3[0] 512000 blocks super 1.2 [2/2] [UU] bitmap: 0/1 pages [0KB], 65536KB chunk md127 : active raid1 sdb2[1] sda2[0] 16776192 blocks super 1.2 [2/2] [UU]
The
[UUU_]
is your hint that one of the four drives in/dev/md124
is missing. Now, which one is it?Check for
/dev/sdXX
devices inlsblk
that aren’t parents of/dev/mdXXX
devices. (Note: Don’t count USB drives that aren’t supposed to be part of a RAID!)For example:
$ lsblk NAME MAJ:MIN RM SIZE RO TYPE MOUNTPOINT sda 8:0 0 465.8G 0 disk ├─sda1 8:1 0 449.3G 0 part │ └─md125 9:125 0 449.1G 0 raid1 / ├─sda2 8:2 0 16G 0 part │ └─md127 9:127 0 16G 0 raid1 [SWAP] └─sda3 8:3 0 501M 0 part └─md126 9:126 0 500M 0 raid1 /boot sdb 8:16 0 465.8G 0 disk ├─sdb1 8:17 0 449.3G 0 part │ └─md125 9:125 0 449.1G 0 raid1 / ├─sdb2 8:18 0 16G 0 part │ └─md127 9:127 0 16G 0 raid1 [SWAP] └─sdb3 8:19 0 501M 0 part └─md126 9:126 0 500M 0 raid1 /boot sdc 8:32 0 1.8T 0 disk └─sdc1 8:33 0 1.8T 0 part └─md124 9:124 0 5.5T 0 raid5 sdd 8:48 0 1.8T 0 disk └─sdd1 8:49 0 1.8T 0 part └─md124 9:124 0 5.5T 0 raid5 sde 8:64 0 1.8T 0 disk └─sde1 8:65 0 1.8T 0 part └─md124 9:124 0 5.5T 0 raid5 sdf 8:80 0 1.8T 0 disk └─sdf1 8:81 0 1.8T 0 part
In this example,
sdf1
did not get reassembled intomd124
and isn’t part of the/data
RAID5 array.Hot-add the dropped device to the array with
mdadm
$ sudo mdadm /dev/md124 --add /dev/sdf1
Repeat steps 1 and 2 and verify all drives are present in the RAID