The Dell OpenManage Linux packages have come a long way and are very useful to have installed on any Dell servers you have installed. In fact OpenManage could be considered critical to monitoring Dell hardware for many reasons but consider a single disk in a RAID 1 mirrored group failing and you don’t have physical access to the box. Unless you are constantly monitoring other logs it would make things much easier to receive an alert from monitoring software such as something like Nagios. This way you can feel comfortable at all times that your Dell hardware is in top shape.
Recently one of my customers did have a disk fail, we were alerted, and the hard drive was replaced within an hour. Once it was replaced by the colo I needed to watch the rebuilding process so I could clear the alert logs once it had completed the rebuild process. Use the commands below to monitor the RAID group rebuild process via the Linux CLI which in this case happens to be CentOS Linux.
Query A Virtual Disk Using omreport:
- [root@dev3 ~]# omreport storage vdisk controller=0
- Virtual Disk 0 on Controller PERC 6/i Integrated (Embedded)
- Controller PERC 6/i Integrated (Embedded)
- ID : 0
- Status : Non-Critical
- Name : Virtual Disk 0
- State : Degraded
- Hot Spare Policy violated : Not Assigned
- Virtual Disk Bad Blocks : Not Applicable
- Secured : Not Applicable
- Progress : Not Applicable
- Layout : RAID-1
- Size : 931.00 GB (999653638144 bytes)
- Device Name : /dev/sda
- Bus Protocol : SAS
- Media : HDD
- Read Policy : No Read Ahead
- Write Policy : Write Back
- Cache Policy : Not Applicable
- Stripe Element Size : 64 KB
- Disk Cache Policy : Disabled
- [root@dev3 ~]#
In the example above the query returns information about the virtual disk located on controller 0 or the PERC 6/i RAID Controller. Then in the example below we query the same PERC 6/i RAID Controller to provide information on each of the physical drives attached to it using omreport. Notice the second disk, with id 0:0:1, has a state of Rebuilding while the first disk, with id 0:0:0, has a state of online.
Query Physical Hard Drives Using omreport:
- [root@dev3 ~]# omreport storage pdisk controller=0
- List of Physical Disks on Controller PERC 6/i Integrated (Embedded)
- Controller PERC 6/i Integrated (Embedded)
- ID : 0:0:0
- Status : Ok
- Power Status : Not Applicable
- Name : Physical Disk 0:0:0
- State : Online
- Failure Predicted : No
- Certified : Not Applicable
- Encryption Capable : No
- Secured : Not Applicable
- Progress : Not Applicable
- Bus Protocol : SAS
- Media : HDD
- Mirror Set ID : Not Applicable
- Capacity : 931.00 GB (999653638144 bytes)
- Used RAID Disk Space : 931.00 GB (999653638144 bytes)
- Available RAID Disk Space : 0.00 GB (0 bytes)
- Hot Spare : No
- Vendor ID : SEAGATE
- Product ID : ST31000424SS
- Revision : 0005
- Serial No. : 9WKXXXXX
- Part Number : Not Available
- Negotiated Speed : Not Available
- Capable Speed : Not Available
- Manufacture Day : Not Available
- Manufacture Week : Not Available
- Manufacture Year : Not Available
- SAS Address : 5000C5002118BE9D
- ID : 0:0:1
- Status : Ok
- Power Status : Not Applicable
- Name : Physical Disk 0:0:1
- State : Rebuilding
- Failure Predicted : No
- Certified : Not Applicable
- Encryption Capable : No
- Secured : Not Applicable
- Progress : 0% complete
- Bus Protocol : SAS
- Media : HDD
- Mirror Set ID : Not Applicable
- Capacity : 931.00 GB (999653638144 bytes)
- Used RAID Disk Space : 931.00 GB (999653638144 bytes)
- Available RAID Disk Space : 0.00 GB (0 bytes)
- Hot Spare : No
- Vendor ID : SEAGATE
- Product ID : ST31000424SS
- Revision : 0005
- Serial No. : 9WKXXXXX
- Part Number : Not Available
- Negotiated Speed : Not Available
- Capable Speed : Not Available
- Manufacture Day : Not Available
- Manufacture Week : Not Available
- Manufacture Year : Not Available
- SAS Address : 5000C5002118FD85
- [root@dev3 ~]#
Once the disk is done rebuilding I will clear the alert logs using the omconfig command and the RAID group will be back in tip top shape.