Re: Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- From: Grzegorz Witkowski <geslinux@xxxxxxxxx>
- Date: Tue, 13 Mar 2012 21:08:36 +0000
On Tue, Mar 13, 2012 at 1:36 PM, unix syzadmin <unixsyzadmin@xxxxxxxxx>wrote:
Thanks.
I have downloaded and installed the OpenManage from Dell.
The following commands say if the health of system components is OK.
omreport chassis - health of all main components of the system chassis
omreport chassis processors - cpu health
omreport chassis memory - memory health
omreport chassis pwrsupplies - power supply health
omreport storage controller - raid controller health
However this leaves out the integrated NIC ports and the HBA adapters.
What linux / dell open manage commands can be used to confirm if those are
healthy as well?
Thanks,
On Mon, Mar 12, 2012 at 9:00 PM, Paul Tader <ptader@xxxxxxxxxxxxxx> wrote:
On 3/12/12 5:28 PM, unix syzadmin wrote:on
Hi,
We run redhat linux on intel hardware (mostly Dell, lately dell R710s).
We want to be able to catch any hardware issues when they occur to act
http://linux.dell.com/wiki/index.php/Repository/OMSA>them as quickly as possible.
My understanding is that all hardware events/issues/errors are logged in
/var/log/mcelog (Machine Check Events log). Is this correct? Can't
stress
this enough; does it log all hardware issues
(cpu,memory,disk,ethernet,**fibre/hba etc) ?
Thanks,
I've used MCElog to catch some CPU events but I think you might want to
check out Dell's OpenManage client. It will report/monitor a lot more
information.
http://linux.dell.com/wiki/**index.php/Repository/OMSA<
http://linux.dell.com/repo/**hardware/latest/bootstrap.cgi<
To install:
# wget -q -O -
http://linux.dell.com/repo/hardware/latest/bootstrap.cgi>| bash
# yum install srvadmin-baseredhat-list-request@xxxxxxxxxx>
# yum install srvadmin-storageservices
(logout / login for environment variables to take effect)
# /opt/dell/srvadmin/sbin/**srvadmin-services.sh start
...
# omreport chassis
Health
Main System Chassis
SEVERITY : COMPONENT
Ok : Fans
Ok : Intrusion
Ok : Memory
Ok : Power Supplies
Ok : Processors
Ok : Temperatures
Ok : Voltages
Ok : Hardware Log
Ok : Batteries
# omreport chassis temps
Temperature Probes Information
------------------------------**------
Main System Chassis Temperatures: Ok
------------------------------**------
Index : 0
Status : Ok
Probe Name : System Board Ambient Temp
Reading : 20.0 C
Minimum Warning Threshold : 8.0 C
Maximum Warning Threshold : 42.0 C
Minimum Failure Threshold : 3.0 C
Maximum Failure Threshold : 47.0 C
# omreport storage pdisk controller=0
List of Physical Disks on Controller SAS 6/iR Integrated (Embedded)
Controller SAS 6/iR Integrated (Embedded)
ID : 0:0:0
Status : Ok
Name : Physical Disk 0:0:0
State : Online
Failure Predicted : No
Certified : Not Applicable
Encryption Capable : No
Secured : Not Applicable
Progress : Not Applicable
Bus Protocol : SAS
Media : HDD
Capacity : 67.75 GB (72746008576 bytes)
Used RAID Disk Space : 67.75 GB (72746008576 bytes)
Available RAID Disk Space : 0.00 GB (0 bytes)
Hot Spare : No
Vendor ID : DELL
Product ID : ST973402SS
Revision : S229
<snip>
You get the idea.
--
redhat-list mailing list
unsubscribe mailto:redhat-list-request@**redhat.com<
?subject=unsubscribehttps://www.redhat.com/mailman/listinfo/redhat-list>
https://www.redhat.com/**mailman/listinfo/redhat-list<
--
redhat-list mailing list
unsubscribe mailto:redhat-list-request@xxxxxxxxxx?subject=unsubscribe
https://www.redhat.com/mailman/listinfo/redhat-list
Have you installed all OMSA components?
See the CLI User's Guide
http://support.dell.com/support/edocs/software/svradmin/7.0/en/index.htm
http://support.dell.com/support/edocs/software/svradmin/7.0/en/CLI/PDF/CLIUG.pdf
Have you tried to open 1311 port and connect to the https://<server>:1311
to check if the OMSA is showing everything?
You can try:
omreport system summary
or
omreport servermodule summary
or
omreport chassis nics index=n
or
omreport mainsystem nics index=n
or
omreport chassis nics config=team index=n
or
omreport mainsystem nics config=team index=n
Regards,
Ges
--
redhat-list mailing list
unsubscribe mailto:redhat-list-request@xxxxxxxxxx?subject=unsubscribe
https://www.redhat.com/mailman/listinfo/redhat-list
- References:
- Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- From: unix syzadmin
- Re: Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- From: Paul Tader
- Re: Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- From: unix syzadmin
- Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- Prev by Date: Re: Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- Next by Date: Re: Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- Previous by thread: Re: Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- Next by thread: Re: Does redhat linux log all hardware events/issues/error in /var/log/mcelog?
- Index(es):
Relevant Pages
|