HP Equipment Maintenance Training: Daily Check Commands
China Hewlett-Packard Co., Ltd., Support Services Division
QIAN Yun, 2004.4

Daily maintenance check items

System logs: syslog.log, ccerrlog, dmesg
System running status: cmviewcl, bdf, ioscan, vgdisplay, top, sar, swapinfo, netstat
Disk array status: armdsp -a (VA), arraydsp -a (AutoRAID), amdsp -a (FC60)

/var/adm/syslog/syslog.log

What to check:
Any SCSI Reset warnings (SCSI reset messages logged around system boot can be ignored).
Any EMS alerts, for example:

    Aug 12 09:43:05 bj_rz3 EMS [2286]: ------ EMS Event Notification ------
    Value: "SERIOUS (4)" for Resource: "/system/events/core_hw/core_hw"
    (Threshold: >= " 3")
    Execute the following command to obtain event details:
    /opt/resmon/bin/resdata -R 149815298 -r /system/events/core_hw/core_hw -n 149815299 -a

Every alert whose Value is Major Warning, Serious, or Critical deserves attention.
Any "PV Powerfail" or "IO error" messages.
Any LPMC messages.
If the system is going to be rebooted, save the current syslog first. syslog.log holds the messages logged since the last reboot; after a reboot it is automatically saved as OLDsyslog.log.

ccerrlog

    283 PM 0 *6 08/17/2003 19:16:58
    Log Entry 283: 08/17/2003 19:16:58
    Alert Level 6: System could fail - attention required;
    Keyword: Bulk power supply (BPS) 2 failed; Status: 15
    Logged by power monitor 0 during monitoring of low voltage power supply
    0x0020016a4402404f 0x0000000000000000
    0x5820096a4402404f 0x000067071113103a

Run cclogview /var/stm/logs/os/ccerrlog.
The alert log kept in the GSP/MP can also be checked over telnet.
Look for new entries with an Alert Level of 2 or higher.

dmesg

    $Revision: vmunix: vw: -proj selectors: CUPI80_BL2000_1108 -c Vw for CUPI80_BL2000_1108 build - cupi80_bl2000_1108 CUPI80_BL2000_1108 Wed Nov 8 19:24:56 PST 2000 $
    Memory Information:
        physical page size = 4096 bytes, logical page size = 4096 bytes
        Physical: 4177920 Kbytes, lockable: 3859368 Kbytes, available: 3859944 Kbytes
    Using 3162 buffers containing 24576 Kbytes of memory.

dmesg prints the recent system messages that are held in memory.
Common abnormal messages:
    SCSI Reset Detected
    LPMC I-Cache error
    File System Full
When any of these appears, look up the corresponding entries in syslog.log promptly.

cmviewcl

    CLUSTER      STATUS
    hpcluster    up

      NODE       STATUS   STATE     GMS_STATE
      bjscp1a    up       running   halted

        Network_Parameters:
        INTERFACE   STATUS   PATH       NAME
        PRIMARY     up       0/5/0/0    lan1
        PRIMARY     up       0/0/0/0    lan0
        STANDBY     up       1/12/0/0   lan2

        PACKAGE     STATUS   STATE      AUTO_RUN   NODE
        scppkg      up       running    enabled    bjscp1a

      NODE       STATUS   STATE     GMS_STATE
      bjscp1b    up       running   halted

        Network_Parameters:
        INTERFACE   STATUS   PATH       NAME
        PRIMARY     up       0/5/0/0    lan1
        STANDBY     up       1/12/0/0   lan2
        PRIMARY     up       0/0/0/0    lan0

Check the state of the two-node cluster: run cmviewcl -v and confirm that STATUS is up and STATE is running, and that the package auto-switch attribute (AUTO_RUN) is enabled.

bdf

    Filesystem          kbytes    used     avail    %used  Mounted on
    /dev/vg00/lvol3     204800    48168    155424    24%   /
    /dev/vg00/lvol1     295024    38856    226664    15%   /stand
    /dev/vg00/lvol8     4706304   1523976  3157592   33%   /var
    /dev/vg00/lvol7     1163264   708304   451464    61%   /usr
    /dev/vg00/lvol4     204800    96408    107568    47%   /tmp
    /dev/vg00/lvol6     1048576   766024   280360    73%   /opt
    /dev/vg00/lvol5     1048576   4456     1036024    0%   /home

Check file system usage: look for any file system that is more than 90% full.

ioscan -fn

    Class     I  H/W Path     Driver  S/W State  H/W Type   Description
    ===================================================================
    root      0               root    CLAIMED    BUS_NEXUS
    ioa       0  0            sba     CLAIMED    BUS_NEXUS  System Bus Adapter (803)
    ba        0  0/0          lba     CLAIMED    BUS_NEXUS  Local PCI Bus Adapter (782)
    lan       0  0/0/0/0      btlan3  CLAIMED    INTERFACE  HP PCI 10/100Base-TX Core
                              /dev/diag/lan0  /dev/ether0
    ext_bus   0  0/0/1/0      c720    CLAIMED    INTERFACE  SCSI C895 Ultra Wide Single-Ended
    target    0  0/0/1/0.1    tgt     CLAIMED    DEVICE
    disk      0  0/0/1/0.1.0  sdisk   NO_HW      DEVICE     HP DVD-ROM 305
                              /dev/dsk/c0t1d0  /dev/rdsk/c0t1d0

Check that the I/O devices are healthy: look for any device whose S/W State is NO_HW.

vgdisplay

    --- Volume groups ---
    VG Name                /dev/vg00
    VG Write Access        read/write
    VG Status              available
    Max LV                 255

    --- Logical volumes ---
    LV Name                /dev/vg00/lvol1
    LV Status              available/syncd
    LV Size (Mbytes)       100
    Current LE             25
    Allocated PE           50
    Used PV                2

    --- Physical volumes ---
    PV Name                /dev/dsk/c4t0d0
    PV Name                /dev/dsk/c6t0d0 Alternate Link
    PV Status              available
    Total PE               12992
    Free PE                0
    Autoswitch             Off

Displays volume group status; pay particular attention to vg00. Run vgdisplay -v vg00 and confirm that every status value reads available/syncd and none reads stale.

top

    CPU  LOAD   USER   NICE    SYS    IDLE  BLOCK  SWAIT   INTR   SSYS
     0   0.28  20.2%   0.0%   2.6%   77.2%   0.0%   0.0%   0.0%   0.0%
     1   0.17  14.6%   0.0%   3.4%   82.0%   0.0%   0.0%   0.0%   0.0%
     2   0.33  18.6%   0.0%   3.0%   78.4%   0.0%   0.0%   0.0%   0.0%
     3   0.20  13.0%   0.0%   4.2%   82.8%   0.0%   0.0%   0.0%   0.0%
     4   0.11  14.4%   0.0%   2.0%   83.6%   0.0%   0.0%   0.0%   0.0%
     5   0.44  19.8%   0.0%   4.2%   76.0%   0.0%   0.0%   0.0%   0.0%
     6   0.28  13.2%   0.0%  11.2%   75.6%   0.0%   0.0%   0.0%   0.0%
     7   0.17  14.8%   0.0%   1.8%   83.4%   0.0%   0.0%   0.0%   0.0%
    ---
    avg  0.25   0.0%   0.0%   0.0%  100.0%   0.0%   0.0%   0.0%   0.0%

    Memory: 1106604K (999800K) real, 1527608K (1362680K) virtual, 1987924K free  Page# 1/6

    CPU TTY   PID USERNAME PRI NI   SIZE    RES STATE    TIME %WCPU  %CPU COMMAND
     2   ?  18777 informix 156 20  7404K  5052K sleep 9233:02 30.49 30.43 oninit
     6   ?  19002 tellin   154 20 29248K 22572K sleep 5256:03 17.05 17.02 manager
     0   ?  18779 informix 156 20  7404K  4784K sleep 1681:27  9.62  9.60 oninit

Observe CPU and memory usage: look for any process that consumes an excessive share of CPU, and check that free memory is sufficient.

sar -u

    10:02:18     cpu    %usr    %sys    %wio   %idle
    10:02:21       0      37       2       1      60
                   1      18       5       1      75
                   2      15      10       2      72
                   3       9       4       2      85
                   4      21       3       1      75
                   5      23       2       4      70
                   6      10       4       3      83
                   7      15       5       1      79
              system      19       5       2      75

Observe CPU usage with sar -u -M 3 10. Check that %idle is sufficient (as a rule it should not fall below 25%).

sar -v

    HP-UX bjscp1a B.11.00 U 9000/800 07/07/03

    10:02:48 text-sz  ov  proc-sz  ov  inod-sz    ov  file-sz     ov
    10:02:51   N/A   N/A  189/664   0  2119/7360   0  1127/12018   0
    10:02:54   N/A   N/A  188/664   0  2102/7360   0  1121/12018   0
    10:02:57   N/A   N/A  187/664   0  2067/7360   0  1114/12018   0
    10:03:00   N/A   N/A  187/664   0  2037/7360   0  1108/12018   0
    10:03:03   N/A   N/A  187/664   0  2033/7360   0  1108/12018   0
    10:03:06   N/A   N/A  187/664   0  2036/7360   0  1108/12018   0
    10:03:09   N/A   N/A  187/664   0  2033/7360   0  1108/12018   0
    10:03:12   N/A   N/A  188/664   0  2032/7360   0  1113/12018   0
    10:03:15   N/A   N/A  187/664   0  2032/736
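Three of the checks above are simple threshold or keyword tests: bdf usage above 90%, an ioscan S/W State of NO_HW, and sar %idle below 25%. They could be applied mechanically to captured command output. The following is a minimal sketch in Python 3, not part of the original training material. It assumes the output of bdf, ioscan -fn and sar -u -M has been saved as text and is examined on any machine with Python available. The function names (full_filesystems, no_hw_devices, low_idle_rows) are hypothetical, and the parsing relies only on the column layouts shown in the samples above.

```python
"""Threshold checks on captured bdf, ioscan -fn and sar -u -M output (sketch)."""


def full_filesystems(bdf_text, limit=90):
    """Return (mount_point, percent_used) for filesystems above `limit` percent."""
    hits = []
    for line in bdf_text.splitlines():
        tokens = line.split()
        for i, tok in enumerate(tokens[:-1]):
            # The %used column is the only token shaped like "73%";
            # the mount point is the token that follows it.
            if tok.endswith("%") and tok[:-1].isdigit():
                if int(tok[:-1]) > limit:
                    hits.append((tokens[i + 1], int(tok[:-1])))
                break
    return hits


def no_hw_devices(ioscan_text):
    """Return (class, hw_path) for ioscan rows whose S/W State is NO_HW."""
    hits = []
    for line in ioscan_text.splitlines():
        tokens = line.split()
        # Device rows read: Class, I, H/W Path, Driver, S/W State, H/W Type, ...
        if len(tokens) >= 5 and tokens[4] == "NO_HW":
            hits.append((tokens[0], tokens[2]))
    return hits


def low_idle_rows(sar_text, floor=25):
    """Return (cpu_label, idle) for sar -u rows whose %idle is below `floor`."""
    hits = []
    for line in sar_text.splitlines():
        tokens = line.split()
        # Data rows end with four integers: %usr %sys %wio %idle.
        if len(tokens) >= 5 and all(t.isdigit() for t in tokens[-4:]):
            idle = int(tokens[-1])
            if idle < floor:
                hits.append((tokens[-5], idle))
    return hits


if __name__ == "__main__":
    # Rows copied from the ioscan -fn sample in this document.
    ioscan_sample = (
        "target 0 0/0/1/0.1 tgt CLAIMED DEVICE\n"
        "disk 0 0/0/1/0.1.0 sdisk NO_HW DEVICE HP DVD-ROM 305\n"
    )
    print(no_hw_devices(ioscan_sample))
```

The comparisons follow the wording of the checks: a file system is reported only when usage is strictly greater than the limit, and a CPU row only when %idle is strictly below the floor. Both limits are parameters, so a site with different thresholds can pass its own values.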