Redback设备日常维护指南县局版.docx
《Redback设备日常维护指南县局版.docx》由会员分享,可在线阅读,更多相关《Redback设备日常维护指南县局版.docx(19页珍藏版)》请在冰豆网上搜索。
Redback设备日常维护指南县局版
Redback设备日常维护指南
设备类型
SE800,
报告人
李华
设备用户
河北联通
报告日期
2010-8-15
报告类型
指南
版本
Ver1.0
目录
Redback设备日常维护指南1
检查SE800防尘网2
检查SE800当前系统告警及状况2
检查设备运行环境(设备温度、电压)3
检查系统资源占用情况(CPU,内存)5
检查设备板块和端口状态8
检查网络连通性和端口流量8
检查设备告警信息和日志11
检查在线用户数量11
检查RADIUSServer状态11
检查IPPool使用情况12
用户报障后的初步处理13
用户反馈意见:
14
检查SE800防尘网
登录系统
各地市目前对县局维护BAS的权限有各自的规划;因此各县的维护人员如许访问bas,请与市局网管确认是否允许进行相关的操作;目前各地都对BAS做了访问控制,需要通过相应的跳板机进行转登;请各县维护人员联系市局,确认是否有权限登录设备
串口登录设备:
设备上线时,都随机配置了专用的九针串口线(一头为公头;一头为母头)和网线;一般我们工程师都会把两根线缆放在设备机架内。
使用带有九针调试串口的PC连接到BAS主控卡主用板卡的
Console2口;使用超级终端软件;波特率9600登录即可。
检查SE800当前系统告警及状况
命令:
showsystemalarmall
[local]SE800_ZH#shsystemalall
TimestampTypeSourceSeverityDescription
---------------------------------------------------------------------------
Nov210:
05:
45ether-12-port4/8MajorLinkdown
Aug502:
58:
10ether-12-port4/9MajorLinkdown
Nov210:
19:
31ether-12-port4/10MajorLinkdown
Nov210:
19:
35ether-12-port5/4MajorLinkdown
Aug502:
58:
10ether-12-port5/9MajorLinkdown
Aug502:
58:
10ether-12-port13/3MajorLinkdown
Aug502:
58:
10ether-12-port13/4MajorLinkdown
象上面例子通常说明:
端口已noshutdown,但实际未插网线或链路不正常
命令:
showsystemstatus
[local]SE800_SB#shsysstat
SystemStatus:
OK
若系统有问题,这里会显示哪个进程有问题
命令:
showbackplane-status
[local]SE800_MLZ#showbackplane-status
Slot1backplanehasnoproblemsreported.
Slot2backplanehasnoproblemsreported.
Slot3backplanehasnoproblemsreported.
Slot4backplanehasnoproblemsreported.
Slot5backplanehasnoproblemsreported.
Slot6backplanehasnoproblemsreported.
Slot9backplanehasnoproblemsreported.
Slot10backplanehasnoproblemsreported.
Slot11backplanehasnoproblemsreported.
Slot12backplanehasnoproblemsreported.
Slot13backplanehasnoproblemsreported.
Slot14backplanehasnoproblemsreported.
检查设备运行环境(设备温度、电压)
SE800
概要查看:
命令:
showhardware
[local]SE800_MLZ#shhard
FanTrayStatusPresent
Fan(s)StatusNormal
PowerSupplyAStatusNormal
PowerSupplyBStatusNormal
ActiveAlarmsNONE
SlotTypeSerialNoRevVerMfgDateVoltageTemp
----------------------------------------------------------------------
N/Abackplane9C0650103000656229-JAN-2003N/AN/A
N/Afantray9D0340701005863229-JAN-2003N/AN/A
1gigaether-4-port8K0450903000144420-SEP-2003OKNORMAL
2ether-12-port7U0450903000144411-SEP-2003OKNORMAL
3ether-12-port7U0450903000294412-SEP-2003OKNORMAL
4ether-12-port7U0450903000094411-SEP-2003OKNORMAL
6gigaether-4-port8K50506050315150412-JUL-2005OKNORMAL
7xcrp36Y0350903001823409-SEP-2003N/ANORMAL
8xcrp36Y0350903001903410-SEP-2003N/ANORMAL
9atm-oc3-4-port9X0150803000751402-SEP-2003OKNORMAL
此处可显示当前设备各部件的电压,温度是否在正常范围内,除此之外,你也可以了解电源和风扇的状态。
若想查看细节
命令:
showhardwaredetail
Slot:
1Type:
gigaether-4-port
SerialNo:
8K045090300014HardwareRev:
4
EEPROMid/ver:
0x5a/4MfgDate:
20-SEP-2003
SysFpgarev:
0x7SysFpgafilerev:
N/A
LimFpgarev:
0x5LimFpgafilerev:
0x5
FlipFpgarev:
0xcFlipFpgafilerev:
0xc
IPPAmemory:
256MBEPPAmemory:
256MB
Voltage1.5V:
1.516(+1%)Voltage1.8V:
1.780(-1%)
Voltage2.6V:
2.637(+0%)Voltage3.3V:
3.389(-0%)
Temperature:
NORMAL(36C)
CardStatus:
HWinitializedPODStatus:
Success
ODDStatus:
NotAvailable
FailLED:
OffActiveLED:
On
StandbyLED:
N/A
ChassEntitlement:
SE400/SE800
PortsEntitled:
All
ActiveAlarms:
NONE
这里是显示设备每个模块的细节信息,你能看到具体的电压,温度,和是否存在告警等信息。
一般来说,温度是最重要的日常维护参数,以上是关于命令显示不同温度信息的说明。
状态值
温度状态描述
COLD
对业务卡(trafficcard)而言,是指卡温度低于20℃,
对控制卡(controllercard)而言,是指卡温度低于30℃
EXTREME
对上述两类卡而言,都表明卡温度在80℃以上,这时该模块已被系统disable
HOT
对业务卡(trafficcard)而言,是指卡温度在70℃到80℃之间
对控制卡(controllercard)而言,是指卡温度在54℃到80℃之间
N/A
温度检测对这个单元不适用,这多出现在板卡未被初始化的情况下
NORMAL
对业务卡(trafficcard)而言,是指卡温度在20℃到70℃之间
对控制卡(controllercard)而言,是指卡温度在30℃到54℃之间
当温度到达60℃以上时,就应该引起注意,用同样命令查看风扇运行状态,到现场清洗或更换防尘网。
检查系统资源占用情况(CPU,内存)
命令:
showprocesscpu
[local]SE800_MLZ#shprocesscpu
TotalsystemCPU%usage(5s,1m,5m):
10.40,3.81,0.93
Proc/threadname:
5sec1min5minProc/threadname:
5sec1min5min
----------------------------------------------------------------------
less:
0.000.000.00exec_cli:
0.340.000.00
sshd:
0.000.000.00exec_cli:
0.000.000.00
sshd:
0.000.000.00netopd:
0.930.050.00
dot1qd:
0.000.000.00atmd:
0.000.000.00
snmpd:
0.100.000.00qosd:
0.000.000.00
staticd:
0.000.000.00bgpd:
0.050.000.00
dnsd:
0.000.000.00clipsd:
0.000.000.00
l2tpd:
0.000.000.00pppoed:
1.510.150.05
aaad:
1.270.050.00pppd:
0.050.000.00
statd:
0.390.490.00dhelperd:
0.000.000.00
oddd:
0.000.000.00lm:
0.000.000.00
dhcpd:
0.000.000.00pemd:
0.000.000.00
dlmd:
0.000.000.00clsd:
0.000.000.00
ppaslogd:
0.000.000.00sysmond:
0.000.000.00
arpd:
0.000.000.00ribd:
0.200.000.00
rpmd:
0.000.000.00ped_parse:
0.000.000.00
ism2:
0.730.050.00rcm:
0.590.000.00
csm:
0.000.000.00pm:
0.340.000.00
syslogd:
0.000.000.00inetd:
0.000.000.00
mount_udrv:
0.000.000.00loggd:
0.000.000.00
mount_mfs:
0.000.000.00pdtstat_thread:
0.000.000.00
reboot_thread:
0.000.000.00evnt_th:
0.000.000.00
sccmem_cleanup:
0.000.000.00ioflush:
2.730.290.10
reaper:
0.000.000.00pagedaemon:
0.000.000.00
init:
0.000.000.00
而在SE800,你可以用showprocess检查各进程占用系统资源状态和运行状态,(注意:
当STATE栏出现Stop或Halt时,一般意味有问题;而UP/DOWN栏,若有进程运行时间比别的短的多,可能意味进程被系统或人为重启过,此时应用命令showcrashfiles检查是否有dump文件产生,并和我们联系。
[local]SE800_MLZ#shprocess
LoadAverage:
1.471.511.43
NAMEPIDSPAWNMEMORYTIME%CPUSTATEUP/DOWN
csm2916608K06:
55:
55.720.00%run17w1d
rcm30124212K02:
50:
40.360.05%run17w1d
ism31130856K11:
22:
05.840.10%run17w1d
ped_parse3213572K00:
54:
03.490.00%run17w1d
rpm3313200K00:
49:
21.400.00%run17w1d
rib3416952K05:
04:
49.030.00%run17w1d
ntp000KNotAvail0.00%demand17w1d
arp3513564K02:
04:
06.880.00%run17w1d
static5413180K00:
51:
14.970.00%run17w1d
isis000KNotAvail0.00%demand17w1d
rip000KNotAvail0.00%demand17w1d
bgp5314780K03:
39:
52.960.00%run17w1d
igmp000KNotAvail0.00%demand17w1d
pim000KNotAvail0.00%demand17w1d
ospf000KNotAvail0.00%demand17w1d
sysmon3614020K01:
11:
36.070.00%run17w1d
dns5213168K00:
45:
55.950.00%run17w1d
msdp000KNotAvail0.00%demand17w1d
ppaslog3713124K00:
54:
18.360.00%run17w1d
vrrp000KNotAvail0.00%demand17w1d
cls3818696K01:
04:
04.010.00%run17w1d
dot1q5813684K00:
58:
50.600.00%run17w1d
tunnel000KNotAvail0.00%demand17w1d
qos5514512K03:
34:
59.000.05%run17w1d
dlm3916552K03:
47:
39.090.00%run17w1d
pem4013264K00:
46:
56.500.00%run17w1d
dhcp4115640K01:
10:
09.310.00%run17w1d
fr000KNotAvail0.00%demand17w1d
rsvp000KNotAvail0.00%demand17w1d
mpls_static000KNotAvail0.00%demand17w1d
bridge000KNotAvail0.00%demand17w1d
atm5719388K02:
02:
04.810.00%run17w1d
lm4213804K01:
03:
23.710.00%run17w1d
nat000KNotAvail0.00%demand17w1d
odd4316352K00:
50:
08.400.00%run17w1d
dhelperd4413360K00:
54:
51.950.00%run17w1d
snmp56112072K01:
54:
57.730.00%run17w1d
stats45121672K19:
09:
37.161.27%run17w1d
ppp4617684K05:
00:
31.210.00%run17w1d
xcd000KNotAvail0.00%demand17w1d
lg000KNotAvail0.00%demand17w1d
ldp000KNotAvail0.00%demand17w1d
netopd59114696K04:
27:
26.730.00%run17w1d
aaad47124816K07:
47:
54.180.15%run17w1d
pppoe4816500K03:
15:
55.570.00%run17w1d
l2tp4914176K06:
24:
21.800.00%run17w1d
clips5015932K00:
53:
45.770.00%run17w1d
hr000KNotAvail0.00%demand17w1d
对于CPU使用率而言,通常情况下,若没有使能动态路由协议,MPLS,组播等功能,普遍低于10%,当进程被重启后的一分钟内,可能会高出一些。
注意观察异常的CPU增高。
命令:
showmemory*检查内存使用情况*
[local]SE800_MLZ#shmem
Memory:
Total926516k,Used174000k,Free720164k,Reserved24k
[local]sdl-10k#showmemory
THUDEC0121:
04:
172005
Currentmemorywatchthresholdis80%
FreeBytesBytesinUseBlocksInUseCumul.Blocks%Thresh
--------------------------------------------------------
SM3280,573,360193,680,32056,580686,675,13840%
CM0334,897,45625,357,6881,5981,6017%
CM1335,605,62424,649,5201,4881,4916%
CM6326,037,71234,217,4322,2622,2889%
CM7330,039,95230,215,1921,9261,9448%
CM8333,067,44027,187,7041,6011,6137%
CM9324,679,18435,575,9602,4502,4849%
检查设备板块和端口状态
检查设备板块运行状态,使用同样的命令:
showhardwaredetail
检查端口状态(SE800)
端口UP/Down和是否配置的状态,使用命令
showportall
端口链路层参数:
(SE800)
showportdetail(若单独检查某个端口使用showport2/1detail)
检查网络连通性和端口流量
检查网路流通性,通常先检查端口状态,确定物理连通和链路正常,再用Ping去测试到特定地址的连通性,时延。
检查端口流量使用命令(SE800)
showportcounter2/1
showportcounter2/1detail,
在SE800detail中会显示链路层错误,这会影响用户上网速率,导致时常掉线等故障。
为了在查看SE800ATM端口的链路层质量,需使用如下命令:
showportperf-monitor9/1detail
[local]SE800_MLZ#shportperf9/1det
atm9/1stateisUp
Description:
Alcatel7300_MaLanZi-C
Linestate:
Up
Adminstate:
Up
Mediatype:
SonetOC3(SM)
Encapsulation:
atm
NASPortType:
Loopback:
none
Framing:
SDH
Speed:
155.52Mbps
Bandwidth:
149.76Mbps
TxC2byte:
0x13
RxC2byte:
0x13
LineSFBER:
10E-4
LineSDBER:
10E-7
ATMMTUsize:
65527Bytes
MTUsize:
4470Bytes
ATMPayloadScramble:
ON
OverSubscriptionRate:
Unlimited
MACaddress:
00:
30:
88:
00:
58:
1d
ClockSource:
card-reference
CCODMode:
default
ActiveAlarms:
NONE
PathAlarms:
NONE
PathTraceLength:
16(15+1framing)
TxPathTrace:
bc5265644261636b0000000000000000.RedBack........
bc5265644261636b0000000000000000.RedBack........
bc5265644261636b0000000000000000.RedBack........
bc5265644261636b0000000000000000.RedBack........
RxPathTrace:
00000000000000000000000000