1、S1断链告警处理案例S1断链告警处理指导1.故障现象描述告警管理中查瞧到基站上报“S1断链告警(198094830)”告警码,如下图所示:2.故障分析排查思路根据TD-LTE得网络接口协议,S1链路就是建立在物理传输层、数据链路层、IP协议层、SCTP偶联链路之上得传输协议层,如下图所示:所以处理S1链路故障,需要从底层开始排查:1、首先排查站点就是否存在传输告警,排除传输故障;2、其次基站IP地址配置就是否正常;3、再次确认SCTP偶联断告警,排除SCTP偶联告警;4、最后排查就是否存在S1 AP建立失败(协商失败或基站无小区),与核心网核对小区TAC值就是否配置一致。3.故障排查步骤1、查
2、瞧基站告警,就是否存在传输类相关告警,例如“网元断链告警”、“SCTP偶联断链”告警,若存在以上告警,需要先按照以上告警排查指导手册,先解决以上告警。2、检查ENODEB-MME或SGW 路由IP地址就是否配置正确;通过telnet命令登录到CC板,使用 BRS命令对MME及SGW地址进行PING包测试,详细登录方式如下,红色字体均需要输入:通过服务器远程登录:bash-3、2$ telnet 10、30、143、201前台通过网线直连登录地址:192、254、1、16正在尝试、连接到 10、30、143、201(192、254、1、16)(none) login: zte(用户名)Passw
3、ord: zte(密码)Processing /etc/profile、 Done# /ushell- Please input password!-*(密码zte)- Login success! ushell tool menu: - ps or PS list process run on the board pr xxx or PR xxx take over xxx process printf info npr xxx or NPR xxx not take over xxx process printf info db xxx or DB xxx debug xxx proces
4、s printf info ndb xxx or NDB xxx not debug xxx process printf info pad xxx or PAD xxx debug and take over xxx process printf info npad xxx or NPAD xxxnot debug and take over xxx process printf info pall or PALL display current debug and take over info ncheck or NCHECK Do not check another ushell exi
5、st check or CHECK Do check another ushell exist Q or q cancel all process debug and printf info exit or EXIT cancel ushell xxx is process id you want to debug or take over printf info - $ps(查瞧前台进程) PID USER VSZ STAT MAND 1 root 1304 S init 2 root 0 SW softirq-high/0 3 root 0 SW softirq-timer/0 4 roo
6、t 0 SW softirq-net-tx/ 5 root 0 SW softirq-net-rx/ 6 root 0 SW softirq-block/0 7 root 0 SW softirq-tasklet 8 root 0 SW softirq-sched/0 9 root 0 SW softirq-hrtimer 10 root 0 SW softirq-rcu/0 11 root 0 SW watchdog/0 12 root 0 DW chkeventd/0 13 root 0 SW events/0 14 root 0 SW rt_events/0 15 root 0 SW k
7、helper 16 root 0 SW kthread 17 root 0 SW rt_kthread 37 root 0 SW kblockd/0 42 root 0 SW khubd 83 root 0 SW pdflush 84 root 0 SW pdflush 85 root 0 SW kswapd0 86 root 0 SWreply from 200、1、10、200 packetsize=36 time=14ms、正常ping通时返回得时长678send ping seq: 2、678PING=reply from 200、1、10、200 packetsize=36 time
8、=4ms、678send ping seq: 3、678PING=reply from 200、1、10、200 packetsize=36 time=3ms、678send ping seq: 4、678PING=reply from 200、1、10、200 packetsize=36 time=24ms、678Ping statistics for 200、1、10、200: Packets: Sent = 4, Received = 4, Lost = 0(0% loss),Approximate round trip times in milli-seconds:Minimum =
9、3ms, Maximum = 24ms, Average = 11ms(ping核心网MME控制面200、1、10、200地址结果,丢包率0%,证明基站到MME链路正常。)brsping 200、1、30、20 (ping核心网SGW地址)678 begin to excel fun:brsping value = 0(0x0) end to excel fun:brsping send ping seq: 1、$678PING=reply from 200、1、30、20 packetsize=36 time reply from 200、1、30、20 packetsize=36 time
10、 reply from 200、1、30、20 packetsize=36 time=1ms、678send ping seq: 4、678PING=reply from 200、1、30、20 packetsize=36 time 1ms、678Ping statistics for 200、1、30、20: Packets: Sent = 4, Received = 4, Lost = 0(0% loss),Approximate round trip times in milli-seconds: Minimum = 0ms, Maximum = 1ms, Average = 0ms(p
11、ing核心网SGW用户面200、1、30、20地址结果,丢包率0%,证明基站到SGW链路正常。)通过以上步骤,排查基站到EPC得控制面MME与用户面SGW链路均正常。3、Pad 到平台进程, showtcb 查瞧偶联状态,继续在平台进程中输入“showtcb”命令查瞧,偶联状态就是否正常,若偶联异常,按照偶联断链告警指导手册处理。 $showtcb678 begin to excel fun:showtcb =Begin:Show Assoc TCB Info=TCB info 0:偶联号0ULPID = 0, AssoID = 0, Checksum = 1, InstanceID = 0
12、LocalPort = 6051, SourIP = 100、64、20、108, VpnId = 31PeerPort = 6051, DestIP = 200、1、10、200, VpnId = 31Association State = established(此处显示偶联状态,established标示偶联正常)CulTsnAcked = 2597222824, NextTsnAssign = 2597222825, LastRecvTSN = 1479945961OutStandingSize = 0, PendingChkNum = 261888, MtuSize = 1500Tx
13、ReChkNum = 0TxStrmNum = 2, RxStrmNum = 2PeerVerifTag = 1479945957, MyVerifTag = 2597222817TCB info 11:偶联号11ULPID = 11, AssoID = 11, Checksum = 0, InstanceID = 11 LocalPort = 36422, SourIP = 100、64、20、108, VpnId = 31PeerPort = 36422, DestIP = 100、64、20、109, VpnId = 31Association State = established(此
14、处显示偶联状态,established标示偶联正常)CulTsnAcked = 1926134201, NextTsnAssign = 1926134202, LastRecvTSN = 866462450OutStandingSize = 0, PendingChkNum = 261888, MtuSize = 1500TxReChkNum = 0TxStrmNum = 2, RxStrmNum = 2PeerVerifTag = 866462422, MyVerifTag = 1926134177TCB info 12:偶连号12ULPID = 12, AssoID = 12, Check
15、sum = 1, InstanceID = 12 LocalPort = 36422, SourIP = 100、64、20、108, VpnId = 31PeerPort = 36422, DestIP = 100、64、43、43, VpnId = 31Association State = established(此处显示偶联状态,established标示偶联正常)CulTsnAcked = 1926134201, NextTsnAssign = 1926134202, LastRecvTSN = 2040807940OutStandingSize = 0, PendingChkNum
16、 = 261888, MtuSize = 1500TxReChkNum = 0TxStrmNum = 2, RxStrmNum = 2PeerVerifTag = 2040807912, MyVerifTag = 1926134177TCB info 13:偶连号13ULPID = 13, AssoID = 13, Checksum = 1, InstanceID = 13 LocalPort = 36422, SourIP = 100、64、20、108, VpnId = 31PeerPort = 36422, DestIP = 100、64、25、85, VpnId = 31Associa
17、tion State = cookie_wait(此处显示偶联状态,cookie wait标示偶联不正常)CulTsnAcked = 0, NextTsnAssign = 2957087305, LastRecvTSN = 0OutStandingSize = 0, PendingChkNum = 4294967168, MtuSize = 0TxReChkNum = 0TxStrmNum = 2, RxStrmNum = 2PeerVerifTag = 0, MyVerifTag = 2957087305TCB info 14:偶连号14ULPID = 14, AssoID = 14, Ch
18、ecksum = 1, InstanceID = 14 LocalPort = 36422, SourIP = 100、64、20、108, VpnId = 31PeerPort = 36422, DestIP = 100、64、43、39, VpnId = 31Association State = cookie_wait(此处显示偶联状态,cookie wait标示偶联不正常)CulTsnAcked = 0, NextTsnAssign = 2529838369, LastRecvTSN = 0OutStandingSize = 0, PendingChkNum = 4294967168,
19、 MtuSize = 0TxReChkNum = 0TxStrmNum = 2, RxStrmNum = 2PeerVerifTag = 0, MyVerifTag = 2529838369=End:Show Assoc TCB Info=value = 34(0x22) end to excel fun:showtcb $exit(退出平台进程)ushell recv signo:0、quit debug and exit ushell!# exit(退出基站CC单板连接)关闭连接。4、排查传输与基站偶联异常问题后,若S1断链故障还未解决,需要排查与EPC之间得S1对接参数核对,检查TD-LTE得E-UTRAN TDD小区中跟踪区码TAC就是否按照EPC协商得值配置,如下图所示:如果此参数与EPC侧配置不一致,会导致S1链路建立失败,需要按照规划修改此配置参数。4.故障排查总结S1链路基于高层得协议链路,排查过程涉及到多个底层协议链路,需要从底层链路开始排查:1、首先需要确定物理传输链路就是否正常。2、其次排查IP协议层地址就是否配置正确。3、再次确认SCTP偶联链路就是否正常。4、最后检查与EPC侧S1对接参数TAC就是否配置一致。
copyright@ 2008-2022 冰豆网网站版权所有
经营许可证编号:鄂ICP备2022015515号-1