Automation of Mobile Radio Network Performance and Fault Management
Automation of Mobile Radio Network Performance and Fault Management
Espoo 28.2.2007
• Consists of Mobile Equipment (ME) and Subscriber Identity Module (SIM) for the end- RAN
user to access the mobile network
• RAN – Radio Access Network
UTRAN PSTN
• Currently the most popular mobile RANs are UTRAN and GERAN
UE GERAN CN
• Other radio access technologies are LTE, WiMAX and WiFi
• CN – Core Network IP
X
• All RANs are attached to a CN that provides switching and access to services in PSTN
and any IP network O&M
• UTRAN architecture
Uu Iub Iu (CS/PS)
• Network elements
• RNC – Radio Network Controller Mobile n e tw ork
UTRAN CN
• Node B aka. BTS – Base Transceiver Station
• A – ATM transmission nodes B
• Interfaces RNC CS
• IuCS: RNC to Circuit Switched Core Network (voice and video calls) B Iur
• IuPS: RNC to Packet Switched Core Network (data calls) B
• Iur: RNC to RNC B A RNC PS
• Iub: RNC to BTS UE
• Uu: BTS to UE
• O&M: OSS to any network element: RNC, BTS, ATM-nodes and CN elements (MSC, O&M
HLR, SGSN, GGSN etc.)) OSS
250
80.00 4000 4000
200
75.00 3000 3000
150
70.00 2000 2000 100
65.00 1000 1000 50
60.00 0
0 0
00 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23
23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23 23
2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/
6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1 6/1
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
RRC setup attempts CSSR CCSR 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20 20
RRC setup failures RRC access failures RAB setup failures RAB access failures
11 2007-02-14 / Magnus Wallström RAB active failures RRC active failures RRC setup attempts
Results (III/IX):
Example on current PM and FM process (2/3)
• 3. Get the KPIs and failure counters on BTS level.
• It can be achieved using the same reporting tool than in the first phase. The output is extensive
list of all the BTS under one RNC, all measurement periods and counters per each BTS.
• 4. Find the network elements that are causing the performance dip.
• After post-processing the data, the results are lists of BTS that are the main contributors to the
performance dips
RNC id
RNC id
BTS id
BTS id
CCSR
CSSR
Cell id
CCSR
CSSR
Cell id
time
time
2006/12/23 11 1 104 1041 15.02 99.01 1345 1132 4 3 4 2 4 2006/12/23 16 1 123 1231 98.46 15.80 1234 8 4 3 4 1023 4
2006/12/23 11 1 104 1042 19.19 96.97 1032 820 4 5 5 6 5 2006/12/23 16 1 123 1232 98.45 24.70 1032 3 4 4 5 765 5
2006/12/23 11 1 104 1043 22.26 97.76 602 452 7 8 1 3 4 2006/12/23 16 1 123 1233 97.51 45.32 602 4 7 3 1 321 4
2006/12/23 11 1 69 691 56.76 99.05 185 69 6 2 3 1 2 2006/12/23 16 1 69 691 96.62 91.40 385 2 6 2 3 32 2
2006/12/23 11 1 69 693 59.60 96.61 99 24 7 8 1 2 2 2006/12/23 16 1 69 693 97.66 91.62 342 1 3 3 1 28 2
2006/12/23 11 1 201 2011 64.29 96.30 84 19 6 3 2 2 3
Ala rm s n o t m a pp ed NE logs a re too la rg e to be sa ved con tinu ously for a lon g tim e
La ck of com pe ten ce
Investig
Inve stigatio
ationn~~PM
PM
Analysis~~FM
Analysis FM
Syste m log file s for the fa ilure s
“the so
“the solutio
lutionn”:
”: tha t have occurre d in the Decisio
De cisionn
AuAutom
tomaate
te dd live ne tw ork ~~CC
MM~~SMSM
Conne ction to a
ininvestigation
vestigation
live ne twork ofth
of theennetw
etwork
ork
performan
perform ancece
Ove rvie w re port of the
live ne tw ork pe rform an ce
OUTPUT:
•System log files and other detail data for the failures that have occurred in the live network. The root-cause analysis
phase utilises this data to make decisions.
•Overview reporting of the network performance that can be utilised in reporting the status of the network to company
management and to customer, i.e. the network operator.
B
RNC CS
B
Sol5:
Sol5: Sol4:
Sol4: Iur
Re
Report
po rt Proce
Processor
ssor B
B A RNC PS
O&M
OSS
Sol1:
Sol1:
Oss
Oss da
data
ta fe
fetch
tcheerr
18 2007-02-14 / Magnus Wallström
Conclusions of the thesis
• Summary of the thesis, Thesis studied practical problems of mobile radio network management:
• Conclusion: UTRAN vendor technical support requires a distributed system of troubleshooting tools to enhance its
troubleshooting processes
• Purpose of the troubleshooting tools is to enhance the performance investigation by automating gathering of the performance and
other relevant network behaviour data for the time periods where network suffers from low performance
• The reasoning of the solution bases on
• Current troubleshooting set-up study:
• Organisation: vendor home base technical support that is a link between local teams, which are located by the operated networks, and the vendor
R&D. During special occasions, e.g. a new product release or emergency situation in network, the organisation may adjust itself by transferring
temporarily system specialist to work locally by the operated network.
• Process: The practical performance and fault management process consists of three phases: investigation, analysis and decision.
• The analysis of the current set-up:
• currently the main problem is the inefficiency of the first, i.e. investigation, phase in the performance and fault management process.
• Generalisation of the results
• Same principles are applicable to other radio network (e.g. GERAN) performance and fault management
• Utilization of an OSS in data gathering makes the solution more portable to other radio network systems
• Typically OSS uses relational SQL databases. Different radio networks have different performance indicators. Then the same tools may be used
after modifying SQL-queries, which is a straightforward process
• Future work
• Scope was limited to investigation. Also the complex analysis-phase has demanding development needs.
• Technical support organization requires product-processes to manage the development and maintanance of the
troubleshooting tools.