204.4293.31 - DmOS - Troubleshooting Guide
204.4293.31 - DmOS - Troubleshooting Guide
204.4293.31 - DmOS - Troubleshooting Guide
Version 9.4.4
Contacts
Technical Support
Datacom has available a support portal - DmSupport, to help the customers in use and config of our equipment.
In this site the following are available: firmwares, technical datasheets, config guide, MIBs and manuals for download. In
addition, it allows opening of calls for assistance with our technical team.
We would like to highlight that our assistance through telephone support is available from Monday through Friday from
08:00 AM through 05:30 PM.
Important: For support assistance 24x7, please request a quotation to our sales department.
General Information
For any other additional information, please visit the https://www.datacom.com.br/en or call:
DATACOM
+55 51 3933-3000
Product Documentation
This document is part of a set of documents prepared to provide all necessary information about DATACOM products.
Software Platform
• Quick Configuration Guide - Provides instructions on how to set functionalities in a quick manner in the equipment
• Troubleshooting Guide - Provides instructions on how to analyze, identify and solve problems with the product
• Release Notes - Provides instructions on the new functionalities, identified defects and compatibilities between
Software and Hardware
Hardware Platform
The availability of some documents can vary depending on the type of product.
Access https://supportcenter.datacom.com.br to locate the related documents or contact the Technical Support for
additional information.
The present document is a set of instructions that provide a quick and objective explanation on the use of the functionalities
available in the product.
This document was developed to be used as an eventual source for solution of technical issues, and for this reason its
sequential reading is not mandatory.
It is presumed that the individual or individuals that manage any aspect of the product should have the basic knowledge of
Ethernet, network protocols and communication networks in general.
Audience
This guide is directed to network administrators, technicians or teams qualified to install, set, plan and maintain this
product.
Conventions
To facilitate understanding of the present manual, the following conventions were adopted:
Icons
Note The notes explain in a better manner a detail included in the text.
WEEE Directive Symbol (Applicable in the European Union and other European coun-
tries with separate collection systems). This symbol on the product or its packaging
indicates that this product must not be disposed of with other waste. Instead, it is your
responsibility to dispose of your waste equipment by handing it over to a designated
collection point for the recycling of waste electrical and electronic equipment. The
Note
separate collection and recycling of your waste equipment at the time of disposal
will help conserve natural resources and ensure that it is recycled in a manner that
protects human health and the environment. For more information about where you
can drop off your consumer waste equipment for recycling, please contact your local
city recycling office or the dealer from whom you originally purchased the product.
This symbols means that, case the procedure was not correctly followed, may exist
Warning
electrical shock risk.
Warning Represents laser radiation. It is necessary to avoid eye and skin exposure.
This symbol means that this text is very important and, if the orientations were not
Caution
correct followed, it may cause damage or hazard.
A warning icon requests special attention to the conditions that, if not avoided, may cause physical damages
to the equipment.
A caution icon requests special attention to the conditions that, if not avoided, may result in risk of death of
serious injury.
Text Convention
This report uses these text conventions to convey instructions and information:
Convention Description
Internet site or an e-mail address. It is also applied to indicate a local link inside the
Hyperlink
document itself (e.g. a chapter).
The text convention shown above differs from Command Line Interface syntax convention. See the
convention related to commands on Sintaxe de Comando.
Table of Contents
Contacts 2
Product Documentation 3
1 Alarms 9
1.1 Alarms Severity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.2 Alarms Status . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.3 How to Check Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.4 Understanding Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2 MIB 11
2.1 Exporting the SNMP MIBs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.2 Querying an SNMP object . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3 Management 12
3.1 CPU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.1.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.1.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.2 Environment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3.2.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3.2.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3.3 Memory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
3.3.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
3.3.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
4 OAM 22
4.1 CFM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
4.1.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
4.1.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
4.2 EFM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.2.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.2.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
5 Interface 27
5.1 Backup-Link . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
5.1.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
5.1.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
6 GPON 29
6.1 OLT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
6.1.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
6.1.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
6.2 ONU . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
6.2.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
6.2.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
7 Switching 45
7.1 EAPS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
7.1.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
7.1.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
7.2 Loopback Detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
7.2.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
7.2.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
8 MPLS 48
8.1 L2VPN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
8.1.1 Troubleshooting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
8.1.2 Alarms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Legal Note 49
Warranty 49
1 Alarms
This chapter gives to understanding about the alarms in DmOS.
• Alarms Severity
• Alarms Status
• Understanding Alarms
Alarms in DmOS can be classified into three levels of severity: Critical, Major and Minor.
• Critical (Critical alarms) - Conditions that impact the equipment operation and require immediate correction action.
Some examples:
• Major (High priority alarms) - Conditions that impact the equipment operation but are not critical. The condition
should be investigated to verify the need for immediate action. However, some corrective action is required. Some
examples:
• Minor (Low priority alarms) - Alarm condition does not prevent the operation of equipment, but the condition must
be examined, monitoring and if necessary corrected for not to become more serious. Some examples:
When a alarm is activated one trap is generated with Critical, Major or Minor severity. On the other hand,
when a alarm is disactivated another trap is generated but with clear severity, signaling that alarm is not
more actived.
Some alarms has more than one severity like CPU and Memory.
• Active - Informs that alarm is activated on equipment and some action is necessary to clear.
• Unstable - Informs that alarm is activated on equipment but is flapping. This status is detected when at least 5
transactions of alarm have occurred in the last 90 seconds.
CLI (Command Line Interface) can be used to check alarms. The CLI is accessed by using a direct console connection or by
using a TELNET or SSH connection from a remote management terminal. Also it is possible to check alarms through
DmView. The avaialble command to check alarms in CLI is show alarm.
DM4770# show alarm
Triggered on Severity Source Status Name Description
-------------------------- -------- --------- -------- ----------------- -------------------------------------
2023-04-19 09:01:59 UTC-3 MAJOR domain63 Active EAPS_RING_FAILED EAPS domain changed state to ’Failed’
DM4770#
• Name - Name of alarm. The prefixed "*" is used when the alarm is unstable status.
For each alarm presented on the next chapter, the follows items will be showed:
• Suggestion Action - Informs some possible actions to help the operator to clear alarm.
• Trap Name - Informs the name of trap. The user can to check more details in specific MIB using trap name.
2 MIB
This chapter is about understanding the Management Information Base (MIB) supported by DmOS.
DmOS supports three versions of the Simple Network Management Protocol (SNMP), configuration examples can be
seen in the manual DmOS - Quick Configuration Guide. The user can export a file with all SNMP MIBs supported by
equipment to a TFTP or SCP server. The command below shall forward the MIBs file named datacom-mibs.tar.gz via
TFTP protocol to the 172.1.1.1 server.
The objective of this chapter is to explain the basic how the MIB works using as an example the MIB DMOS-REBOOT-
MIB.mib file. A MIB has several other parameters and possible configurations, however, to facilitate understanding, a
simple example will be used.
The module name is defined at the beginning of the file and is written in capital letters, and may contain numbers and
hyphens. In the example, the name is defined on the first line, DMOS-REBOOT-MIB. In addition to the name of the MIB
module, it is necessary to query the object’s name for SNMP. This can be seen below in the file defined in the OBJECT-TYPE
item. In this case it is rebootReason. Below is an example of an SNMP query on a Linux host regarding the reboot reason
of the 10.0.0.1 equipment. The query is carried out using the module name and the object name separated by a double
colon (::).
3 Management
• CPU
• Environment
• Memory
3.1 CPU
3.1.1 Troubleshooting
DmOS forwards L2/L3/MPLS packets in hardware, as well as applying filters and QoS policies, ensuring wirespeed operation
for any packet size, avoiding CPU usage for this purpose. Traffic destined to the CPU is used for establishment and
resolution of configured protocols and services. Usually CPU-related events are originate from problems in protocol
configuration, improper traffic or even the execution of some costly work requested by the user. To reduce incidents in this
component, I read the item CPU DoS Protect Configuration in the Quick Configuration Guide.
3.1.2 Alarms
CPU_CORE_HIGH
A. The CPU core usage average is higher than 90% during the last 5 minutes.
B. The CPU core usage average is higher than 70% during the last 5 minutes.
• Severity
A. Major
B. Minor
• Impact
2. Some packets are dropped in normal requests such as: SNMP and ICMP Telnet or SSH sessions may be slow.
• Possible Cause
• Suggested Action
3. Check logs about CPU using show log component sysmon command or only show log.
4. Check if the problem is caused by a system process or high network traffic, like a loop.
5. We recommend that you use the switch console for debugging on these cases.
• Trap
cpuCoreHighTrap
CPU_LOAD_HIGH
A. The CPU usage average is higher than 80% during the last 5 minutes.
B. The CPU usage average is higher than 60% during the last 5 minutes.
• Severity
A. Critical
B. Major
• Impact
2. Some packets are dropped in normal requests such as: SNMP and ICMP.
• Possible Cause
• Suggested Action
3. Check logs about CPU using show log component sysmon command or only show log.
4. Check if the problem is caused by a system process or high network traffic, like a loop.
5. We recommend that you use the switch console for debugging on these cases.
• Trap
cpuLoadHighTrap
3.2 Environment
3.2.1 Troubleshooting
Environment alarm is related to the place where the equipment is installed or also the replaceable parts of the equipment
such as PSU, Fans and expansion card (Line Card). These alarms require immediate operator action to reduce the risk of
further events.
3.2.2 Alarms
CARD_UNINITIALIZED
The line card could not be correctly initialized because it was inserted after the main board initialization.
• Severity
Major
• Impact
The line card is not programmed while in this state, rendering the line card unusable.
• Possible Cause
The line card was inserted after the main board initialization.
• Suggested Action
Reboot the system so that the line card is initialized together with the main board.
• Trap
cardUninitializedAlarmTrap
FAN_ERROR
• Severity
Major
• Impact
• Possible Cause
• Suggested Action
• Trap
fanErrorAlarmTrap
FAN_FAIL
• Severity
Critical
• Impact
1. The equipment operation will try to operate normally if the others FANS of FAN module will be normal.
2. Possible interruption in equipment operation with possibility of permanent damage if all FANS go to fail.
• Possible Cause
• Suggested Action
• Trap
fanFailAlarmTrap
FAN_MODULE_NOT_PRESENT
• Severity
Major
• Impact
The device temperature will increase and the overtemperature protection might be triggered.
• Possible Cause
• Suggested Action
• Trap
fanModuleNotPresentAlarmTrap
FAN_SPEED_LOW
• Severity
Major
• Impact
The equipment will operate near the minimum FAN speed supported.
• Possible Cause
• Suggested Action
• Trap
fanLowAlarmTrap
PSU_ERROR
• Severity
Major
• Impact
This PSU cannot have its status read. Therefore, there is no way do determine if the PSU can be used as a backup.
• Possible Cause
• Suggested Action
• Trap
psuErrorAlarmTrap
PSU_FUSE_FAILURE
• Severity
Major
• Impact
This PSU cannot be used as a backup. If the current main PSU fails or is removed, the device will be powered down.
• Possible Cause
• Suggested Action
• Trap
psuFuseFailureAlarmTrap
PSU_POWER_INPUT_FAILURE
• Severity
Minor
• Impact
This PSU cannot be used as a backup. If the current main PSU fails or is removed, the device will be powered down.
• Possible Cause
• Suggested Action
• Trap
psuPowerInputFailureAlarmTrap
PSU_UNSUPPORTED
PSU module not supported in this product. Alarm will be generated for PSU with known model and not compatible with
the product. For PSU with unknown model the alarm will not be generated.
• Severity
Major
• Impact
• Possible Cause
• Suggested Action
• Trap
psuUnsupportedTrap
TEMP_ERROR
• Severity
Major
• Impact
The equipment operation will try to operate normally but may cause permanent damage if environment temperature
remains high or low.
• Possible Cause
• Suggested Action
2. Check logs of FAN and temperature using show log component fan temperature command or only show
log.
• Trap
tempErrorAlarmTrap
TEMP_HIGH
• Severity
Critical
• Impact
The equipment will operate near the maximum temperature supported by the equipment with possibility of
permanent damage.
• Possible Cause
• Suggested Action
4. Check logs of FAN and temperature using show log component fan temperature command or only show
log.
• Trap
tempHighAlarmTrap
TEMP_LOW
• Severity
Critical
• Impact
The equipment will operate near the minimum temperature supported by the equipment with possibility of
permanent damage.
• Possible Cause
• Suggested Action
2. Check logs of temperature using show log component fan temperature command or only show log.
• Trap
tempLowAlarmTrap
3.3 Memory
3.3.1 Troubleshooting
Each equipment with the DmOS operating system has a certain amount of memory dimensioned for the product in
question. The equipment has two types of memory, the FLASH memory where the operating system, configuration files,
firmware image and logs are stored. The RAM memory is used by applications and temporary log files. DmOS has an
intelligent FLASH memory management system to avoid running out of space and thus allow the system to always have
space to operate. The RAM memory is influenced by the configurations and actions performed by the operator and it can
reach critical levels if the failure is not handled.
3.3.2 Alarms
MEMORY_AVAILABLE_LOW
• Severity
A. Critical
B. Major
• Impact
• Possible Cause
• Suggested Action
3. Check logs of Memory using show log component sysmon command or only show log.
4. Check the files storaged in equipment using file list. Delete some files if necessary.
5. Check through a monitoring tool if memory is being decreasing constantly or there has been a sudden drop.
• Trap
memAvailableLowTrap
4 OAM
• CFM
• EFM
4.1 CFM
4.1.1 Troubleshooting
Operations, Administration, and Maintenance (OAM) Connectivity Fault Management (CFM) provides end-to-end, point-to-
point, or a LAN made up of several devices. Its design is hierarchical, with different levels and can be summarized as
follows: The Maintenance Domain (MD) with names indicating the domain of action. For each MD level it is possible to have
several Maintenance Association (MA) where the MEPs will communicate. The Maintenance End Point (MEP) is the main
CFM entity that will know your specific MA within an MD level. MEPs can actively send and receive CCM frames, each MEP
is configured to periodically transmit these messages. MEP can only be configured on the interfaces that delimit the
border of a CFM domain. The Maintenance Intermediate Point (MIP) unlike the MEP is a passive entity. The MIP only sends
CFM frames in response to a frame it has received. The Continuity Check Message (CCM) is responsible for the proactive
monitoring of CFM connectivity, with the objective of detecting connection failures in the MA.
4.1.2 Alarms
CFM_ERROR_CCM
• Severity
Major
• Impact
• Possible Cause
1. The MEP ID in the received CCM is not configured in the list of remote MEPs of the receiving MEP.
2. The MEP ID in the received CCM matches the MEP ID of the receiving MEP.
3. The CCM interval on the received CCM does not match the one configured for the receiving MEP.
• Suggested Action
• Trap
None
CFM_ETH_AIS
• Severity
Major
• Impact
• Possible Cause
• Suggested Action
Contact the responsible for the lower-level MD which is generating AIS messages.
• Trap
None
CFM_RMEP_CCM
The MEP is not receiving CCMs from at least one of the configured remote MEPs.
• Severity
Major
• Impact
• Possible Cause
• Suggested Action
• Trap
None
CFM_RMEP_INTF
The interface status received from a configured remote MEP indicates an error condition.
• Severity
Major
• Impact
The interface on which the remote MEP is attached to is unable to forward traffic correctly.
• Possible Cause
There is a link failure on the port the remote MEP is attached to.
• Suggested Action
Check the connectivity of the links on the equipment the MEP reporting the error is configured.
• Trap
None
CFM_RMEP_RDI
A remote MEP is not receiving CCMs from at least one of its remote MEPs.
• Severity
Minor
• Impact
The communication between remote MEP and one of its remote MEPs is lost.
• Possible Cause
1. The CCM transmission is disabled in one of the remote MEPs of a remote MEP.
• Suggested Action
• Trap
None
CFM_XCON_CCM
• Severity
Major
• Impact
• Possible Cause
The MA or MD name on the received CCM does not match the MA or MD configured for the MEP.
• Suggested Action
Verify for both local and remote MEPs the configured names of Maintenance Domains and Maintenance Associations.
• Trap
None
4.2 EFM
4.2.1 Troubleshooting
Ethernet in the First Mile (EFM) is an Operations, Administration, and Maintenance (OAM) mechanism/technology with
the objective of monitoring the state of the point-to-point link, blocking the interface as soon as the communication
is interrupted. Protocol information is transmitted through frame slow Protocol OAM Protocol Data Units (OAMPDUs).
OAMPDUs are only exchanged between interfaces on the link and are not forwarded across switches, in case of using other
encapsulation technologies such as L2VPN, L2TP the frames can be forwarded without being treated. OAMPDUs contain
status and control information used to monitor, test, and troubleshoot the link for unidirectional communication and other
problems.
4.2.2 Alarms
EFM_FAILURE
• Severity
Major
• Impact
The interface on which the failure was detected will not forward any traffic until its recovery. This behavior is a
protection to avoid using an interface under unsafe conditions, such as a unidirectional link.
• Possible Cause
EFM failures are detected when the interface stops receiving EFM PDUs from its remote peer, which in turn might
have several root causes, such as defective cables/fibers and misconfigurations of network devices. EFM failures are
also detected when the remote EFM peer reports a malfunction in its interface, i.e., a failure in the interface of
another network device will block a local interface.
• Suggested Action
Inspect physical connections and configurations in the network devices that would cause PDUs to be lost.
• Trap
efmFailureAlarmTrap
5 Interface
• Backup-Link
5.1 Backup-Link
5.1.1 Troubleshooting
Not available.
5.1.2 Alarms
BACKUPLINK_INTERFACE_DEFECT
• Severity
Major
• Impact
Interface cannot transmit/receive packets. It cannot become active in case the other interface has a link failure or
becomes blocked.
• Possible Cause
• Suggested Action
• Trap
backuplinkInterfaceDefectAlarmTrap
BACKUPLINK_USING_BACKUP_INTERFACE
Main interface suffered a link failure or is blocked by another protocol, resulting in a switchover to the backup interface.
• Severity
Major
• Impact
• Possible Cause
• Suggested Action
• Trap
backuplinkUsingBackupAlarmTrap
6 GPON
• OLT
• ONU
6.1 OLT
6.1.1 Troubleshooting
Not available.
6.1.2 Alarms
OLT_ADAPT_FAILURE
• Severity
Critical
• Impact
1. It is possible that data traffic will keep working but control plane may not work correctly.
• Possible Cause
• Suggested Action
• Trap
None
GPON_LOS
• Severity
Critical
• Impact
• Possible Cause
3. Attenuation of the very high signal. ONU was disconnected from OLT.
• Suggested Action
2. Check the GPON transceiver of PON port in OLT using show interface transceivers gpon command.
3. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
4. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
• Trap
None
GPON_TX_FAULT
• Severity
Minor
• Impact
• Possible Cause
• Suggested Action
1. Check the GPON transceiver of PON port in OLT using show interface transceivers gpon command.
• Trap
None
GPON_TF
Transmitter failure. The response signal expected from the card after routing data for one port was not received.
• Severity
Critical
• Impact
• Possible Cause
• Suggested Action
1. Check the GPON transceiver of PON port in OLT using show interface transceivers gpon command.
• Trap
gPonTFAlarm
6.2 ONU
6.2.1 Troubleshooting
The attenuation conditions are the cause of the majority of signal problems in the ONU. Higher attenuations cause the
appearance of more alarms related to signal failures. In this scenario, the ONT can tune some parameters to improve the
signal and to minimize these conditions. In adverse ONU or fiber conditions, alarms will appear in the following order:
• SD (Signal Degraded)
• SF (Signal Fail)
6.2.2 Alarms
GPON_DOWi
Drift of window of ONU. ONU transmission is received at an unexpected time (the phase shifted).
• Severity
Critical
• Impact
• Possible Cause
Temporary unstable condition in the fiber. With environmental changes, e.g., temperature, humidity or even wind,
the fiber can expand or contract, causing variations in its length, and consequently, the distance between the OLT
and the ONU.
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU. Check if the fiber between ONU and OLT is clean.
• Trap
gPonDOWiAlarm
GPON_LOAi
Loss of acknowledge with ONU. OLT did not receive ONU acknowledgement after issuing DS messages that require US
acknowledge from the ONU.
• Severity
Minor
• Impact
User traffic.
• Possible Cause
• Suggested Action
• Trap
gPonLOAiAlarm
GPON_SFi
Signal fail of ONU. ONU upstream signal exceeds the BER threshold.
• Severity
Critical
• Impact
User traffic.
• Possible Cause
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
• Trap
gPonSFiAlarm
GPON_PEEi
• Severity
Major
• Impact
• Possible Cause
ONU is malfunctioning.
• Suggested Action
1. Reinitialize ONU using onu-reset onu x command, where x is the ONU ID.
2. Replace ONU.
• Trap
gPonPEEiAlarm
GPON_LOSi
• Description
• Severity
Critical
• Impact
• Possible Cause
2. The ONU is powered, but have a hardware failure that interferes with its transmission.
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU. Check if the link between OLT and ONU or splitter
is operational.
• Trap
gPonLOSiAlarm
GPON_ONU_EQUIP_FAILURE
• Severity
Major
• Impact
User traffic.
• Possible Cause
ONU is malfunctioning.
• Suggested Action
1. Reinitialize ONU using onu-reset onu x command, where x is the ONU ID.
2. Replace ONU.
• Trap
gPonOnuEquipmentFailureAlarm
GPON_ONU_DOWN
The ONU has an internal problem/defect or ONU was disconnected from the PON link.
• Severity
Major
• Impact
• Possible Cause
1. ONU disconnected.
• Suggested Action
1. Check if the ONU is operational and connected to OLT using show interface gpon 1/1/1 onu 1 optical-info
and/or show interface gpon 1/1/1 onu 1 rssi commands. Where 1/1/1 is the chassi/slot/port and onu 1 is the
ID of ONU.
3. Replace ONU.
• Trap
None
GPON_DFi
Deactivate failure of ONU. ONU does not react correctly after three Deactivate_ONU-ID or three Disable_Serial_Number
messages.
• Severity
Critical
• Impact
1. User traffic.
• Possible Cause
ONU is malfunctioning.
• Suggested Action
Check if ONU was physically removed but the configuration on OLT was not removed.
• Trap
gPonDFiAlarm
GPON_LOFi
Loss of frame of ONU. OLT received four consecutive invalid delimiters from the ONU.
• Severity
Critical
• Impact
• Possible Cause
2. The ONU is powered, but have a hardware failure that interferes with its transmission.
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
4. Replace ONU.
• Trap
gPonLOFiAlarm
GPON_LCDGi
Loss of GEM channel delineation. The delimitation of the frame GEM header is incorrect in three consecutive frames.
• Severity
Major
• Impact
User traffic.
• Possible Cause
1. The fiber is defective. It may be improperly connected, aged, bent or damaged. Dirty or faulty connector.
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
• Trap
gPonLCDGiAlarm
GPON_LOAMi
Loss of PLOAM for ONU. Three consecutive PLOAM messages of the ONU are missing after OLT sends PLOAMu request for
the ONU.
• Severity
Minor
• Impact
1. User traffic.
• Possible Cause
2. This alarm could be shown during ONU’s activation process. It is cleared after a successful activation.
• Suggested Action
• Trap
gPonLOAMiAlarm
GPON_ONU_AUTO_PROV_FAIL
There was an error while adding one or more ONUs by the auto provisioning feature.
• Severity
Minor
• Impact
• Possible Cause
3. There are other possible causes for failure. Check user log.
• Suggested Action
Check logs to find out the error cause and correct it.
• Trap
gPonOnuAutoProvAddFailTrap
GPON_ONU_PASSWORD_MISMATCH
• Severity
Major
• Impact
Services are not configured for the ONU due to authentication failure.
• Possible Cause
• Suggested Action
• Trap
gPonOnuPasswordMismatchAlarm
GPON_SUFi
Start-up failure of ONU. ONU ranging failed 2 times while the OLT receives the signal bursts.
• Severity
Minor
• Impact
User traffic.
• Possible Cause
The fiber is defective. It may be improperly connected, aged, bent or damaged. Dirty or faulty connector.
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
3. Check if the fiber between ONU and OLT is clean and without damaged fiber connector.
• Trap
gPonSUFiAlarm
GPON_MISi
Link mismatch of ONU. OLT detected that the PST message sent or received are different.
• Severity
Major
• Impact
1. User traffic.
• Possible Cause
• Suggested Action
• Trap
gPonMISiAlarm
GPON_DGi
Receive dying gasp of ONU. OLT received message that the ONU has lost AC power or is below a certain threshold.
• Severity
Critical
• Impact
• Possible Cause
• Suggested Action
• Trap
gPonDGiAlarm
GPON_RDIi
Remote defect indication of ONU. The OLT transmission is received with defects at the ONU.
• Severity
Minor
• Impact
User traffic.
• Possible Cause
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
• Trap
gPonRDiAlarm
GPON_SDi
Signal degraded of ONU. Signal of an ONU deteriorates and the upstream signal reaches the BER threshold.
• Severity
Major
• Impact
User traffic.
• Possible Cause
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
• Trap
gPonSDiAlarm
GPON_LOBi
2. Failure to delineate, for any reason, the specified number (default 4) of consecutive scheduled bursts from ONU,
when not exempt by power management state machine.
• Severity
Critical
• Impact
• Possible Cause
2. The ONU is powered, but have a hardware failure that interferes with its transmission.
• Note
This alarm is only available to XGS ONU, and replaces the conditions known as LOSi and LOFi.
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
• Trap
gPonLOBiAlarm
GPON_LOPCi
2. This is a generic defect indicating breakage of the PLOAM protocol: persistent MIC failure in the upstream; lack of
acknowledgements or proper PLOAM responses from the ONU. Persistent means that the same irregular condition is
observed consecutively at least three times.
• Severity
Major
• Impact
• Possible Cause
2. The ONU is powered, but have a hardware failure that interferes with its transmission.
• Note
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
• Trap
gPonLOPCiAlarm
GPON_LOOCi
• Severity
Major
• Impact
• Possible Cause
2. The ONU is powered, but have a hardware failure that interferes with its transmission.
• Note
• Suggested Action
1. Check if the TX-Power and RX-Power of PON link is within of recommended values using show interface
transceivers gpon command.
2. Check if the TX-Power and RX-Power of link between a specific ONU and OLT are within of recommended values
using show interface gpon 1/1/1 onu 1 optical-info and/or show interface gpon 1/1/1 onu 1 rssi commands.
Where 1/1/1 is the chassi/slot/port and onu 1 is the ID of ONU.
• Trap
gPonLOOCiAlarm
GPON_ONU_SELF_TEST_FAILURE
• Severity
Major
• Impact
Functioning of ONU.
• Possible Cause
ONU is malfunctioning.
• Suggested Action
1. Reinitialize ONU using onu-reset onu x command, where x is the ONU ID.
2. Replace ONU.
• Trap
gPonOnuSelfTestFailureAlarm
7 Switching
• EAPS
• Loopback Detection
7.1 EAPS
7.1.1 Troubleshooting
Not available.
7.1.2 Alarms
EAPS_FAIL_TIMER_EXPIRED
Fail Timer configured in Master equipment expired because EAPS domain missed consecutive health checks.
• Severity
Major
• Impact
None. But there is some problem in network. The EAPS can go to incomplete state if the Master equipment receive a
LINK-DOWN message.
• Possible Cause
2. A high amount of packets get lifted to the CPU and EAPS hello packets get dropped by congestion in the CPU.
Probably due to an accidental loop or abnormal traffic storm.
• Suggested Action
• Traps
None
EAPS_RING_FAILED
• Severity
Major
• Impact
The secondary port of the Master in the EAPS ring will open. A few loss of protected traffic due to the convergence of
EAPS ring.
• Possible Cause
• Suggested Action
• Traps
None
7.2.1 Troubleshooting
Not available.
7.2.2 Alarms
LOOPBACK_DETECTED
• Severity
Major
• Impact
The port on which the loop was detected will not forward any traffic until it stops receiving the control traffic for the
amount of time configured in the ports timer:
• Possible Cause
Wrong physical connection or configuration, usually associated with misconnected optical fibers, defective cabling,
or any configuration that would cause a loopback-detection packet to be forwarded back to the port on which it was
generated.
• Suggested Action
• Trap
loopbackDetectedAlarmTrap
8 MPLS
• L2VPN
8.1 L2VPN
8.1.1 Troubleshooting
Not available.
8.1.2 Alarms
VPWS_RED_MAIN_NEIGHBOR_FAIL
• Severity
Major
• Impact
• Possible Cause
• Suggested Action
• Trap
Indisponível.
Legal Note
In spite the fact that all the precautions were taken in development of the present document, DATACOM shall not be held
responsible for eventual errors or omissions as well as no obligation is assumed due to damages resulting from the use of
the information included in this guide. The specifications provided in this manual shall be subject to changes with no prior
notification and are not acknowledged as any type of contract.
Warranty
DATACOM’s products are covered by a warranty against manufacturing defects during a minimum period of 12 (twelve)
months including the legal term of 90 days, as from the date of issue of the supply Nota Fiscal (Invoice).
Our warranty is standard counter warranty, this means, for exercise of the warranty, the customer should send the product
to DATACOM Authorized Technical Assistance with paid freight. The return freight of the equipment will be DATACOM
responsibility.