The following sections describe how to configure and manage SNMP services with Oracle Communications Converged Application Server:
Oracle Communications Converged Application Server includes a dedicated SNMP MIB to monitor activity on engine tier and SIP data tier server instances. The Oracle Communications Converged Application Server MIB is available on both Managed Servers and the Administration Server of a domain. However, Oracle Communications Converged Application Server engine and SIP data tier traps are generated only by the Managed Server instances that make up each tier. If your Administration Server is not a target for the sipserver
custom resource, it will generate only WebLogic Server SNMP traps (for example, when a server in a cluster fails). Administrators should monitor both WebLogic Server and Oracle Communications Converged Application Server traps to evaluate the behavior of the entire domain.
Note: | Oracle Communications Converged Application Server MIB objects are read-only. You cannot modify a Oracle Communications Converged Application Server configuration using SNMP. |
The Oracle Communications Converged Application Server MIB file is installed in WLSS_HOME/server/lib/wlss/BEA-WLSS-MIB.asn1
. Use an available SNMP management tool or MIB browser to view the contents of this file. See also Trap Descriptions for a description of common SNMP traps.
To enable SNMP monitoring for the entire Oracle Communications Converged Application Server domain, follow these steps:
Note: | Ensure that you create a new Server SNMP agent, rather than a Domain-Scoped agent. |
The following sections describe the Oracle Communications Converged Application Server SNMP traps in more detail. Recovery procedures for responding to individual traps are also included where applicable.
The following Oracle Communications Converged Application Server log and configuration files are frequently helpful for troubleshooting problems, and may be required by your technical support contact:
General information that can help the technical support team includes:
Table 2-1 lists the Oracle Communications Converged Application Server SNMP traps and indicates whether the trap is generated by servers in the engine tier or SIP data tier. Each trap is described in the sections that follow.
This trap is generated by an engine tier server instance when it loses its connection to a replica in the SIP data tier. It may indicate a network connection problem between the engine and SIP data tiers, or may be generated with additional traps if a SIP data tier server fails.
If this trap occurs in isolation from other traps indicating a server failure, it generally indicates a network failure. Verify or repair the network connection between the affected engine tier server and the SIP data tier server.
If the trap is accompanied by additional traps indicating a SIP data tier server failure (for example, dataTierServerStopped
), follow the recovery procedures for the associated traps.
This trap is generated by an engine tier server instance when it successfully reconnects to a SIP data tier server after a prior failure (after a connectionLostToPeer
trap was generated). Repeated instances of this trap may indicate an intermittent network failure between the engine and SIP data tiers.
See connectionLostToPeer.
Oracle Communications Converged Application Server SIP data tier nodes generate this alarm when an unrecoverable error occurs in a WebLogic Server instance that is part of the SIP data tier. Note that this trap may be generated by the server that is shutting down, by another replica in the same partition, or in some cases by both servers (network outages can sometimes trigger both servers to generate the same trap).
See the Recovery Procedure for serverStopped.
Oracle Communications Converged Application Server engine tier nodes use a configurable throttling mechanism that helps you control the number of new SIP requests that are processed. After a configured overload condition is observed, Oracle Communications Converged Application Server destroys new SIP requests by responding with “503 Service Unavailable” to the caller. The server continues to destroy new requests until the overload condition is resolved according to a configured threshold control value. This alarm is generated when the throttling mechanism is activated. The throttling behavior should eventually return the server to a non-overloaded state, and further action may be unnecessary. See Overload in the Configuration Reference Manual.
1. Check other servers to see if they are nearly overloaded.
2. Check to see if the load balancer is correctly balancing load across the application servers, or if it is overloading one or more servers. If additional servers are nearly overloaded, Notify Tier 4 support immediately.
3. If the issue is limited to one server, notify Tier 4 support within one hour.
Question: How can I monitor load using the Administration Server? How can I tell when I'm near a threshold?
Answer: If you set the queue length as an incoming call overload control, you can monitor the length of the queue using the Administration Console. If you specify a session rate control, you cannot monitor the session rate using the Administration Console. (The Administration Console only displays the current number of SIP sessions, not the rate of new sessions generated.)
Oracle Communications Converged Application Server SIP data tier nodes generate this alarm when a server instance is added to a partition in the SIP data tier.
This trap is generated during normal startup procedures when SIP data tier servers are booted.
SIP data tier nodes generate this alarm if an engine server client that was not registered (or was removed from the list of registered engines) attempts to communicate with the SIP data tier. This trap is generally followed by a serverStopped
trap indicating that the engine tier server was shut down to preserve SIP data tier consistency.
Restart the engine tier server. Repeated occurrences of this trap may indicate a network problem between the engine tier server and one or more replicas.
Oracle Communications Converged Application Server SIP data tier nodes generate this alarm when a server is removed from the SIP data tier, either as a result of a normal shutdown operation or because of a failure. There must be at least one replica remaining in a partition to generate this trap; if a partition has only a single replica and that replica fails, the trap cannot be generated. In addition, because engine tier nodes determine when a replica has failed, an engine tier node must be running in order for this trap to be generated.
If this trap is generated as a result of a server instance failure, additional traps will be generated to indicate the exception. See the recovery procedures for traps generated in addition to replicaRemovedFromPartition
.
This trap indicates that the WebLogic Server instance is now down. This trap applies to both engine tier and SIP data tier server instances, but only when the servers are members of a named WebLogic Server cluster. If this trap is received spontaneously and not as a result of a controlled shutdown, follow the steps below.
ps – ef | grep java
There should be only one PID for each WebLogic Server instance running on the machine.
kill -3 [pid]
Note: | Oracle Communications Converged Application Server logs are truncated according to your system configuration. Make backup logs immediately to avoid losing critical troubleshooting information. |
Question: If the server shuts down, are all SNMP traps for the server lost?
Answer: The Administration Console generates SNMP messages for managed WebLogic Server instances only until the ServerShutDown message is received. Afterwards, no additional messages are generated.
Oracle Communications Converged Application Server engine tier nodes generate this alarm when a SIP Servlet is deployed to the container.
This trap is generated during normal deployment operations and does not indicate an exception.
Oracle Communications Converged Application Server engine tier nodes generate this alarm when a SIP application shuts down, or if a SIP application is undeployed. This generally occurs when Oracle Communications Converged Application Server is shutdown while active requests still exist.
During normal shutdown procedures this alarm should be filtered out and should not reach operations. If the alarm occurs during the course of normal operations, it indicates that someone has shutdown the application or server unexpectedly, or there is a problem with the application. Notify Tier 4 support immediately.
Oracle Communications Converged Application Server engine tier nodes generate this trap when an application deploys successfully as a Web Application but fails to deploy as a SIP application.
The typical failure is caused by an invalid sip.xml
configuration file and should occur only during software installation or upgrade procedures. When it occurs, undeploy the application, validate the sip.xml file, and retry the deployment.
Note: | This alarm should never occur during normal operations. If it does, contact Tier 4 support immediately. |