5 SEPP Metrics, KPIs, and Alerts
5.1 SEPP Metrics
This section provides information about the SEPP metrics.
Table 5-1 Dimensions
Dimensions | Details | Values |
---|---|---|
apiUrl | Resource URIs passing across Inter PLMN Via SEPP | Example: /namf-comm/v1/subscriptions |
app | SEPP Service names |
Examples:
|
application | Application name. For SEPP, it is ocsepp. | Example: ocsepp |
cause | Indicates the reason of failure |
Examples:
|
chart | Indicates the SEPP microservice release names |
Examples:
|
container | Indicates the name of the container. It is part of every metric. The app and container dimensions contain the same value. |
Examples:
|
DestinationHost | Indicates the destination host for Jetty client on PN32F or CN32F |
Examples:
|
direction |
Direction of the request or response. In Gateway Metrics, the values are egress and egressOut. In N32F Metrics the values are ingress and egress. |
Examples:
|
Egress Rate Limit List | The list that contains the PLMN for applying Egress Rate Limit. | Example: ERL1 |
engVersion | The SEPP Release version |
Example:
|
error_action | The action to be taken when there is a validation failure in SEPP. |
Examples: REJECT FORWARD |
ErrorOriginator | Name of service that originates the error. | Example: PN32F |
event | The event that occurred on request processing or completion. |
Examples:
|
handshake_procedure | The type of the handshake operation at cSEPP or pSEPP. | Example: capability-exchange |
header | SBI Headers |
Examples:
|
Host | FQDN of the target host | Example: ocsepp-release-n32-egress-gateway. |
http_error_message | Reason for failure response received. |
Examples:
|
http_method | HTTP Method Name |
Examples: GET PUT POST PATCH DELETE |
http_status | HTTP Status Code in response (404 NOT_FOUND, 429 TOO_MANY_REQUESTS, 200 OK) |
Examples: 2xx, 4xx, 5xx |
namespace | Name of the Kubernetes namespace on which microservice is running. | Example: seppsvc |
nf_instance_id | Unique identity of the NF Instance sending request to OCSEPP. | Example: 9faf1bbc-6e4a-4454-a507-aef01a101a06 |
NfServiceType | Name of target network function service | Example: nausf-auth |
NfType | Name of target network function |
Examples:
|
node | Name of the Kubernetes worker node on which microservice is running. | Example: k8s-node-13.chase1.lab.in.oracle.com |
peer_domain | Domain of Remote SEPP |
Example: svc.cluster.local |
peer_fqdn | FQDN of peer present in Remote SEPP | |
peer_plmn_id | Supported PLMN list of Remote SEPP |
Example: "[Plmn [mcc=123, mnc=456]]" |
plmn_identifier | In CAT 2 Network ID Validation feature, PLMN is extracted from this identifier. |
Examples:
|
pod | Name of the pod of SEPP microservice | Example: ocsepp-release-cn32f-svc-6fd6ccfd4b-hkgqb |
Port | Port number | Example: 443 |
release | Name of the SEPP release deployed. | Example: ocsepp-release |
releaseVersion | Indicates the current release version of SEPP. | Example: 23.4.0 |
remote_sepp_name | Name of the SEPP from where message is received or destined to | Example: SEPP-1 |
remote_sepp_set_name | Name of the Remote SEPP Set from where message is received or destined to | Example: RPS-3 |
request_path | Resource URI as per defined in 3GPP specifications for 5G. | Example: /nudm-sdm/v2/imsi-987654000000008 |
ruleApplied | Rules Applied on Local SEPP or Remote SEPP. |
Examples:
|
Scheme | Indicates the HTTP Scheme |
Examples:
|
sepp_type | SEPP that acts as Producer SEPP or Consumer SEPP |
Examples:
|
sourceRss | Applicable only if Allowed P-RSS Validation is enabled | Example: |
Status | The status of the feature or microservice. |
Examples:
|
vendor | For OCSEPP, vendor Value must be set to "oracle" |
Example: oracle |
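If these dimensions are exposed as Prometheus-style labels (an assumption; how labels appear depends on the monitoring stack in use), a metric can be narrowed to one deployment by building a label selector from dimension names and values. The following is a minimal sketch: `build_selector` is a hypothetical helper, and the metric name and label values are taken from the examples in this guide.

```python
def build_selector(metric: str, dimensions: dict) -> str:
    """Return a Prometheus-style selector such as metric{key="value"}."""
    parts = []
    for key in sorted(dimensions):  # sort for a stable, readable result
        # Escape backslashes and double quotes inside label values.
        value = dimensions[key].replace("\\", "\\\\").replace('"', '\\"')
        parts.append(f'{key}="{value}"')
    if not parts:
        return metric
    return metric + "{" + ",".join(parts) + "}"


# Dimension values below come from the examples in Table 5-1.
selector = build_selector(
    "ocsepp_cn32f_requests_total",
    {"namespace": "seppsvc", "release": "ocsepp-release"},
)
print(selector)
```

Sorting the keys keeps the generated selector stable, which makes it easier to compare queries across dashboards.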
5.1.1 Configuring SEPP Metrics Dashboard in OCI
This section describes the steps to upload the ocsepp_oci_dashboard_<version>.json file to the OCI Logging Analytics Dashboard. Because OCI doesn't support Grafana, it uses the Logging Analytics Dashboard Service to visualize the metrics and logs.
- Log in to OCI Console.
  Note: For more details about logging in to OCI, refer to Signing In to the OCI Console.
- Open the navigation menu and click Observability & Management.
- Under Logging Analytics, click Dashboards. The Dashboards page appears.
- Choose the Compartment on the left pane.
- Click Import dashboards.
- Select and upload the ocsepp_oci_dashboard_<version>.json file. The following three parameters of the JSON file must be customized before uploading it:
  - ##COMPARTMENT_ID: The OCID of the compartment.
  - ##METRIC_NAMESPACE: The metrics namespace that the user provided while deploying the OCI adaptor.
  - ##K8_NAMESPACE: The Kubernetes namespace where SEPP is deployed.
- The Import dashboard page appears. Click Import on the page.
You can now view the imported dashboard and the metrics in it.
Note:
SEPP organizes its panels (widgets) into five dashboards to support the SEPP metrics, and all five dashboards are combined into a single JSON file.
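The placeholder customization described in the steps above can be scripted. The following is a minimal sketch, assuming the three placeholders appear in the file literally as `##COMPARTMENT_ID`, `##METRIC_NAMESPACE`, and `##K8_NAMESPACE`. The function name, the sample JSON structure, and the OCID and metric namespace values are hypothetical and only illustrate the substitution.

```python
PLACEHOLDERS = ("##COMPARTMENT_ID", "##METRIC_NAMESPACE", "##K8_NAMESPACE")


def customize_dashboard(template_text: str, values: dict) -> str:
    """Replace the three ##... placeholders in the dashboard JSON text."""
    missing = [name for name in PLACEHOLDERS if name not in values]
    if missing:
        raise ValueError(f"no value supplied for: {missing}")
    result = template_text
    for name in PLACEHOLDERS:
        result = result.replace(name, values[name])
    return result


# Tiny stand-in for the real file content (hypothetical structure).
sample = (
    '{"compartmentId": "##COMPARTMENT_ID", '
    '"namespace": "##METRIC_NAMESPACE", '
    '"k8s": "##K8_NAMESPACE"}'
)
customized = customize_dashboard(
    sample,
    {
        "##COMPARTMENT_ID": "ocid1.compartment.oc1..example",  # hypothetical OCID
        "##METRIC_NAMESPACE": "sepp_metrics",                  # hypothetical namespace
        "##K8_NAMESPACE": "seppsvc",                           # example from Table 5-1
    },
)
print(customized)
```

To apply this to the real file, read its text with `pathlib.Path.read_text`, pass it through the function, and save the result with `write_text` before importing the dashboard.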
5.1.2 Common Metrics
5.1.2.1 cgroup_cpu_nanoseconds
Table 5-2 cgroup_cpu_nanoseconds
Field | Details |
---|---|
Metric Details | Total CPU time consumed by service in nanoseconds |
Microservice | Consumer N32f, Producer N32f |
Type | Gauge |
Dimensions |
|
5.1.2.2 cgroup_memory_bytes
Table 5-3 cgroup_memory_bytes
Field | Details |
---|---|
Metric Details | Total memory consumed by service in bytes |
Microservice | Consumer N32f, Producer N32f |
Type | Gauge |
Dimensions |
|
5.1.2.3 oc_configclient_request_total
Table 5-4 oc_configclient_request_total
Field | Details |
---|---|
Metric Details | This metric is pegged whenever the config client polls the common configuration server for a configuration update. |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.2.4 oc_configclient_response_total
Table 5-5 oc_configclient_response_total
Field | Details |
---|---|
Metric Details | This metric is pegged whenever the config client receives a response from the common configuration server. |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.2.5 oc_configserver_reachability
Table 5-6 oc_configserver_reachability
Metric Details | Gauge metric to peg the reachability of config server |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Gauge |
Dimensions |
|
5.1.3 CN32F Common Metrics
5.1.3.1 ocsepp_cn32f_jetty_request_stat_metrics_total
Table 5-8 ocsepp_cn32f_jetty_request_stat_metrics_total
Field | Details |
---|---|
Metric Details | This metric is pegged for every event that occurs when a request is sent to CN32F. |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.3.2 ocsepp_cn32f_jetty_response_stat_metrics_total
Table 5-9 ocsepp_cn32f_jetty_response_stat_metrics_total
Field | Details |
---|---|
Metric Details | This metric is pegged for every event that occurs when a response is received from CN32F. |
Microservice | Consumer N32f |
Type | Counter |
Dimensions |
|
5.1.3.3 ocsepp_cn32f_connection_failure_total
Table 5-10 ocsepp_cn32f_connection_failure_total
Field | Details |
---|---|
Metric Details | This metric will be pegged in the customized Jetty Client as soon as it fails to connect to the destination service. |
Microservice | Consumer N32f |
Type | Counter |
Dimensions |
|
5.1.3.4 ocsepp_cn32f_requests_failure_total
Table 5-11 ocsepp_cn32f_requests_failure_total
Field | Details |
---|---|
Metric Details | Total number of requests that failed to be sent from cn32f to the remote SEPP. Condition: When any error or exception occurs on the cn32f side because of which the request is not sent to pn32f. |
Microservice | Consumer N32f |
Type | Counter |
Dimensions |
|
5.1.3.5 ocsepp_cn32f_response_failure_total
Table 5-12 ocsepp_cn32f_response_failure_total
Field | Details |
---|---|
Metric Details | Total number of responses that failed to be sent from the cn32f pod to the NF. Condition: When any error or exception occurs on cn32f and the response is not sent to the NF. |
Microservice | Consumer N32f |
Type | Counter |
Dimensions |
|
5.1.3.6 ocsepp_cn32f_requests_total
Table 5-13 ocsepp_cn32f_requests_total
Field | Details |
---|---|
Metric Details | Total number of requests received from the NF. Condition: When a request is received on the InboundInterface of cn32f. |
Microservice | Consumer N32f |
Type | Counter |
Dimensions |
|
5.1.3.7 ocsepp_cn32f_response_total
Table 5-14 ocsepp_cn32f_response_total
Field | Details |
---|---|
Metric Details | Total number of responses received from the remote SEPP. Condition: When a response is received on the OutboundInterface of cn32f. |
Microservice | Consumer N32f |
Type | Counter |
Dimensions |
|
5.1.3.8 ocsepp_cn32f_latency_seconds_count
Table 5-15 ocsepp_cn32f_latency_seconds_count
Field | Details |
---|---|
Metric Details |
This metric is used to display the number of ingress requests processed at cn32f in a particular time span (in seconds). |
Microservice | Consumer N32f |
Type | Histogram |
Dimensions |
|
5.1.3.9 ocsepp_cn32f_latency_seconds_max
Table 5-16 ocsepp_cn32f_latency_seconds_max
Field | Details |
---|---|
Metric Details | This metric is used to display the maximum processing time of an ingress request at cn32f, in seconds. |
Microservice | Consumer N32f |
Type | Histogram |
Dimensions |
|
5.1.3.10 ocsepp_cn32f_latency_seconds_sum
Table 5-17 ocsepp_cn32f_latency_seconds_sum
Field | Details |
---|---|
Metric Details | This metric is used to display the total (sum) of the processing time of all ingress requests at cn32f over a particular time span. Dividing it by ocsepp_cn32f_latency_seconds_count gives the average processing time. |
Microservice | Consumer N32f |
Type | Histogram |
Dimensions |
|
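The three latency series can be combined to obtain an average processing time. The sketch below assumes the usual Prometheus-style convention that a `_sum` series accumulates observed durations and a `_count` series accumulates the number of observations; verify this against your deployment. The function name and the sample values are hypothetical.

```python
def average_latency_seconds(sum_start, sum_end, count_start, count_end):
    """Average latency over a window from two readings of _sum and _count.

    Returns None when no requests were observed in the window, or when a
    counter reset makes the difference negative.
    """
    delta_count = count_end - count_start
    delta_sum = sum_end - sum_start
    if delta_count <= 0 or delta_sum < 0:
        return None
    return delta_sum / delta_count


# Hypothetical readings of ocsepp_cn32f_latency_seconds_sum and
# ocsepp_cn32f_latency_seconds_count taken at the start and end of a window.
avg = average_latency_seconds(10.0, 16.0, 100, 400)
print(avg)
```

Working on differences between two readings, rather than on the raw values, gives the average for the chosen window instead of the average since the pod started.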
5.1.3.11 ocsepp_cn32f_outgoing_connections
Table 5-18 ocsepp_cn32f_outgoing_connections
Field | Details |
---|---|
Metric Details | Gauge metric that will peg active outgoing connections from CN32F to destination |
Microservice | Consumer N32f |
Type | Gauge |
Dimensions |
|
5.1.3.12 ocsepp_cn32f_server_latency
Table 5-19 ocsepp_cn32f_server_latency
Field | Details |
---|---|
Metric Details | This metric will be pegged in Jetty response listener that captures the amount of time taken for processing of the request by jetty client |
Microservice | Consumer N32f |
Type | Gauge |
Dimensions |
|
5.1.4 PN32F Common Metrics
5.1.4.1 ocsepp_pn32f_requests_total
Table 5-20 ocsepp_pn32f_requests_total
Field | Details |
---|---|
Metric Details | Number of requests received from Peer SEPP. Condition: When a request reaches pn32f from peer SEPP. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.4.2 ocsepp_pn32f_requests_failure_total
Table 5-21 ocsepp_pn32f_requests_failure_total
Field | Details |
---|---|
Metric Details | Number of requests that failed to be transmitted to the NF. Condition: When pn32f fails to transmit a request to an NF. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.4.3 ocsepp_pn32f_responses_total
Table 5-22 ocsepp_pn32f_responses_total
Metric Details |
Number of responses received from Egress Gateway. Condition: When a response reaches pn32f from Egress Gateway. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.4.4 ocsepp_pn32f_responses_failure_total
Table 5-23 ocsepp_pn32f_responses_failure_total
Field | Details |
---|---|
Metric Details | Number of responses that failed to be transmitted to the Consumer SEPP (cSEPP). Condition: When pn32f fails to transmit a response to cSEPP. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.4.5 ocsepp_pn32f_latency_seconds_count
Table 5-24 ocsepp_pn32f_latency_seconds_count
Metric Details | This metric is used to display the number of ingress requests processed at pn32f in a particular time span (in seconds). |
Microservice | Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.4.6 ocsepp_pn32f_latency_seconds_sum
Table 5-25 ocsepp_pn32f_latency_seconds_sum
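Field | Details |
---|---|
Metric Details | This metric is used to display the total (sum) of the processing time of all ingress requests at pn32f over a particular time span. Dividing it by ocsepp_pn32f_latency_seconds_count gives the average processing time. |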
Microservice | Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.4.7 ocsepp_pn32f_latency_seconds_max
Table 5-26 ocsepp_pn32f_latency_seconds_max
Field | Details |
---|---|
Metric Details | This metric is used to display the maximum processing time of an ingress request at pn32f over a particular time span. |
Microservice | Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.4.8 ocsepp_pn32f_connection_failure_total
Table 5-27 ocsepp_pn32f_connection_failure_total
Metric Details | This metric will be pegged in the customized Jetty Client as soon as it fails to connect to the destination service. |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.4.9 ocsepp_pn32f_jetty_request_stat_metrics_total
Table 5-28 ocsepp_pn32f_jetty_request_stat_metrics_total
Field | Details |
---|---|
Metric Details | This metric is pegged for every event that occurs when a request is sent to PN32F. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.4.10 ocsepp_pn32f_jetty_response_stat_metrics_total
Table 5-29 ocsepp_pn32f_jetty_response_stat_metrics_total
Field | Details |
---|---|
Metric Details | This metric is pegged for every event that occurs when a response is received from PN32F. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.4.11 ocsepp_pn32f_outgoing_connections
Table 5-30 ocsepp_pn32f_outgoing_connections
Field | Details |
---|---|
Metric Details | Gauge metric that will peg active outgoing connections from PN32F to destination |
Microservice | Producer N32f |
Type | Gauge |
Dimensions |
|
5.1.4.12 ocsepp_pn32f_server_latency
Table 5-31 ocsepp_pn32f_server_latency
Field | Details |
---|---|
Metric Details | This metric will be pegged in Jetty response listener that captures the amount of time taken for processing of the request by jetty client |
Microservice | Producer N32f |
Type | Gauge |
Dimensions |
|
5.1.5 N32C Handshake Procedure Metrics
5.1.5.1 ocsepp_n32c_handshake_failure_attempts_total
Table 5-32 ocsepp_n32c_handshake_failure_attempts_total
Field | Details |
---|---|
Metric Details | If the N32c handshake procedure fails, this metric is pegged and the corresponding alarm is raised. |
Microservice | Producer and Consumer N32c |
Type | Counter |
Dimensions |
|
5.1.5.2 ocsepp_n32c_handshake_reInitiation_failure
Table 5-33 ocsepp_n32c_handshake_reInitiation_failure
Field | Details |
---|---|
Metric Details | If the N32c handshake reinitiation procedure fails, this metric is pegged and the corresponding alarm is raised. |
Microservice | Consumer N32c, Producer N32c |
Type | Gauge |
Dimensions |
|
5.1.5.3 ocsepp_cn32c_handshake_requests_total
Table 5-34 ocsepp_cn32c_handshake_requests_total
Field | Details |
---|---|
Metric Details |
Total number of requests sent over n32c for handshake procedure. Condition: When SEPP initiates any handshake procedure requests towards peer SEPP. |
Microservice | Consumer N32c |
Type | Counter |
Dimensions |
|
5.1.5.4 ocsepp_cn32c_handshake_response_total
Table 5-35 ocsepp_cn32c_handshake_response_total
Field | Details |
---|---|
Metric Details | Total number of responses received over n32c for the handshake procedure. Condition: When SEPP receives any handshake procedure response from the peer SEPP. The response can indicate success or failure, based on the response code. |
Microservice | Consumer N32c |
Type | Counter |
Dimensions |
|
5.1.5.5 ocsepp_cn32c_handshake_initiation_req_total
Table 5-36 ocsepp_cn32c_handshake_initiation_req_total
Field | Details |
---|---|
Metric Details |
Total number of Handshake initiation requests received from config-mgr. Condition: When handshake initiation requests are received from config-mgr. |
Microservice | Consumer N32c |
Type | Counter |
Dimensions |
|
5.1.5.6 ocsepp_cn32c_handshake_reinitiation_req_total
Table 5-37 ocsepp_cn32c_handshake_reinitiation_req_total
Field | Details |
---|---|
Metric Details | Total number of handshake reinitiation requests received from config-mgr. Condition: When handshake reinitiation requests are received from config-mgr. |
Microservice | Consumer N32c |
Type | Counter |
Dimensions |
|
5.1.5.7 ocsepp_cn32c_handshake_delete_req_total
Table 5-38 ocsepp_cn32c_handshake_delete_req_total
Field | Details |
---|---|
Metric Details |
Total number of Handshake context delete requests received from config-mgr. Condition: When handshake context delete requests are received from config-mgr. |
Microservice | Consumer N32c |
Type | Counter |
Dimensions |
|
5.1.5.8 ocsepp_pn32c_handshake_requests_total
Table 5-39 ocsepp_pn32c_handshake_requests_total
Field | Details |
---|---|
Metric Details |
Total number of requests received over n32c for handshake procedure. Condition: When any handshake procedure request is received from peer SEPP. |
Microservice | Producer N32c |
Type | Counter |
Dimensions |
|
5.1.5.9 ocsepp_pn32c_handshake_response_total
Table 5-40 ocsepp_pn32c_handshake_response_total
Field | Details |
---|---|
Metric Details | Total number of responses sent over n32c for the handshake procedure. Condition: When SEPP sends a response to a received handshake procedure request. It can be a success response or a failure response, based on the response code. |
Microservice | Producer N32c |
Type | Counter |
Dimensions |
|
5.1.6 5G SBI Message Mediation Support Metrics
5.1.6.1 ocsepp_cn32f_mediation_requests_total
Table 5-41 ocsepp_cn32f_mediation_requests_total
Field | Details |
---|---|
Metric Details | This metric is common to both CN32F and PN32F; the two are distinguished by the "app" tag. Number of requests to which a Trigger Rule was applied at SEPP for Mediation, based on configuration. |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.6.2 ocsepp_n32f_mediation_not_applied_total
Table 5-42 ocsepp_n32f_mediation_not_applied_total
Field | Details |
---|---|
Metric Details | This metric is common to both CN32F and PN32F; the two are distinguished by the "app" tag. Number of requests for which the Trigger Rule does not match at SEPP and the request is not forwarded to Mediation. For Match all configurations, the trigger points will be matched. |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.6.3 ocsepp_cn32f_mediation_response_total
Table 5-43 ocsepp_cn32f_mediation_response_total
Field | Details |
---|---|
Metric Details | Number of requests for which the CN32F service of SEPP gets a response from the Mediation Service. |
Microservice | Consumer N32f |
Type | Counter |
Dimensions |
|
5.1.6.4 ocsepp_cn32f_mediation_response_failure
Table 5-44 ocsepp_cn32f_mediation_response_failure
Field | Details |
---|---|
Metric Details | Number of requests for which the CN32F service of SEPP gets a failure response from the Mediation Service. |
Microservice | Consumer N32f |
Type | Counter |
Dimensions |
|
5.1.6.5 ocsepp_pn32f_mediation_response_total
Table 5-45 ocsepp_pn32f_mediation_response_total
Field | Details |
---|---|
Metric Details | Number of requests for which the PN32F service of SEPP gets a response from the Mediation Service. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.6.6 ocsepp_pn32f_mediation_response_failure
Table 5-46 ocsepp_pn32f_mediation_response_failure
Field | Details |
---|---|
Metric Details | Number of requests for which the PN32F service of SEPP gets a failure response from the Mediation Service. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.6.7 ocsepp_cn32f_mediation_latency_seconds_count
Table 5-47 ocsepp_cn32f_mediation_latency_seconds_count
Field | Details |
---|---|
Metric Details |
This metric is used to display the number of ingress requests processed at cn32f in a particular time span (in seconds). |
Microservice | Consumer N32f |
Type | Histogram |
Dimensions |
|
5.1.6.8 ocsepp_cn32f_mediation_latency_seconds_max
Table 5-48 ocsepp_cn32f_mediation_latency_seconds_max
Field | Details |
---|---|
Metric Details | Total time taken for processing a message (from sending a message to receiving the response). |
Microservice | Consumer N32f |
Type | Histogram |
Dimensions |
|
5.1.6.9 ocsepp_cn32f_mediation_latency_seconds_sum
Table 5-49 ocsepp_cn32f_mediation_latency_seconds_sum
Metric Details | Total time taken for processing a message (from sending a message to receiving the response). |
Microservice | Consumer N32f |
Type | Histogram |
Dimensions |
|
5.1.6.10 ocsepp_pn32f_mediation_latency_seconds_count
Table 5-50 ocsepp_pn32f_mediation_latency_seconds_count
Metric Details |
Total time taken for processing a message (from sending a message to receiving the response). |
Microservice | Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.6.11 ocsepp_pn32f_mediation_latency_seconds_max
Table 5-51 ocsepp_pn32f_mediation_latency_seconds_max
Field | Details |
---|---|
Metric Details | This metric is used to display the maximum processing time of an ingress request at pn32f over a particular time span. |
Microservice | Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.6.12 ocsepp_pn32f_mediation_latency_seconds_sum
Table 5-52 ocsepp_pn32f_mediation_latency_seconds_sum
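Field | Details |
---|---|
Metric Details | This metric is used to display the total (sum) of the processing time of all ingress requests at pn32f over a particular time span. Dividing it by the corresponding _count metric gives the average processing time. |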
Microservice | Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.6.13 ocsepp_pn32f_mediation_latency_seconds_sum
Table 5-53 ocsepp_pn32f_mediation_latency_seconds_sum
Field | Details |
---|---|
Metric Details | Time taken by Mediation Service to process request after getting called from PN32F service. |
Microservice | Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.6.14 ocsepp_n32f_mediation_requests_total
Table 5-54 ocsepp_n32f_mediation_requests_total
Field | Details |
---|---|
Metric Details |
Number of requests in which Trigger Rule Applied at SEPP end for Mediation, based on configuration. Separation happens based on "app" tag. |
Microservice | Consumer N32F, Producer N32F |
Type | Counter |
Dimensions |
|
5.1.7 Hosted SEPP Metrics
5.1.7.1 ocsepp_allowed_p_rss_routing_failure_total
Table 5-55 ocsepp_allowed_p_rss_routing_failure_total
Metric Details | Number of requests failing due to Hosted SEPP failure. |
Microservice | Consumer N32F |
Type | Counter |
Dimensions |
|
5.1.8 Message Copy Metrics
5.1.8.1 oc_ingressgateway_msgcopy_requests_total
Table 5-56 oc_ingressgateway_msgcopy_requests_total
Field | Details |
---|---|
Metric Details | This is incremented whenever a request message is sent to, or acknowledged by, Data Director (DD). |
Microservice | PLMN Ingress gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.8.2 oc_ingressgateway_msgcopy_responses_total
Table 5-57 oc_ingressgateway_msgcopy_responses_total
Field | Details |
---|---|
Metric Details | This is incremented whenever response message is sent or acknowledged from DD. |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.8.3 oc_ingressgateway_dd_unreachable
Table 5-58 oc_ingressgateway_dd_unreachable
Field | Details |
---|---|
Metric Details | This indicates whether DD is reachable: 0 - reachable, 1 - unreachable. |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Gauge |
Dimensions |
|
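Because this gauge takes only the values 0 and 1, it can be checked directly once the current sample for each gateway pod is available. The sketch below assumes the samples have already been collected into a mapping from pod name to gauge value; the function name and the pod names are hypothetical.

```python
def unreachable_pods(samples: dict) -> list:
    """Return the pods whose oc_ingressgateway_dd_unreachable value is 1."""
    return sorted(pod for pod, value in samples.items() if value == 1)


# Hypothetical pod names; 0 means DD is reachable, 1 means unreachable.
samples = {"n32-igw-pod-a": 0, "plmn-igw-pod-b": 1}
print(unreachable_pods(samples))
```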
5.1.8.4 oc_egressgateway_msgcopy_requests_total
Table 5-59 oc_egressgateway_msgcopy_requests_total
Field | Details |
---|---|
Metric Details | This is incremented whenever request message is sent or acknowledged from DD. |
Microservice | Plmn Egress Gateway, N32 Egress Gateway |
Type | Counter |
Dimensions |
|
5.1.8.5 oc_egressgateway_msgcopy_responses_total
Table 5-60 oc_egressgateway_msgcopy_responses_total
Field | Details |
---|---|
Metric Details | This is incremented whenever response message is sent or acknowledged from DD. |
Microservice | Plmn Egress Gateway, N32 Egress Gateway |
Type | Counter |
Dimensions |
|
5.1.9 SOR Metrics
5.1.9.1 ocsepp_pn32f_sor_requests_total
Table 5-62 ocsepp_pn32f_sor_requests_total
Metric Details | Number of requests sent to SOR |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.9.2 ocsepp_pn32f_sor_responses_total
Table 5-63 ocsepp_pn32f_sor_responses_total
Metric Details | Number of responses received from SOR |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.9.3 ocsepp_pn32f_sor_retry_to_producer_requests_total
Table 5-64 ocsepp_pn32f_sor_retry_to_producer_requests_total
Field | Details |
---|---|
Metric Details | Number of requests sent to Producer based on the 3GPP header |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.9.4 ocsepp_pn32f_sor_back_to_consumer_responses_total
Table 5-65 ocsepp_pn32f_sor_back_to_consumer_responses_total
Field | Details |
---|---|
Metric Details | Number of responses received from SOR and sent back to Consumer |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.9.5 ocsepp_pn32f_sor_failure_total
Table 5-66 ocsepp_pn32f_sor_failure_total
Metric Details | Number of 4xx or 5xx responses received from SOR |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.9.6 ocsepp_pn32f_sor_timeout_failure_total
Table 5-67 ocsepp_pn32f_sor_timeout_failure_total
Field | Details |
---|---|
Metric Details | Number of requests that timed out while connecting to SOR |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.10 Rate Limiting for Ingress Roaming Signaling per Remote SEPP Set Metrics
5.1.10.1 oc_ingressgateway_rss_ratelimit_total
Table 5-68 oc_ingressgateway_rss_ratelimit_total
Field | Details |
---|---|
Metric Details | Number of requests for which RSS-based rate limiting was applied and the request was successfully forwarded. |
Microservice | N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
Metric filter | Status = accepted |
5.1.10.2 oc_ingressgateway_rss_ratelimit_total
Table 5-69 oc_ingressgateway_rss_ratelimit_total
Field | Details |
---|---|
Metric Details | Number of requests for which RSS-based rate limiting was not applied. |
Microservice | N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
Metric filter | Status = ratelimit not applied |
5.1.10.3 ocsepp_configmgr_routefailure_total
Table 5-70 ocsepp_configmgr_routefailure_total
Metric Details | Metric pegged due to route sync issue in SEPP. |
Microservice | Config Manager |
Type | Counter |
Dimensions |
|
5.1.10.4 oc_ingressgateway_rss_ratelimit_total
Table 5-71 oc_ingressgateway_rss_ratelimit_total
Field | Details |
---|---|
Metric Details | Number of requests for which RSS-based rate limiting was applied but the request had to be dropped. |
Microservice | N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
Metric filter | Status = dropped |
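Since oc_ingressgateway_rss_ratelimit_total is split by the Status filter into accepted, ratelimit not applied, and dropped, the share of each outcome can be derived from the three counter values. The following is a minimal sketch under the assumption that the counts for one time window have already been retrieved; the function name and the sample numbers are hypothetical.

```python
def rss_ratelimit_shares(counts: dict) -> dict:
    """Return the fraction of requests per Status value (0.0 when idle)."""
    total = sum(counts.values())
    if total == 0:
        return {status: 0.0 for status in counts}
    return {status: value / total for status, value in counts.items()}


# Hypothetical per-window counts keyed by the Status values listed above.
shares = rss_ratelimit_shares(
    {"accepted": 900, "ratelimit not applied": 50, "dropped": 50}
)
print(shares["dropped"])
```

A rising share for the dropped status suggests that the configured limit for a Remote SEPP Set is being reached more often.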
5.1.11 Topology Hiding Metrics
5.1.11.1 ocsepp_topology_latency_seconds_count
Table 5-72 ocsepp_topology_latency_seconds_count
Field | Details |
---|---|
Metric Details |
This metric is used to display the number of ingress requests processed at cn32f and pn32f in a particular time span (in seconds). |
Microservice | Consumer N32f, Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.11.2 ocsepp_topology_latency_seconds_max
Table 5-73 ocsepp_topology_latency_seconds_max
Field | Details |
---|---|
Metric Details | This metric is used to display the maximum processing time of an ingress request at cn32f and pn32f, in seconds. |
Microservice | Consumer N32f, Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.11.3 ocsepp_topology_latency_seconds_sum
Table 5-74 ocsepp_topology_latency_seconds_sum
Field | Details |
---|---|
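Metric Details | This metric is used to display the total (sum) of the processing time of all ingress requests at cn32f and pn32f over a particular time span. Dividing it by ocsepp_topology_latency_seconds_count gives the average processing time. |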
Microservice | Consumer N32f, Producer N32f |
Type | Histogram |
Dimensions |
|
5.1.11.4 ocsepp_topology_header_success_total
Table 5-75 ocsepp_topology_header_success_total
Field | Details |
---|---|
Metric Details | Count of headers for which topology hiding and recovery was successful |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.11.5 ocsepp_topology_header_failure_total
Table 5-76 ocsepp_topology_header_failure_total
Field | Details |
---|---|
Metric Details | Count of headers for which topology hiding and recovery failed |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.11.6 ocsepp_topology_body_success_total
Table 5-77 ocsepp_topology_body_success_total
Field | Details |
---|---|
Metric Details |
Count of body attributes for which topology hiding and recovery was successful. |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.11.7 ocsepp_topology_body_failure_total
Table 5-78 ocsepp_topology_body_failure_total
Field | Details |
---|---|
Metric Details | Count of body attributes for which topology hiding and recovery failed. |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.11.8 ocsepp_topology_success_total
Table 5-79 ocsepp_topology_success_total
Metric Details | Count of messages for which topology hiding or recovery was successful |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.11.9 ocsepp_topology_invalid_header_regex_configured_total
Table 5-80 ocsepp_topology_invalid_header_regex_configured_total
Field | Details |
---|---|
Metric Details | If configured header regex pattern is invalid, this metric will be pegged. |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.11.10 ocsepp_topology_header_regex_not_configured_total
Table 5-81 ocsepp_topology_header_regex_not_configured_total
Field | Details |
---|---|
Metric Details | If header regex pattern is not configured, this metric will be pegged. |
Microservice | Consumer N32f, Producer N32f |
Type | Counter |
Dimensions |
|
5.1.12 Cat 0 - SBI Message Schema Validation Metrics
5.1.12.1 ocsepp_message_validation_applied_total
Table 5-82 ocsepp_message_validation_applied_total
Field | Details |
---|---|
Metric Details | Total number of requests for which message validation is applied. |
Microservice | Producer N32f, Consumer N32F |
Type | Counter |
Dimensions |
|
Note:
- The dimension "peer_plmn_id" is applicable only for Consumer N32F.
- An additional dimension "sourceRss" is applicable for Consumer N32F in Roaming Hub mode.
5.1.12.2 ocsepp_message_validation_on_body_failure_total
Table 5-83 ocsepp_message_validation_on_body_failure_total
Metric Details | Number of requests in which message validation failed on body at SEPP end. |
Microservice | Producer N32f, Consumer N32F |
Type | Counter |
Dimensions |
|
Note:
- The dimension "peer_plmn_id" is applicable only for Consumer N32F.
- An additional dimension "sourceRss" is applicable for Consumer N32F in Roaming Hub mode.
5.1.12.3 ocsepp_message_validation_on_header_failure_total
Table 5-84 ocsepp_message_validation_on_header_failure_total
Metric Details | Number of requests for which message validation failed on query parameters at SEPP end. |
Microservice | Producer N32f, Consumer N32F |
Type | Counter |
Dimensions |
|
Note:
- The dimension "peer_plmn_id" is applicable only for Consumer N32F.
- An additional dimension "sourceRss" is applicable for Consumer N32F in Roaming Hub mode.
5.1.13 Cat 1 - Service API Validation Metrics
5.1.13.1 ocsepp_security_service_api_failure_total
Table 5-85 ocsepp_security_service_api_failure_total
Field | Details |
---|---|
Metric Details | This metric is common to both CN32F and PN32F; the two are distinguished by the "app" tag. Number of requests that failed because the Method and Resource URI were not allowed. |
Microservice | N32f |
Type | Counter |
Dimensions |
|
5.1.14 Cat 2 - Network ID Validation Metrics
5.1.14.1 ocsepp_network_id_validation_body_failure_total
Table 5-86 ocsepp_network_id_validation_body_failure_total
Field | Details |
---|---|
Metric Details | Number of requests for which Network ID body validation failed. |
Microservice | Producer N32f, Consumer N32F |
Type | Counter |
Dimensions |
|
5.1.14.2 ocsepp_network_id_validation_header_failure_total
Table 5-87 ocsepp_network_id_validation_header_failure_total
Field | Details |
---|---|
Metric Details | Number of requests for which Network ID header validation failed. |
Microservice | Producer N32f, Consumer N32F |
Type | Counter |
Dimensions |
|
5.1.15 Cat 3 - Previous Location Check Metrics
5.1.15.1 ocsepp_previous_location_exception_failure_total
Table 5-88 ocsepp_previous_location_exception_failure_total
Field | Details |
---|---|
Metric Details | Number of requests for which the previous location validation check failed due to exceptions. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.15.2 ocsepp_previous_location_validation_success_total
Table 5-89 ocsepp_previous_location_validation_success_total
Field | Details |
---|---|
Metric Details | Number of requests for which previous location validation was successful. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.15.3 ocsepp_previous_location_validation_failure_total
Table 5-90 ocsepp_previous_location_validation_failure_total
Metric Details | Number of requests for which the previous location validation check failed. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.15.4 ocsepp_previous_location_validation_requests_total
Table 5-91 ocsepp_previous_location_validation_requests_total
Field | Details |
---|---|
Metric Details | Number of requests to which the previous location validation feature was applied. |
Microservice | Producer N32f |
Type | Counter |
Dimensions |
|
5.1.16 Rate Limiting for Egress Roaming Signaling per PLMN Metrics
5.1.16.1 oc_ingressgateway_plmn_egress_ratelimit_total
Table 5-93 oc_ingressgateway_plmn_egress_ratelimit_total
Metric Details | Number of requests for which Egress Rate Limiting was applied and request was successfully forwarded because tokens were available for the Egress Rate Limit List. |
Microservice | Ingress Gateway |
Type | Counter |
Dimensions |
|
Status | ERL_MATCH_TOKEN_AVAILABLE_FWD |
5.1.16.2 oc_ingressgateway_plmn_egress_ratelimit_total
Table 5-94 oc_ingressgateway_plmn_egress_ratelimit_total
Metric Details | Number of requests for which Egress Rate Limiting was applied but no tokens were available to process the request, and the request was rejected because its priority was low (above the configured cutoff). |
Microservice | Ingress Gateway |
Type | Counter |
Dimensions |
|
Status | ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT |
5.1.16.3 oc_ingressgateway_plmn_egress_ratelimit_total
Table 5-95 oc_ingressgateway_plmn_egress_ratelimit_total
Metric Details | Number of requests for which Egress Rate Limiting was applied but no tokens were available to process the request, and the request was still forwarded because its priority was high (below the configured cutoff). |
Microservice | Ingress Gateway |
Type | Counter |
Dimensions |
|
Status | ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD |
5.1.16.4 oc_ingressgateway_plmn_egress_ratelimit_total
Table 5-96 oc_ingressgateway_plmn_egress_ratelimit_total
Metric Details | Number of requests for which rate limiting could not be applied because an invalid PLMN ID was sent in the request. The request was forwarded. |
Microservice | Ingress Gateway |
Type | Counter |
Dimensions |
|
Status | ERROR_UNABLE_TO_EXTRACT_PLMN_FWD |
5.1.16.5 oc_ingressgateway_plmn_egress_ratelimit_total
Table 5-97 oc_ingressgateway_plmn_egress_ratelimit_total
Metric Details | Number of requests for which rate limiting could not be applied because none of the Egress Rate Limit Lists contains the corresponding PLMN ID. The request was forwarded. |
Microservice | Ingress Gateway |
Type | Counter |
Dimensions |
|
Status | ERL_NO_MATCH_FWD |
5.1.16.6 oc_ingressgateway_plmn_egress_ratelimit_total
Table 5-98 oc_ingressgateway_plmn_egress_ratelimit_total
Metric Details | Number of requests for which rate limiting could not be applied because an unexpected exception was raised while the rate limit filter was executing for the request. The request was forwarded. |
Microservice | Ingress Gateway |
Type | Counter |
Dimensions |
|
Status | ERROR_INTERNAL_FWD |
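The six Status values in Tables 5-93 to 5-98 can be read as the outcomes of a single decision sequence. The following Python sketch is only an illustration of that reading, not the gateway's implementation: the function name `erl_status`, its parameters, the order of the checks, and the treatment of a priority equal to the cutoff as low priority are all assumptions.

```python
from typing import Dict, Optional, Set


def erl_status(plmn_id: Optional[str],
               erl_lists: Dict[str, Set[str]],
               tokens_available: Dict[str, bool],
               priority: int,
               cutoff: int) -> str:
    """Return the Status dimension value for one request (hypothetical helper).

    erl_lists maps an Egress Rate Limit List name to the PLMN IDs it contains.
    tokens_available maps a list name to whether a token is currently free.
    A numerically lower priority value is treated as higher priority.
    """
    try:
        # No usable PLMN ID in the request: rate limiting is skipped.
        if not plmn_id:
            return "ERROR_UNABLE_TO_EXTRACT_PLMN_FWD"

        # Find the first list that contains the PLMN ID, if any.
        matched = next(
            (name for name, plmns in erl_lists.items() if plmn_id in plmns),
            None,
        )
        if matched is None:
            return "ERL_NO_MATCH_FWD"

        if tokens_available[matched]:
            return "ERL_MATCH_TOKEN_AVAILABLE_FWD"

        # No token left: only high-priority requests are forwarded.
        if priority < cutoff:
            return "ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD"
        return "ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"
    except Exception:
        # Any unexpected failure in the filter still forwards the request.
        return "ERROR_INTERNAL_FWD"
```

In this sketch only ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT corresponds to a rejected request; every other status corresponds to a forwarded request, which matches the metric descriptions above.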
5.1.17 Config Manager Metrics
5.1.17.1 ocsepp_configmgr_rpp_config_failure_total
Table 5-99 ocsepp_configmgr_rpp_config_failure_total
Field | Details |
---|---|
Metric Details | This metric is pegged whenever there is a Remote SEPP configuration failure. |
Microservice | Config Manager |
Type | Counter |
Dimensions |
|
5.1.17.2 ocsepp_configmgr_rpp_validation_failure_total
Table 5-100 ocsepp_configmgr_rpp_validation_failure_total
Field | Details |
---|---|
Metric Details | This metric is pegged whenever a mandatory parameter of the Remote SEPP Profile is changed or is missing. |
Microservice | Config Manager |
Type | Counter |
Dimensions |
|
5.1.17.3 ocsepp_configmgr_routeupdate_total
Table 5-101 ocsepp_configmgr_routeupdate_total
Field | Details |
---|---|
Metric Details | This metric shows the total number of times config-mgr microservice has tried to update the route details for Egress Gateway microservices. |
Microservice | Config Manager |
Type | Counter |
Dimensions |
|
5.1.18 Support for TLS 1.3 Metrics
5.1.18.1 oc_ingressgateway_incoming_tls_connections
Table 5-102 oc_ingressgateway_incoming_tls_connections
Field | Details |
---|---|
Metric Details | Number of TLS connections received on the Ingress Gateway and their negotiated versions. The version can be TLS 1.2 or TLS 1.3. |
Microservice | PLMN Ingress Gateway, N32 Ingress Gateway |
Type | Gauge |
Dimensions |
|
5.1.18.2 oc_egressgateway_outgoing_tls_connections
Table 5-103 oc_egressgateway_outgoing_tls_connections
Field | Details |
---|---|
Metric Details | Number of TLS connections sent on the Egress Gateway and their negotiated versions. The version can be TLS 1.2 or TLS 1.3. |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Gauge |
Dimensions |
|
5.1.18.3 security_cert_x509_expiration_seconds
Table 5-104 security_cert_x509_expiration_seconds
Metric Details | Time to certificate expiry in epoch seconds. |
Microservice | PLMN Ingress Gateway, N32 Ingress Gateway, PLMN Egress Gateway, N32 Egress Gateway |
Type | Histogram |
Dimensions |
|
5.1.19 Ingress and Egress Gateway Metrics
Table 5-105 Ingress and Egress Gateway Dimensions
Dimension | Details |
---|---|
NFType | Name of the NF type in the path. Example: for the path /nxxx-yyy/vz/......., XXX (in uppercase) is the NFType. The value is UNKNOWN if the NFType cannot be extracted from the path. |
NFServiceType | Name of the service within the NF. Example: for the path /nxxx-yyy/vz/......., nxxx-yyy is the NFServiceType. The value is UNKNOWN if the NFServiceType cannot be extracted from the path. |
receivedResponseCode (Pod readiness state metric) | receivedResponseCode (Pod readiness state metric) |
id (Pod readiness state metric) | Service profile ID of the backend service |
uri (Pod readiness state metric) | Service profile URI of the backend service |
event | This tag captures the lifecycle event processed during Jetty request processing with the backend service |
Host | (IP or FQDN):port of the Ingress Gateway |
DestinationHost | Destination IP or FQDN |
client_type | client_type |
HttpVersion | Http protocol version |
oc_ingressgateway_pod_resource_stress_Type | The type of resource for which the pod protection threshold has been reached, for example, CPU, MEMORY, or PENDING_REQUEST |
XfccHeaderPresent | XfccHeaderPresent |
consumerNfType | consumerNfType |
Scheme | Http protocol scheme |
Path | Path predicate that matched the current request |
ClientCertIdentity | Certificate identity of the client |
content_available | content_available |
Route_Path | Path predicate/Header predicate that matched the current request |
InstanceIdentifier | Prefix of the pod configured in helm when there are multiple instances in same deployment |
jetty_request_timeout | Tag to capture if a request at IGW failed due to request timeout at jetty level |
Virtual Host | The fqdn which requires alternate route svc resolution |
error_reason | Reason for failure response received. If message is sent in the response, then it is filled with the message otherwise exception class is filled. In case of successful response it is filled with "no-error" |
ErrorOriginator | This tag captures the ErrorOriginator |
quantile | This tag captures the latency values with ranges as 10ms, 20ms, 40ms, 80ms, 100ms, 200ms, 500ms, 1000ms and 5000ms |
oc_ingressgateway_xfcc_header_validate_ServiceType | Name of the service within the NF. |
oc_ingressgateway_dns_resolution_Status | oc_ingressgateway_dns_resolution_Status |
oc_ingressgateway_global_ratelimit_Status | Request accepted or dropped |
oc_ingressgateway_global_ratelimit_total_app | Application at which traffic rejection occurs - n32-ingress-gateway or plmn-ingress-gateway |
oc_ingressgateway_global_ratelimit_total_Method | Request method received (POST, PUT, GET, PATCH, DELETE) |
oc_ingressgateway_connection_failure_Host | Destination IP or FQDN |
oc_ingressgateway_connection_failure_Port | Destination port |
oc_ingressgateway_connection_failure_Direction | This tag determines the direction in which there is connection failure at IGW |
oc_ingressgateway_xfcc_header_validate_Status | oc_ingressgateway_xfcc_header_validate_Status |
oc_ingressgateway_xfcc_header_validate_Cause | This tag determines the validation cause for the xfcc header validation metric being pegged |
oc_ingressgateway_incoming_pod_connections_rejected_Direction | The incoming connections rejected at IGW handled in pod protection |
oc_ingressgateway_xfcc_header_validate_CertsCompared | This tag captures the total number of certificates compared in XFCC header at IGW during the header validation |
oc_configclient_request_total_releaseVersion | This tag indicates the current release version of ingress gateway |
oc_configclient_request_total_configVersion | This tag indicates the configuration version that ingress gateway is currently maintaining |
oc_configclient_response_total_releaseVersion | This tag indicates the current release version of ingress gateway |
oc_configclient_response_total_updated | This tag indicates whether the configuration was updated or not |
oc_ingressgateway_incoming_connections_Direction | This tag indicates the direction of the established connection, that is, whether it is incoming or outgoing |
oc_ingressgateway_incoming_connections_Host | This tag indicates the remote address of the client connected to ingress gateway |
oc_ingressgateway_outgoing_connections_Direction | This tag indicates the direction of the established connection, that is, whether it is incoming or outgoing |
oc_ingressgateway_outgoing_connections_Host | This tag indicates the address of the destination |
Proxy | Value received for "x-custom-egress-proxy-header". |
ConnectedHostIp | This tag captures the IP of destination host to which EGW sends ping requests |
ConnectedHostFqdn | This tag captures the fqdn of destination host to which EGW sends ping requests |
ConnectedHostPort | This tag captures the port of destination host to which EGW sends ping requests |
oc_egressgateway_connection_failure_Host | Destination IP or FQDN |
oc_egressgateway_connection_failure_Port | Destination port |
oc_egressgateway_incoming_connections_Direction | This tag indicates the direction of the established connection, that is, whether it is incoming or outgoing |
oc_egressgateway_incoming_connections_Host | This tag indicates the remote address of the client connected to egress gateway |
oc_egressgateway_outgoing_connections_Direction | This tag indicates the direction of the established connection, that is, whether it is incoming or outgoing |
oc_egressgateway_outgoing_connections_Host | This tag indicates the address of the destination |
EndpointName | Request sent for |
Reroute_Path | Path that matched the request to its corresponding route. Example: /nef/** |
Attempt | Attempt number for SCP re-route. Example: 1, 2, and so on |
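The NFType and NFServiceType dimensions in Table 5-105 are both derived from the first segment of the request path. A minimal Python sketch of that derivation is shown here, assuming that the path has the form /n<nftype>-<service>/v<digits>/... and that any other shape yields UNKNOWN. The function name `nf_tags` and the regular expression are hypothetical and simplified; they are not taken from the gateway code.

```python
import re
from typing import Tuple

# First path segment: "n" + NF type + "-" + service suffix, followed by a
# version segment such as v1. This pattern is an assumption for illustration.
_PATH_RE = re.compile(r"^/(n([a-z0-9]+)-[a-z0-9-]+)/v\d+(?:/|$)")


def nf_tags(path: str) -> Tuple[str, str]:
    """Return (NFType, NFServiceType) for a request path.

    Both values fall back to "UNKNOWN" when the path does not match.
    """
    match = _PATH_RE.match(path)
    if match is None:
        return ("UNKNOWN", "UNKNOWN")
    nf_service_type = match.group(1)      # for example, namf-comm
    nf_type = match.group(2).upper()      # for example, AMF
    return (nf_type, nf_service_type)


if __name__ == "__main__":
    print(nf_tags("/namf-comm/v1/subscriptions"))
    print(nf_tags("/health/ready"))
```

With the apiUrl example /namf-comm/v1/subscriptions from Table 5-1, this sketch yields AMF as the NFType and namf-comm as the NFServiceType.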
5.1.19.1 Ingress Gateway Metrics
This section provides information about the Ingress Gateway metrics used in SEPP.
5.1.19.1.1 oc_ingressgateway_http_requests_total
Table 5-106 oc_ingressgateway_http_requests_total
Metric Details | This metric will be pegged as soon as the request reaches the Ingress gateway in the first custom filter of the application. |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.1.2 oc_ingressgateway_http_responses_total
Table 5-107 oc_ingressgateway_http_responses_total
Field | Details |
---|---|
Metric Details | This metric will be pegged in the last custom filter of the Ingress gateway while the response is being sent back to the consumer NF. |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.1.3 oc_ingressgateway_request_latency_seconds
Table 5-108 oc_ingressgateway_request_latency_seconds
Field | Details |
---|---|
Metric Details | This metric will be pegged in the last custom filter of the Ingress gateway while the response is being sent back to the consumer NF. This metric tracks the amount of time taken for processing the request. It starts as soon as the request reaches the first custom filter of the application and lasts until the response is sent back to the consumer NF from the last custom filter of the application. |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Histogram |
Dimensions |
|
5.1.19.1.4 oc_ingressgateway_request_latency_seconds_count
Table 5-109 oc_ingressgateway_request_latency_seconds_count
Field | Details |
---|---|
Metric Details | This metric is used to display the number of ingress requests processed in a particular time span (in seconds). |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Histogram |
Dimensions |
|
5.1.19.1.5 oc_ingressgateway_request_latency_seconds_sum
Table 5-110 oc_ingressgateway_request_latency_seconds_sum
Field | Details |
---|---|
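Metric Details | This metric is used to derive the average processing time of all ingress requests over a particular time span. It holds the sum of the observed latencies and is divided by oc_ingressgateway_request_latency_seconds_count to obtain the average. |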
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Histogram |
Dimensions |
|
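The _sum and _count series of a latency histogram are usually read together: the increase in _sum over an interval divided by the increase in _count over the same interval gives the average latency for that interval. The following Python sketch illustrates the arithmetic under that assumption. The function name and the sample values are hypothetical, and counter resets are not handled.

```python
from typing import Optional


def average_latency_ms(sum_start: float, count_start: float,
                       sum_end: float, count_end: float) -> Optional[float]:
    """Average request latency in milliseconds between two scrapes.

    sum_* are readings of oc_ingressgateway_request_latency_seconds_sum and
    count_* are readings of oc_ingressgateway_request_latency_seconds_count.
    Returns None when no request completed in the interval.
    """
    delta_count = count_end - count_start
    if delta_count <= 0:
        return None
    delta_sum_seconds = sum_end - sum_start
    # Convert seconds to milliseconds before dividing by the request count.
    return (delta_sum_seconds * 1000.0) / delta_count


if __name__ == "__main__":
    # 50 requests took 2.5 seconds in total during the interval.
    print(average_latency_ms(10.0, 100, 12.5, 150))
```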
5.1.19.1.6 oc_configclient_request_total
Table 5-111 oc_configclient_request_total
Metric Details | This metric will be pegged whenever config client is polling for configuration update from common configuration server |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.1.7 oc_configclient_response_total
Table 5-112 oc_configclient_response_total
Metric Details | This metric will be pegged whenever the config client receives a response from the common configuration server. |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.1.8 oc_configserver_reachability
Table 5-113 oc_configserver_reachability
Metric Details | Gauge metric to peg the reachability of config server |
Microservice | Plmn Ingress Gateway, N32 Ingress Gateway |
Type | Gauge |
Dimensions |
|
5.1.19.1.9 oc_ingressgateway_incoming_connections
Table 5-114 oc_ingressgateway_incoming_connections
Field | Details |
---|---|
Metric Details | Gauge metric that will peg active incoming connections from client to ingress gateway |
Microservice | PLMN Ingress Gateway, N32 Ingress Gateway |
Type | Gauge |
Dimensions |
|
5.1.19.1.10 oc_ingressgateway_outgoing_connections
Table 5-115 oc_ingressgateway_outgoing_connections
Field | Details |
---|---|
Metric Details | Gauge metric that will peg active outgoing connections from ingress gateway to destination |
Microservice | PLMN Ingress Gateway, N32 Ingress Gateway |
Type | Gauge |
Dimensions |
|
5.1.19.1.11 oc_ingressgateway_connection_failure_total
Table 5-116 oc_ingressgateway_connection_failure_total
Field | Details |
---|---|
Metric Details | This metric is pegged in the customized Jetty client, with direction as ingressOut, as soon as the client fails to connect to the destination service. For the Ingress Gateway, the destination service is a backend microservice of the NF. The metric is also pegged, with direction as ingress, for TLS connection failures when connecting to the Ingress Gateway. |
Microservice | PLMN Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.1.12 oc_ingressgateway_global_ratelimit_total
Table 5-117 oc_ingressgateway_global_ratelimit_total
Field | Details |
---|---|
Metric Details | This metric will be pegged in the custom filter implemented to check the global rate limit conditions. |
Microservice | PLMN Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.1.13 oc_ingressgateway_request_content_metrics_total
Table 5-118 oc_ingressgateway_request_content_metrics_total
Field | Details |
---|---|
Metric Details | This metric will be pegged by default filter RequestContentMetrics. It pegs whether request has request body or not. |
Microservice | PLMN Ingress Gateway, N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.1.14 oc_ingressgateway_request_processing_latency_seconds
Table 5-119 oc_ingressgateway_request_processing_latency_seconds
Field | Details |
---|---|
Metric Details | This metric will be pegged in the last custom filter of the Ingress gateway while the response is being sent back to the consumer NF. This metric captures the amount of time taken for processing of the request only within Ingress gateway. It starts as soon as the request reaches the first custom filter of the application and lasts until the request is forwarded to the destination. |
Microservice | PLMN Ingress Gateway, N32 Ingress Gateway |
Type | Histogram |
Dimensions |
|
5.1.19.1.15 oc_ingressgateway_route_overloadcontrol_total
Table 5-120 oc_ingressgateway_route_overloadcontrol_total
Metric Details | When overload control is enabled, this metric is pegged for every incoming request and indicates whether the request was accepted or discarded. |
Microservice | N32 Ingress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.2 Egress Gateway Metrics
This section provides information about the Egress Gateway metrics used in SEPP.
5.1.19.2.1 oc_egressgateway_http_requests_total
Table 5-121 oc_egressgateway_http_requests_total
Field | Details |
---|---|
Metric Details | This metric will be pegged as soon as the request reaches the Egress gateway in the first custom filter of the application with direction as egress. This will also be pegged when the request goes out of egress in Jetty Request Listener with direction as egressOut. |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.2.2 oc_egressgateway_http_responses_total
Table 5-122 oc_egressgateway_http_responses_total
Metric Details | This metric will be pegged in the last custom filter of the Egress gateway while the response is being sent back to backend NF microservice with direction as egress. This will also be pegged when the response is fetched in Jetty responseListener with direction as egressOut. The BlacklistedFqdn tag is populated when the request is sent to a blacklisted producer. |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.2.3 oc_egressgateway_incoming_connections
Table 5-123 oc_egressgateway_incoming_connections
Field | Details |
---|---|
Metric Details | Gauge metric that will peg active incoming connections from client to egress gateway |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Gauge |
Dimensions |
|
5.1.19.2.4 oc_egressgateway_outgoing_connections
Table 5-124 oc_egressgateway_outgoing_connections
Field | Details |
---|---|
Metric Details | Gauge metric that will peg active outgoing connections from egress gateway to destination |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Gauge |
Dimensions |
|
5.1.19.2.5 oc_egressgateway_connection_failure_total
Table 5-125 oc_egressgateway_connection_failure_total
Field | Details |
---|---|
Metric Details | This metric will be pegged by jetty client when the destination is not reachable by egress gateway. Here the destination is producer NF. |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.2.6 oc_egressgateway_sbiRouting_http_requests_total
Table 5-126 oc_egressgateway_sbiRouting_http_requests_total
Field | Details |
---|---|
Metric Details | This metric is pegged in the SBIRoutingFilter only when SBIRouting feature is enabled for a route to which request is sent to EGW. |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.2.7 oc_egressgateway_sbiRouting_http_responses_total
Table 5-127 oc_egressgateway_sbiRouting_http_responses_total
Field | Details |
---|---|
Metric Details | This metric will be pegged in the SBIRoutingFilter only when SBI Routing feature is enabled for a route to which request is sent to EGW and when sbiRerouteEnabled is set to true and reroute mechanism is executed. |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Counter |
Dimensions |
|
5.1.19.2.8 oc_egressgateway_server_latency_seconds
Table 5-128 oc_egressgateway_server_latency_seconds
Field | Details |
---|---|
Metric Details | This metric will be pegged in Jetty response listener that captures the amount of time taken for processing of the request by jetty client. |
Microservice | PLMN Egress Gateway, N32 Egress Gateway |
Type | Histogram |
Dimensions |
|
5.1.19.2.9 oc_fqdn_alternate_route_total
Table 5-129 oc_fqdn_alternate_route_total
Field | Details |
---|---|
Metric Details | Tracks the number of registration, deregistration, and GET calls received for a given scheme and FQDN. Note: Registration does not reflect the number of active registrations; it captures the number of registration requests received. |
Microservice | Egress Gateway |
Type | Counter |
Dimensions |
type: Register/Deregister/GET binding_value: <scheme>+<FQDN> |
5.1.19.2.10 oc_dns_srv_lookup_total
Table 5-130 oc_dns_srv_lookup_total
Field | Details |
---|---|
Metric Details | Tracks the number of times the DNS SRV lookup was done for a given scheme and FQDN. |
Microservice | Egress Gateway |
Type | Counter |
Dimensions |
binding_value: <scheme>+<FQDN> |
5.1.19.2.11 oc_alternate_route_resultset
Table 5-131 oc_alternate_route_resultset
Field | Details |
---|---|
Metric Details | Provides the number of alternate routes known for a given scheme and FQDN. Whenever a DNS SRV lookup or static configuration is done, this metric provides the number of known alternate routes for the given pair. For example, <"http", "abc.oracle.com">: 2. |
Microservice | Egress Gateway |
Type | Gauge |
Dimensions |
binding_value: <scheme>+<FQDN> |
5.1.19.2.12 oc_configclient_request_total
Table 5-132 oc_configclient_request_total
Field | Details |
---|---|
Metric Details |
This metric is pegged whenever a polling request is made from config client to the server for configuration updates. |
Microservice | Egress Gateway |
Type | Counter |
Dimensions |
Tags: releaseVersion, configVersion.
|
5.1.19.2.13 oc_configclient_response_total
Table 5-133 oc_configclient_response_total
Field | Details |
---|---|
Metric Details |
This metric is pegged whenever a response is received from the server to client. |
Microservice | Egress Gateway |
Type | Counter |
Dimensions |
Tags: releaseVersion, configVersion, updated.
|
5.2 SEPP KPIs
This section provides information about the SEPP KPIs.
5.2.1 N32C Handshake Procedure KPIs
5.2.1.1 cn32c Handshake Requests Per Remote SEPP
Table 5-134 cn32c Handshake Requests Per Remote SEPP
KPI Detail | Measures the cn32c handshake requests per remote SEPP. |
Metric Used for KPI | sum(ocsepp_cn32c_handshake_requests_total{namespace=~"$Namespace"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name) |
Service Operation | n32c Handshake Request |
Response Code | NA |
5.2.1.2 cn32c Handshake Success Rate
Table 5-135 cn32c Handshake Success Rate
KPI Detail | Measures the cn32c handshake success rate. |
Metric Used for KPI |
(sum(ocsepp_cn32c_handshake_response_total{namespace=~"$Namespace",responseCode="200 OK"})/sum(ocsepp_cn32c_handshake_requests_total{namespace=~"$Namespace"}))*100 |
Service Operation | n32c handshake success rate |
Response Code | 200 OK |
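The success-rate expression in Table 5-135 divides the sum of the 200 OK handshake responses by the sum of all handshake requests and scales the result to a percentage. The following Python sketch reproduces that arithmetic on in-memory samples, assuming each sample is a pair of a label dictionary and a counter value. The type alias, the function name, and the sample data in the usage lines are hypothetical.

```python
from typing import Dict, Iterable, Optional, Tuple

# One time series sample: (labels, current counter value).
Sample = Tuple[Dict[str, str], float]


def handshake_success_rate(responses: Iterable[Sample],
                           requests: Iterable[Sample]) -> Optional[float]:
    """Percentage of handshake requests answered with "200 OK".

    responses: samples of ocsepp_cn32c_handshake_response_total.
    requests: samples of ocsepp_cn32c_handshake_requests_total.
    Returns None when there are no requests, to avoid dividing by zero.
    """
    ok = sum(value for labels, value in responses
             if labels.get("responseCode") == "200 OK")
    total = sum(value for _, value in requests)
    if total == 0:
        return None
    return 100.0 * ok / total


if __name__ == "__main__":
    responses = [({"responseCode": "200 OK"}, 45.0),
                 ({"responseCode": "other"}, 5.0)]
    requests = [({}, 50.0)]
    print(handshake_success_rate(responses, requests))
```

The same calculation applies to the pn32c success rate when the pn32c response and request counters are supplied instead.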
5.2.1.3 cn32c Handshake Response Per Remote SEPP
Table 5-136 cn32c Handshake Response Per Remote SEPP
KPI Detail | Measures the cn32c handshake response per remote SEPP. |
Metric Used for KPI | sum(ocsepp_cn32c_handshake_response_total{namespace=~"$Namespace"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name) |
Service Operation | n32c Handshake |
Response Code | All |
5.2.1.4 cn32c Handshake Failure Per Remote SEPP
Table 5-137 cn32c Handshake Failure Per Remote SEPP
KPI Detail | Measures the cn32c handshake failure per remote SEPP |
Metric Used for KPI | sum(ocsepp_n32c_handshake_failure_attempts_total{namespace=~"$Namespace",app="cn32c-svc"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name) |
Service Operation | n32c Handshake |
Response Code | 4xx and 5xx |
5.2.1.5 pn32c Handshake Requests Total Per Remote SEPP
Table 5-138 pn32c Handshake Requests Total Per Remote SEPP
KPI Detail | Measures the pn32c handshake requests total per remote SEPP |
Metric Used for KPI | sum(ocsepp_pn32c_handshake_requests_total{namespace=~"$Namespace"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name) |
Service Operation | n32c Handshake |
Response Code | All |
5.2.1.6 pn32c Handshake Response Total Per Remote SEPP
Table 5-139 pn32c Handshake Response Total Per Remote SEPP
KPI Detail | Measures the pn32c handshake response total per remote SEPP |
Metric Used for KPI | sum(ocsepp_pn32c_handshake_response_total{namespace=~"$Namespace"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name) |
Service Operation | n32c Handshake |
Response Code | All |
5.2.1.7 pn32c Handshake Success rate
Table 5-140 pn32c Handshake Success rate
KPI Detail | Measures the pn32c handshake success rate. |
Metric Used for KPI |
(sum(ocsepp_pn32c_handshake_response_total{namespace=~"$Namespace",responseCode="200 OK"})/sum(ocsepp_pn32c_handshake_requests_total{namespace=~"$Namespace"}))*100 |
Service Operation | n32c Handshake |
Response Code | 200 OK |
5.2.1.8 pn32c Handshake Failure Per Remote SEPP
Table 5-141 pn32c Handshake Failure Per Remote SEPP
KPI Detail | Measures the pn32c handshake failure total |
Metric Used for KPI |
sum(ocsepp_n32c_handshake_failure_attempts_total{namespace=~"$Namespace",app="pn32c-svc"})by(peer_domain, peer_fqdn, peer_plmn_id) |
Service Operation | n32c Handshake |
Response Code | 4xx and 5xx |
5.2.2 SEPP Common KPIs
5.2.2.1 Memory Usage per POD
Table 5-142 Memory Usage per POD
KPI Detail | Measures the memory usage per POD |
Metric Used for KPI |
sum(container_memory_usage_bytes{namespace=~"$Namespace",image!=""}/(1024*1024*1024)) by (pod) |
Service Operation | NA |
Response Code | NA |
5.2.2.2 CPU Usage per POD
Table 5-143 CPU Usage per POD
KPI Detail | Measures the CPU usage per POD |
Metric Used for KPI | sum(rate(container_cpu_usage_seconds_total{namespace=~"$Namespace",image!=""}[2m])) by (pod) * 1000 |
Service Operation | NA |
Response Code | NA |
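The two per-pod expressions above apply unit conversions: the memory expression divides bytes by 1024*1024*1024, and the CPU expression multiplies a per-second rate of CPU seconds by 1000, which corresponds to millicores. The Python sketch below illustrates both conversions. The function names and sample values are hypothetical, and the CPU calculation is a simplification, because the PromQL rate() function also accounts for counter resets and extrapolation.

```python
def memory_gib(usage_bytes: float) -> float:
    """Convert a container_memory_usage_bytes reading to GiB."""
    return usage_bytes / (1024 * 1024 * 1024)


def cpu_millicores(cpu_seconds_start: float,
                   cpu_seconds_end: float,
                   window_seconds: float) -> float:
    """Approximate CPU usage in millicores over a window.

    cpu_seconds_* are readings of container_cpu_usage_seconds_total taken
    window_seconds apart, for example 120 seconds for a [2m] range.
    """
    per_second = (cpu_seconds_end - cpu_seconds_start) / window_seconds
    return per_second * 1000.0


if __name__ == "__main__":
    print(memory_gib(2 * 1024 ** 3))
    print(cpu_millicores(100.0, 160.0, 120.0))
```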
5.2.2.3 Total Ingress gateway requests
Table 5-144 Total Ingress gateway requests
Field | Details |
---|---|
KPI Detail | Measures the total Ingress gateway requests |
Metric Used for KPI | sum((oc_ingressgateway_http_requests_total{namespace=~"$Namespace"}))by(app,HttpVersion,Direction) |
5.2.2.4 Total Egress gateway requests
Table 5-145 Total Egress gateway requests
Field | Details |
---|---|
KPI Detail | Measures the total egress gateway requests |
Metric Used for KPI | sum((oc_egressgateway_http_requests_total{namespace=~"$Namespace"}))by(app,HttpVersion,Direction) |
5.2.2.5 Total Ingress gateway responses
Table 5-146 Total Ingress gateway responses
Field | Details |
---|---|
KPI Detail | Measures the total Ingress gateway responses |
Metric Used for KPI | sum((oc_ingressgateway_http_responses_total{namespace=~"$Namespace"}))by(app,HttpVersion,Direction) |
5.2.2.6 Total Egress gateway responses
Table 5-147 Total Egress gateway responses
Field | Details |
---|---|
KPI Detail | Measures the total Egress gateway responses |
Metric Used for KPI | sum((oc_egressgateway_http_responses_total{namespace=~"$Namespace"}))by(app,HttpVersion,Direction) |
5.2.2.7 IGW Processing Time (ms)
Table 5-148 IGW Processing Time (ms)
Field | Details |
---|---|
KPI Detail | Measures the IGW Processing Time |
Metric Used for KPI | sum(irate(oc_ingressgateway_request_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(Method,app) /sum(irate(oc_ingressgateway_request_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(Method,app) |
5.2.2.8 PercentageDiscard
Table 5-149 PercentageDiscard
KPI Detail | Measures the number of Discard requests for Percentage based scheme |
Metric Used for KPI | sum(irate(oc_ingressgateway_route_overloadcontrol_total{DiscardAction="PercentageBased",Status="DISCARDED",namespace=~"$Namespace"}[2m])) |
5.2.2.9 PriorityDiscard
Table 5-150 PriorityDiscard
KPI Detail | Measures the number of Discard requests for Priority based scheme |
Metric Used for KPI | sum(irate(oc_ingressgateway_route_overloadcontrol_total{DiscardAction="PriorityBased",Status="DISCARDED",namespace=~"$Namespace"}[2m])) |
5.2.3 CN32F Common KPIs
5.2.3.1 cn32f Routing Success Rate
Table 5-151 cn32f Routing Success Rate
KPI Detail | Measures the cn32f routing success rate. |
Metric Used for KPI |
(sum(ocsepp_cn32f_response_total{namespace=~"$Namespace"})/sum(ocsepp_cn32f_requests_total{namespace=~"$Namespace"}))*100 |
Service Operation | n32f message forward |
Response Code | All |
5.2.3.2 Total cn32f Requests
Table 5-152 Total cn32f Requests
KPI Detail | Measures the cn32f requests rate per remote SEPP. |
Metric Used for KPI |
sum((ocsepp_cn32f_requests_total{namespace=~"$Namespace", direction="egress"}))by(PEER_DOMAIN, PEER_FQDN, PLMN_ID) |
Service Operation | n32f message forward |
Response Code | All |
5.2.3.3 cn32f Processing Time (ms)
Table 5-153 cn32f Processing Time (ms)
KPI Detail | Measures the cn32f processing time (ms) |
Metric Used for KPI | sum(irate(ocsepp_cn32f_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_cn32f_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) |
Service Operation | n32f message forward |
Response Code | All |
5.2.3.4 Total cn32f Responses
Table 5-154 Total cn32f Responses
KPI Detail | Measures the cn32f response rate per remote SEPP |
Metric Used for KPI | sum((ocsepp_cn32f_response_total{namespace=~"$Namespace", direction="egress"})) by(PEER_DOMAIN, PEER_FQDN, PLMN_ID) |
Service Operation | n32f message forward |
Response Code | All |
5.2.4 PN32F Common KPIs
5.2.4.1 Total pn32f Requests
Table 5-156 Total pn32f Requests
KPI Detail | Measures the total pn32f requests |
Metric Used for KPI | sum((ocsepp_pn32f_requests_total{namespace=~"$Namespace", direction="egress"}))by(PEER_DOMAIN, PEER_FQDN, PLMN_ID) |
Service Operation | n32f message forward |
Response Code | All |
5.2.4.2 Total pn32f Responses
Table 5-157 Total pn32f Responses
KPI Detail | Measures the pn32f response rate per remote SEPP. |
Metric Used for KPI | sum((ocsepp_pn32f_responses_total{namespace=~"$Namespace", direction="egress"})) by(PEER_DOMAIN, PEER_FQDN, PLMN_ID) |
Service Operation | n32f message forward |
Response Code | All |
5.2.4.3 pn32f Processing Time (ms)
Table 5-158 pn32f Processing Time (ms)
KPI Detail | Measures the pn32f processing time in milliseconds |
Metric Used for KPI |
sum(irate(ocsepp_pn32f_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_pn32f_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) |
Service Operation | n32f message forward |
Response Code | All |
5.2.4.4 pn32f Failures
Table 5-159 pn32f Failures
KPI Detail | Measures the pn32f request failures in total |
Metric Used for KPI | sum(ocsepp_pn32f_requests_failure_total{namespace=~"$Namespace"}) by (PEER_DOMAIN, PEER_FQDN, PLMN_ID) |
Service Operation | n32f message forward |
Response Code | 4xx and 5xx |
5.2.4.5 pn32f Routing Success Rate
Table 5-160 pn32f Routing Success Rate
KPI Detail | Measures the pn32f routing success rate |
Metric Used for KPI | (sum(ocsepp_pn32f_responses_total{namespace=~"$Namespace"})/sum(ocsepp_pn32f_requests_total{namespace=~"$Namespace"}))*100 |
Service Operation | n32f message forward |
Response Code | All |
5.2.5 Global Rate Limiting Feature KPIs
5.2.5.1 PLMN IGW Global Rate limit Traffic Rejected
Table 5-161 PLMN IGW Global Rate limit Traffic Rejected
KPI Detail | Measures the PLMN IGW global rate limit traffic rejected, that is, the number of messages rejected for traffic initiated from the producer side. |
Metric Used for KPI | sum(irate(oc_ingressgateway_global_ratelimit_total{namespace=~"$Namespace",app="plmn-ingress-gateway", Status="dropped"}[2m])) |
5.2.5.2 N32 IGW Global Rate limit Traffic Rejected
Table 5-162 N32 IGW Global Rate limit Traffic Rejected
KPI Detail | Measures the N32 IGW Global rate limit traffic rejected |
Metric Used for KPI |
sum(irate(oc_ingressgateway_global_ratelimit_total{namespace=~"$Namespace",app="n32-ingress-gateway", Status="dropped"}[2m])) No. of messages rejected for traffic initiated from consumer side |
5.2.6 Topology Hiding KPIs
5.2.6.1 CN32F Topology Egress Request Processing Time(ms)
Table 5-163 CN32F Topology Egress Request Processing Time(ms)
KPI Detail | Measures the cn32f topology Egress request processing time |
Metric Used for KPI | sum(irate(ocsepp_topology_latency_seconds_sum{app="cn32f-svc",direction="egress",message_type="request",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_topology_latency_seconds_count{app="cn32f-svc",direction="egress",message_type="request",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.2 CN32F Topology Ingress Response Processing Time
Table 5-164 CN32F Topology Ingress Response Processing Time
KPI Detail | Measures the cn32f topology Ingress response processing time |
Metric Used for KPI | sum(irate(ocsepp_topology_latency_seconds_sum{app="cn32f-svc",direction="ingress",message_type="response",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_topology_latency_seconds_count{app="cn32f-svc",direction="ingress",message_type="response",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.3 CN32F Topology Hiding Success
Table 5-165 CN32F Topology Hiding Success
KPI Detail | Measures the cn32f topology hiding success by messages |
Metric Used for KPI | sum(ocsepp_topology_success_total{app="cn32f-svc", namespace=~"$Namespace"}) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.4 N32F Topology Success by headers
Table 5-166 N32F Topology Success by headers
KPI Detail | Measures N32F Topology success by headers |
Metric Used for KPI |
sum(ocsepp_topology_header_success_total{app="cn32f-svc"}) by(header) Note: For the PN32F microservice, change the label to app="pn32f-svc". |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.5 CN32F Topology Hiding Missing Regex Configuration
Table 5-167 CN32F Topology Hiding Missing Regex Configuration
KPI Detail | Measures the cn32f topology hiding missing regex configuration |
Metric Used for KPI | sum(ocsepp_topology_header_regex_not_configured_total{app="cn32f-svc", namespace=~"$Namespace"}) by(error_msg) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.6 CN32F Topology Hiding Invalid Header Regex
Table 5-168 CN32F Topology Hiding Invalid Header Regex
Field | Details |
---|---|
KPI Detail | Measures the cn32f topology hiding invalid header regex |
Metric Used for KPI |
sum(ocsepp_topology_invalid_header_regex_configured_total{app="cn32f-svc", namespace=~"$Namespace"}) by(error_msg) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.7 PN32F Topology Ingress Request Processing Time(ms)
Table 5-169 PN32F Topology Ingress Request Processing Time(ms)
KPI Detail | Measures the pn32f topology Ingress request processing time |
Metric Used for KPI | sum(irate(ocsepp_topology_latency_seconds_sum{app="pn32f-svc",direction="ingress",message_type="request",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_topology_latency_seconds_count{app="pn32f-svc",direction="ingress",message_type="request",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.8 PN32F Topology Egress Response Processing Time(ms)
Table 5-170 PN32F Topology Egress Response Processing Time(ms)
KPI Detail | Measures the pn32f topology Egress response processing time |
Metric Used for KPI | sum(irate(ocsepp_topology_latency_seconds_sum{app="pn32f-svc",direction="egress",message_type="response",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_topology_latency_seconds_count{app="pn32f-svc",direction="egress",message_type="response",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.9 PN32F Topology Hiding Invalid Header Regex
Table 5-171 PN32F Topology Hiding Invalid Header Regex
Field | Details |
---|---|
KPI Detail | Measures the pn32f topology hiding invalid header Regex |
Metric Used for KPI | sum(ocsepp_topology_invalid_header_regex_configured_total{app="pn32f-svc", namespace=~"$Namespace"}) by(error_msg) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.10 PN32F Topology Hiding Missing Regex Configuration
Table 5-172 PN32F Topology Hiding Missing Regex Configuration
Field | Details |
---|---|
KPI Detail | Measures the pn32f topology hiding missing Regex configuration |
Metric Used for KPI | sum(ocsepp_topology_header_regex_not_configured_total{app="pn32f-svc", namespace=~"$Namespace"}) by(error_msg) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.6.11 PN32F Topology Hiding Success
Table 5-173 PN32F Topology Hiding Success
Field | Details |
---|---|
KPI Detail | Measures the pn32f topology hiding success |
Metric Used for KPI | sum(ocsepp_topology_success_total{app="pn32f-svc", namespace=~"$Namespace"}) |
Service Operation | n32f message forward |
Response Code | NA |
5.2.7 5G SBI Message Mediation Support KPIs
5.2.7.1 Mediation Requests Counters - N32 Egress Request
Table 5-174 Mediation Requests Counters - N32 Egress Request
Field | Details |
---|---|
KPI Detail | Measures the Mediation Requests Counters for N32 Egress Request |
Metric Used for KPI | sum(ocsepp_n32f_mediation_requests_total{direction="N32_Egress_Request", namespace=~"$Namespace"}) |
5.2.7.2 Mediation Requests Counters - N32 Ingress Response
Table 5-175 Mediation Requests Counters - N32 Ingress Response
Field | Details |
---|---|
KPI Detail | Measures the Mediation Requests Counters for N32 Ingress Response |
Metric Used for KPI | sum(ocsepp_n32f_mediation_requests_total{direction="N32_Ingress_Response", namespace=~"$Namespace"}) |
5.2.7.3 Mediation Requests Counters - N32 Ingress Request
Table 5-176 Mediation Requests Counters - N32 Ingress Request
Field | Details |
---|---|
KPI Detail | Measures the Mediation Requests Counters for N32 Ingress Request |
Metric Used for KPI | sum(ocsepp_n32f_mediation_requests_total{direction="N32_Ingress_Request", namespace=~"$Namespace"}) |
5.2.7.4 Mediation Requests Counters - N32 Egress Response
Table 5-177 Mediation Requests Counters - N32 Egress Response
Field | Details |
---|---|
KPI Detail | Measures the Mediation Requests Counters for N32 Egress Response |
Metric Used for KPI | sum(ocsepp_n32f_mediation_requests_total{direction="N32_Egress_Response", namespace=~"$Namespace"}) |
5.2.7.5 Mediation Response Counters - N32 Egress Request
Table 5-178 Mediation Response Counters - N32 Egress Request
Field | Details |
---|---|
KPI Detail | Measures the Mediation Response Counters for N32 Egress Request |
Metric Used for KPI | sum(ocsepp_n32f_mediation_response_total{direction="N32_Egress_Request", namespace=~"$Namespace"}) |
5.2.7.6 Mediation Response Counters - N32 Ingress Response
Table 5-179 Mediation Response Counters - N32 Ingress Response
Field | Details |
---|---|
KPI Detail | Measures the Mediation Response Counters for N32 Ingress Response |
Metric Used for KPI | sum(ocsepp_cn32f_mediation_response_total{direction="N32_Ingress_Response",namespace=~"$Namespace"}) |
5.2.7.7 Mediation Response Counters - N32 Ingress Request
Table 5-180 Mediation Response Counters - N32 Ingress Request
Field | Details |
---|---|
KPI Detail | Measures the Mediation Response Counters for N32 Ingress Request |
Metric Used for KPI | sum(ocsepp_pn32f_mediation_response_total{direction="N32_Ingress_Request",namespace=~"$Namespace"}) |
5.2.7.8 Mediation Response Counters - N32 Egress Response
Table 5-181 Mediation Response Counters - N32 Egress Response
Field | Details |
---|---|
KPI Detail | Measures the Mediation Response Counters for N32 Egress Response |
Metric Used for KPI | sum(ocsepp_pn32f_mediation_response_total{direction="N32_Egress_Response",namespace=~"$Namespace"}) |
5.2.7.9 Mediation Response Failure
Table 5-182 Mediation Response Failure
Field | Details |
---|---|
KPI Detail | Measures the Mediation Response Failure |
Metric Used for KPI |
sum(ocsepp_cn32f_mediation_response_failure{namespace=~"$Namespace"}) by (Direction, status_code) sum(ocsepp_pn32f_mediation_response_failure{namespace=~"$Namespace"}) by (Direction, status_code) |
5.2.7.10 Mediation Applied Total
Table 5-183 Mediation Applied Total
Field | Details |
---|---|
KPI Detail | Measures the Mediation Applied Total |
Metric Used for KPI |
(sum(ocsepp_n32f_mediation_requests_total{namespace=~"$Namespace"})*100)/(sum(ocsepp_n32f_mediation_not_applied_total{namespace=~"$Namespace"})+sum(ocsepp_n32f_mediation_requests_total{namespace=~"$Namespace"})) |
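The expression in Table 5-183 is the share of mediation requests out of all messages that were either sent to mediation or skipped. A minimal Python sketch of the same formula follows. The function name and the zero-denominator handling are assumptions for illustration.

```python
def mediation_applied_percent(requests_total: float, not_applied_total: float) -> float:
    """Percentage of messages on which mediation was applied.

    Mirrors (requests * 100) / (not_applied + requests) from the KPI expression.
    """
    denominator = not_applied_total + requests_total
    if denominator == 0:
        # Assumption for this sketch: no traffic is reported as 0 percent.
        return 0.0
    return (requests_total * 100) / denominator


# Hypothetical totals: 75 mediated messages, 25 not mediated.
print(mediation_applied_percent(75, 25))
```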
5.2.7.11 Mediation Response Time At PN32F
Table 5-184 Mediation Response Time At PN32F
Field | Details |
---|---|
KPI Detail | Measures the Mediation Response Time at PN32F |
Metric Used for KPI | sum(irate(ocsepp_pn32f_mediation_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_pn32f_mediation_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) |
5.2.7.12 Mediation Response Time At CN32F
Table 5-185 Mediation Response Time At CN32F
Field | Details |
---|---|
KPI Detail | Measures the Mediation Response Time at CN32F |
Metric Used for KPI |
sum(irate(ocsepp_cn32f_mediation_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_cn32f_mediation_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) |
5.2.8 Ingress Gateway Message Copy KPIs
5.2.8.1 Total Requests Data sent towards DD for Ingress Gateway
Table 5-186 Total Requests Data sent towards DD for Ingress Gateway
Field | Details |
---|---|
KPI Detail | Measures the total requests data sent towards DD for Ingress Gateway. |
Metric Used for KPI |
sum(irate(oc_egressgateway_msgcopy_requests_total{namespace="$Namespace",type="req"}[2m])) by(app) |
5.2.8.2 Total Ack received from DD for Requests for Ingress Gateway
Table 5-187 Total Ack received from DD for Requests for Ingress Gateway
Field | Details |
---|---|
KPI Detail | Measures the total Ack received from DD for requests for Ingress Gateway. |
Metric Used for KPI |
sum(irate(oc_egressgateway_msgcopy_requests_total{namespace="$Namespace",type="ack"}[2m])) by(app) |
5.2.9 Egress Gateway Message Copy KPIs
5.2.9.1 Total Requests Data sent towards DD for Egress Gateway
Table 5-188 Total Requests Data sent towards DD for Egress Gateway
Field | Details |
---|---|
KPI Detail | Measures the total Requests Data sent towards DD for Egress Gateway. |
Metric Used for KPI |
sum(irate(oc_egressgateway_msgcopy_requests_total{namespace="$Namespace",type="req"}[2m])) by(app) |
5.2.9.2 Total Ack received from DD for Requests for Egress Gateway
Table 5-189 Total Ack received from DD for Requests for Egress Gateway
Field | Details |
---|---|
KPI Detail | Measures the total acknowledgement received from DD on Egress Gateway. |
Metric Used for KPI |
sum(irate(oc_egressgateway_msgcopy_requests_total{namespace="$Namespace",type="ack"}[2m])) by(app) |
5.2.10 Hosted SEPP KPIs
5.2.10.1 CN32F Allowed P-RSS Validation Failure Count
Table 5-190 CN32F Allowed P-RSS Validation Failure Count
Field | Details |
---|---|
KPI Detail | Measures the number of messages failed due to incorrect routing rules configured at cn32f microservice. |
Metric Used for KPI |
sum(ocsepp_allowed_p_rss_routing_failure_total{app="cn32f-svc", namespace=~"$Namespace"}) by (app) |
5.2.10.2 PN32F Allowed P-RSS Validation Failure Count
Table 5-191 PN32F Allowed P-RSS Validation Failure Count
Field | Details |
---|---|
KPI Detail | Measures the number of messages failed due to incorrect routing rules configured at pn32f microservice |
Metric Used for KPI |
sum(ocsepp_allowed_p_rss_routing_failure_total{app="pn32f-svc", namespace=~"$Namespace"}) by (app) |
5.2.11 SoR KPIs
5.2.12 Rate Limiting for Ingress Roaming Signaling per Remote SEPP Set KPIs
5.2.12.1 Average No of messages discarded for a particular RSS
Table 5-194 Average No of messages discarded for a particular RSS
Field | Details |
---|---|
KPI Detail | Measures the average number of messages discarded for a particular RSS. |
Metric used for KPI | sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace="namespace",Remote_SEPP_Set="<Remote SEPP Set name>", Status="dropped"}[2m])) |
5.2.12.2 Average No of messages accepted for a particular RSS
Table 5-195 Average No of messages accepted for a particular RSS
Field | Details |
---|---|
KPI Detail | Measures the average number of messages accepted for a particular RSS. |
Metric used for KPI | sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace="namespace",Remote_SEPP_Set="<Remote SEPP Set name>", Status="accepted"}[2m])) |
5.2.12.3 Average No of messages for which feature not applied
Table 5-196 Average No of messages for which feature not applied
Field | Details |
---|---|
KPI Detail | Measures the average number of messages for which feature not applied. |
Metric used for KPI | sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace="namespace",Status="ratelimit not applied"}[2m])) |
5.2.12.4 Average of all messages by Status
Table 5-197 Average of all messages by Status
Field | Details |
---|---|
KPI Detail | Measures the average of all messages by Status |
Metric used for KPI | sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace="namespace"}[2m])) by (Status) |
5.2.12.5 List of Average number of messages dropped for all RSS
Table 5-198 List of Average number of messages dropped for all RSS
Field | Details |
---|---|
KPI Detail | Lists the average number of messages dropped for all RSS |
Metric used for KPI | sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace="namespace", Status="dropped"}[2m])) by (Remote_SEPP_Set) |
5.2.12.6 List of Average number of messages accepted for all RSS
Table 5-199 List of Average number of messages accepted for all RSS
Field | Details |
---|---|
KPI Detail | Lists the average number of messages accepted for all RSS |
Metric used for KPI | sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace="namespace", Status="accepted"}[2m])) by (Remote_SEPP_Set) |
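The six expressions in this section differ only in their label matchers and grouping, and PromQL requires label values to be enclosed in straight double quotes. The following Python sketch shows one way to assemble them from parameters. The helper names and the sample namespace and Remote SEPP Set values are hypothetical; only the metric and label names come from the tables above.

```python
from typing import Optional

METRIC = "oc_ingressgateway_rss_ratelimit_total"


def _quote(value: str) -> str:
    """Escape backslashes and double quotes, then wrap in straight double quotes."""
    return '"' + value.replace("\\", "\\\\").replace('"', '\\"') + '"'


def rss_ratelimit_query(namespace: str,
                        status: Optional[str] = None,
                        remote_sepp_set: Optional[str] = None,
                        group_by: Optional[str] = None) -> str:
    """Build one of the per-RSS rate limiting KPI expressions as a string."""
    matchers = ["namespace=" + _quote(namespace)]
    if remote_sepp_set is not None:
        matchers.append("Remote_SEPP_Set=" + _quote(remote_sepp_set))
    if status is not None:
        matchers.append("Status=" + _quote(status))
    query = "sum(irate(" + METRIC + "{" + ",".join(matchers) + "}[2m]))"
    if group_by is not None:
        query += " by (" + group_by + ")"
    return query


# Hypothetical namespace and Remote SEPP Set name.
print(rss_ratelimit_query("sepp-ns", status="dropped", remote_sepp_set="RSS-1"))
print(rss_ratelimit_query("sepp-ns", status="accepted", group_by="Remote_SEPP_Set"))
```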
5.2.13 Cat 0 - SBI Message Schema Validation KPIs
5.2.13.1 Message validation applied requests on cn32f
Table 5-200 Message validation applied requests on cn32f
Field | Details |
---|---|
KPI Detail | Measures the total number of requests at CN32F on which message validation has been applied by request path. |
Metric Used for KPI | sum(ocsepp_message_validation_applied_total{namespace=~"$Namespace",app="cn32f-svc"}) by (requestPath) |
5.2.13.2 Cn32f message validation failure on request body
Table 5-201 Cn32f message validation failure on request body
Field | Details |
---|---|
KPI Detail | Measures the total number of message validation failure(s) on request body by request path. |
Metric Used for KPI | sum(ocsepp_message_validation_on_body_failure_total{namespace=~"$Namespace",app="cn32f-svc"}) by (request_path) |
5.2.13.3 Cn32f message validation failures on request query parameter(s)
Table 5-202 Cn32f message validation failures on request query parameter(s)
Field | Details |
---|---|
KPI Detail | Measures the total number of message validation failures on request query parameter(s) by request path. |
Metric Used for KPI | sum(ocsepp_message_validation_on_header_failure_total{namespace=~"$Namespace",app="cn32f-svc"}) by (request_path) |
5.2.13.4 Message validation applied requests on pn32f
Table 5-203 Message validation applied requests on pn32f
Field | Details |
---|---|
KPI Detail | Measures the total number of requests at pn32f on which message validation has been applied by request path. |
Metric Used for KPI | sum(ocsepp_message_validation_applied_total{namespace=~"$Namespace",app="pn32f-svc"}) by (requestPath) |
5.2.13.5 Pn32f message validation failure on request body
Table 5-204 Pn32f message validation failure on request body
Field | Details |
---|---|
KPI Detail | Measures the total number of message validation failure(s) on request body by request path. |
Metric Used for KPI | sum(ocsepp_message_validation_on_body_failure_total{namespace=~"$Namespace",app="pn32f-svc"}) by (request_path) |
5.2.13.6 Pn32f message validation failures on request query parameter(s)
Table 5-205 Pn32f message validation failures on request query parameter(s)
Field | Details |
---|---|
KPI Detail | Measures the total number of message validation failures on request query parameter(s) by request path. |
Metric Used for KPI | sum(ocsepp_message_validation_on_header_failure_total{namespace=~"$Namespace",app="pn32f-svc"}) by (request_path) |
5.2.14 Rate Limiting for Egress Roaming Signaling per PLMN KPIs
5.2.14.1 Average Number of Messages Rejected for a Particular PLMN
Table 5-206 Average Number of Messages Rejected for a Particular PLMN
Field | Details |
---|---|
KPI Detail | Measures the average number of messages rejected for a particular PLMN |
Metric used for KPI | sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace="namespace",PLMN_ID="PLMN ID", Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[2m])) |
5.2.14.2 Average Number of Messages Accepted for a Particular PLMN
Table 5-207 Average Number of Messages Accepted for a Particular PLMN
Field | Details |
---|---|
KPI Detail | Measures the average number of messages accepted for a particular PLMN |
Metric used for KPI | sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace="namespace",PLMN_ID="PLMN ID", Status=~"ERL_MATCH_TOKEN_AVAILABLE_FWD|ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD"}[2m])) |
5.2.14.3 Average Number of Messages for which Feature not Applied
Table 5-208 Average Number of Messages for which Feature not Applied
Field | Details |
---|---|
KPI Detail | Measures the average number of messages for which feature not applied |
Metric used for KPI | sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace="namespace",Status!~"ERL_MATCH_TOKEN_AVAILABLE_FWD|ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD|ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[2m])) |
5.2.14.4 Average of all Messages by Status
Table 5-209 Average of all Messages by Status
Field | Details |
---|---|
KPI Detail | Measures the average of all messages by status |
Metric used for KPI | sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace="namespace"}[2m])) by (Status) |
5.2.14.5 Average Number of Messages Rejected per PLMN
Table 5-210 Average Number of Messages Rejected per PLMN
Field | Details |
---|---|
KPI Detail | Measures the average number of messages rejected per PLMN |
Metric used for KPI | sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace="namespace", Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[2m])) by (PLMN_ID) |
5.2.14.6 Average Number of Messages Accepted per PLMN
Table 5-211 Average Number of Messages Accepted per PLMN
Field | Details |
---|---|
KPI Detail | Measures the average number of messages accepted per PLMN |
Metric used for KPI | sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace="namespace", Status=~"ERL_MATCH_TOKEN_AVAILABLE_FWD|ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD"}[2m])) by (PLMN_ID) |
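The KPIs in this section sort the Status label into three outcomes: two values count as accepted, one as rejected, and anything else as "feature not applied". The Python sketch below restates that grouping. The function names, the outcome labels, and the sample status EXAMPLE_OTHER_STATUS are hypothetical; the three ERL_ values are taken from the expressions above.

```python
ACCEPTED_STATUSES = {
    "ERL_MATCH_TOKEN_AVAILABLE_FWD",
    "ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD",
}
REJECTED_STATUSES = {"ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}


def classify_status(status: str) -> str:
    """Map a Status label value to the outcome used by the KPIs."""
    if status in ACCEPTED_STATUSES:
        return "accepted"
    if status in REJECTED_STATUSES:
        return "rejected"
    # Matches the Status!~"..." expression: every other value.
    return "not_applied"


def totals_by_outcome(counts_by_status: dict) -> dict:
    """Sum per-status message counts into the three outcomes."""
    totals = {"accepted": 0, "rejected": 0, "not_applied": 0}
    for status, count in counts_by_status.items():
        totals[classify_status(status)] += count
    return totals


# Hypothetical per-status counts.
sample = {
    "ERL_MATCH_TOKEN_AVAILABLE_FWD": 90,
    "ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD": 5,
    "ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT": 3,
    "EXAMPLE_OTHER_STATUS": 2,
}
print(totals_by_outcome(sample))
```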
5.3 SEPP Alerts
This section provides information about the SEPP alerts and their configuration.
Note:
For CNE 1.8.4 or earlier versions:
- namespace: {{$labels.kubernetes_namespace}}
- podname: {{$labels.kubernetes_pod_name}}
For CNE 1.9.x or later versions:
- namespace: {{$labels.namespace}}
- podname: {{$labels.pod}}
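The note above ties the label names used in alert templates to the CNE release. A small Python sketch of that selection follows, assuming a dotted version string. The function name is hypothetical, and treating every release below 1.9 as "earlier" is an assumption, because the note does not cover releases between 1.8.4 and 1.9.

```python
def alert_label_names(cne_version: str) -> dict:
    """Return the Prometheus label names to use for namespace and pod name."""
    parts = cne_version.split(".")
    major, minor = int(parts[0]), int(parts[1])
    if (major, minor) >= (1, 9):
        # CNE 1.9.x or later
        return {"namespace": "namespace", "podname": "pod"}
    # CNE 1.8.4 or earlier (assumption: any release below 1.9 is handled the same way)
    return {"namespace": "kubernetes_namespace", "podname": "kubernetes_pod_name"}


labels = alert_label_names("1.8.4")
print("namespace: {{$labels." + labels["namespace"] + "}}")
print("podname: {{$labels." + labels["podname"] + "}}")
```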
5.3.1 System Level Alerts
5.3.1.1 SEPPPodMemoryUsageAlert
Table 5-212 SEPPPodMemoryUsageAlert
Trigger Condition | Pod memory usage is above the threshold (70%) |
Severity | Warning |
Alert details provided | Summary: 'namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Memory usage is {{ $value | printf "%.2f" }} which is above 70% (current value is: {{ $value }})' Expression: (sum by(namespace,container) (container_memory_usage_bytes{container=~".*cn32c-svc.*|.*pn32c-svc.*|.*cn32f-svc.*|.*pn32f-svc.*|.*config-mgr-svc.*|.*n32-egress-gateway.*|.*n32-ingress-gateway.*|.*plmn-egress-gateway.*|.*plmn-ingress-gateway.*|.*nf-mediation.*"}) / sum by(namespace,container)(container_spec_memory_limit_bytes{container=~".*cn32c-svc.*|.*pn32c-svc.*|.*cn32f-svc.*|.*pn32f-svc.*|.*config-mgr-svc.*|.*n32-egress-gateway.*|.*n32-ingress-gateway.*|.*plmn-egress-gateway.*|.*plmn-ingress-gateway.*|.*nf-mediation.*"}) ) * 100 >= 70 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4003 |
Metric Used |
container_memory_usage_bytes Note: This is a Kubernetes metric used for instance availability monitoring. If the metric is not available, use the similar metric as exposed by the monitoring system. |
Resolution |
The alert gets cleared when the memory utilization falls below the critical threshold. Note: The threshold is configurable in the SeppAlertrules.yaml file. If guidance is required, contact My Oracle Support. |
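The alert expression compares memory usage, as a percentage of the configured limit, against the 70% threshold. The Python sketch below applies the same comparison to a single container. The function name and the sample byte values are hypothetical, and the comparison is rearranged to avoid a floating-point division, which is equivalent for a positive limit.

```python
def memory_alert_fires(usage_bytes: int, limit_bytes: int,
                       threshold_percent: float = 70.0) -> bool:
    """True when usage is at or above threshold_percent of the limit."""
    if limit_bytes <= 0:
        # Assumption for this sketch: a container without a memory limit never fires.
        return False
    # Equivalent to (usage_bytes / limit_bytes) * 100 >= threshold_percent.
    return usage_bytes * 100 >= threshold_percent * limit_bytes


# Hypothetical container: 1.5 GiB used out of a 2 GiB limit (75%).
print(memory_alert_fires(1536 * 1024 * 1024, 2048 * 1024 * 1024))
```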
5.3.1.2 SEPPPodCpuUsageAlert
Table 5-213 SEPPPodCpuUsageAlert
Field | Details |
---|---|
Trigger Condition | Pod CPU usage is above the threshold (70%) |
Severity | Warning |
Alert details provided | Summary: 'namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: CPU usage is {{ $value | printf "%.2f" }} which is usage is above 70% (current value is: {{ $value }})' Expression: (sum by (namespace,container) (rate(container_cpu_usage_seconds_total{container=~".*cn32c-svc.*|.*pn32c-svc.*|.*cn32f-svc.*|.*pn32f-svc.*|.*config-mgr-svc.*|.*n32-egress-gateway.*|.*n32-ingress-gateway.*|.*plmn-egress-gateway.*|.*plmn-ingress-gateway.*|.*nf-mediation.*"}[2m])) ) / (sum by (container, namespace) (kube_pod_container_resource_limits{resource="cpu",container=~".*cn32c-svc.*|.*pn32c-svc.*|.*cn32f-svc.*|.*pn32f-svc.*|.*config-mgr-svc.*|.*n32-egress-gateway.*|.*n32-ingress-gateway.*|.*plmn-egress-gateway.*|.*plmn-ingress-gateway.*|.*nf-mediation.*"}) ) * 100 >= 70 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4002 |
Metric Used |
container_cpu_usage_seconds_total Note : This is a Kubernetes metric used for instance availability monitoring. If the metric is not available, use the similar metric as exposed by the monitoring system. |
Resolution | The alert gets cleared when the CPU utilization is below the critical threshold. Note: The threshold is configurable in the SeppAlertrules.yaml file. If guidance is required, contact My Oracle Support. |
5.3.2 Application Level Alerts
5.3.2.1 Common Alerts
5.3.2.1.1 SEPPN32fRoutingFailure
Table 5-214 SEPPN32fRoutingFailure
Field | Details |
---|---|
Trigger Condition | N32f service is not able to forward the message |
Severity | Info |
Alert details provided | Summary: namespace: {{ $labels.namespace}}, podname: {{ $labels.pod}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} Expression: idelta(ocsepp_cn32f_requests_failure_total[2m]) > 0 or (ocsepp_cn32f_requests_failure_total unless ocsepp_cn32f_requests_failure_total offset 2m) |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4001 |
Metric Used | ocsepp_cn32f_requests_failure_total |
Resolution |
The alert gets cleared when Consumer SEPP accepts request only if producer NF domain and PLMN match the Remote SEPP configured. Steps: The failure reason is present in the alert. Possible Resolutions :
|
5.3.2.1.2 SEPPConfigMgrRouteFailureAlert
Table 5-215 SEPPConfigMgrRouteFailureAlert
Trigger Condition | When routing failure occurs while posting remote SEPP or roaming partner set, this alert will be raised. |
Severity | Major |
Alert Details Provided |
Summary namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Route Failure has occurred because {{ $labels.errorReason }} Expression sum(increase(ocsepp_configmgr_routefailure_total{app="config-mgr-svc"}[5m]) >0 or (ocsepp_configmgr_routefailure_total{app="config-mgr-svc"} unless ocsepp_configmgr_routefailure_total{app="config-mgr-svc"} offset 5m )) by (namespace,errorCode) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4026 |
Metric Used | ocsepp_configmgr_routefailure_total |
Resolution | The alert is cleared if no new failures are observed within a 5-minute window. |
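The alert expression has two branches: the counter increased during the last 5 minutes, or the series exists now but did not exist 5 minutes ago (the "unless ... offset 5m" part). The Python sketch below models that decision for a single series using two samples taken 5 minutes apart. It is a simplification: it ignores the extrapolation that Prometheus applies in increase() and the aggregation by namespace and errorCode, and the function names are hypothetical.

```python
from typing import Optional


def observed_increase(previous: Optional[float], current: Optional[float]) -> float:
    """Simplified counter increase between two samples 5 minutes apart."""
    if previous is None or current is None:
        return 0.0  # an increase needs two samples
    if current < previous:
        return current  # counter reset: counting restarted from zero
    return current - previous


def route_failure_alert_fires(previous: Optional[float], current: Optional[float]) -> bool:
    """Approximate the two branches of the alert expression for one series."""
    if current is None:
        return False  # no series, nothing to alert on
    if observed_increase(previous, current) > 0:
        return True  # first branch: new failures within the window
    # Second branch: series is new (absent 5 minutes ago) and already non-zero.
    return previous is None and current > 0


print(route_failure_alert_fires(None, 1))  # new series with one failure
print(route_failure_alert_fires(3, 3))     # no new failures in the window
```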
5.3.2.1.3 EgressSbiErrorRateAbove1Percent
Table 5-216 EgressSbiErrorRateAbove1Percent
Trigger Condition | Sbi Transaction Error Rate exceeded configured threshold |
Severity | Major |
Alert details provided | Summary: "Sbi Transaction Error Rate detected above 1 Percent of Total Sbi Transactions" Expression: sum(rate(oc_egressgateway_sbiRouting_http_responses_total{Status!~"2.*"}[24h])) by (app,pod, namespace) /sum(rate(oc_egressgateway_sbiRouting_http_responses_total[24h])) by (app,pod, namespace) *100 >= 1 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7001 |
Metric Used | oc_egressgateway_sbiRouting_http_responses_total |
Resolution |
The alert is raised when the SBI transaction error rate is greater than or equal to 1% of the total transactions over a 24-hour period. The alert is cleared when the error rate falls below 1%. |
5.3.2.2 Handshake Alerts
5.3.2.2.1 SEPPCn32cHandshakeFailureAlert
Table 5-217 SEPPCn32cHandshakeFailureAlert
Trigger Condition | Handshake procedure has failed on Consumer SEPP |
Severity | Major |
Alert details provided | Summary: 'namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Handshake procedure has failed on Consumer side because {{ $labels.reason }}' Expression: sum(increase(ocsepp_n32c_handshake_failure_attempts_total{app="cn32c-svc"}[5m]) >0 or (ocsepp_n32c_handshake_failure_attempts_total{app="cn32c-svc"} unless ocsepp_n32c_handshake_failure_attempts_total{app="cn32c-svc"} offset 5m )) by (namespace,remote_sepp_name,nfinstanceid,peer_fqdn,app) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.2001 |
Metric Used | ocsepp_n32c_handshake_failure_attempts_total filtered by app=cn32c-svc |
Resolution 1 | The alert gets cleared when the N32C Handshake is established after successful TCP connection to remote SEPP. Failure reason: The release name used during Helm installation is other than ocsepp-release. Error Verification: Check the failure reason in the alert. If the failure reason is 404 - route not found or Route not found, follow the recovery steps:
|
Resolution 2 |
The alert gets cleared when the N32C Handshake is established after successful TCP connection to remote SEPP. Steps: The failure reason is present in the alert. Possible Resolutions:
|
5.3.2.2.2 SEPPPn32cHandshakeFailureAlert
Table 5-218 SEPPPn32cHandshakeFailureAlert
Trigger Condition | Handshake procedure has failed on Producer SEPP |
Severity | Major |
Alert details provided | Summary: 'namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Handshake procedure has failed on Producer side because {{ $labels.error_msg }}' Expression: sum(increase(ocsepp_n32c_handshake_failure_attempts_total{app="pn32c-svc"}[5m]) >0 or (ocsepp_n32c_handshake_failure_attempts_total{app="pn32c-svc"} unless ocsepp_n32c_handshake_failure_attempts_total{app="pn32c-svc"} offset 5m )) by (namespace,remote_sepp_name,nfinstanceid,peer_fqdn,app) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.3001 |
Metric Used | ocsepp_n32c_handshake_failure_attempts_total filtered by app=pn32c-svc |
Resolution |
The alert gets cleared when the N32C Handshake is successful due to TCP connection success of Producer to consumer SEPP. Steps: The failure reason is present in the alert. Possible Resolution: Update and reinitiate the Handshake. |
5.3.2.3 Upgrade Alerts
5.3.2.3.1 SEPPUpgradeStartedAlert
Table 5-219 SEPPUpgradeStartedAlert
Trigger Condition | Rest API trigger at start of Upgrade |
Severity | NA |
Alert details provided |
applicationname alertname servicename releasename namespace oid severity vendor sourcerelease targetrelease |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.8001 |
Metric Used | NA |
Resolution |
If a success alert is generated then start and failure alerts will be cleared. |
5.3.2.3.2 SEPPUpgradeFailedAlert
Table 5-220 SEPPUpgradeFailedAlert
Trigger Condition | Rest API trigger at failure of Upgrade |
Severity | NA |
Alert details provided |
applicationname alertname servicename releasename namespace oid severity vendor sourcerelease targetrelease |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.8002 |
Metric Used | NA |
Resolution |
If a success alert is generated then start and failure alerts will be cleared. |
5.3.2.3.3 SEPPUpgradeSuccessfulAlert
Table 5-221 SEPPUpgradeSuccessfulAlert
Trigger Condition | Rest API trigger at success of Upgrade |
Severity | NA |
Alert details provided |
applicationname alertname servicename releasename namespace oid severity vendor sourcerelease targetrelease |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.8003 |
Metric Used | NA |
Resolution |
If a success alert is generated then start and failure alerts will be cleared. |
5.3.2.4 Rollback Alerts
5.3.2.4.1 SEPPRollbackStartedAlert
Table 5-222 SEPPRollbackStartedAlert
Trigger Condition | Rest API trigger at start of Rollback |
Severity | NA |
Alert details provided |
applicationname alertname servicename releasename namespace oid severity vendor sourcerelease targetrelease |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.8004 |
Metric Used | NA |
Resolution |
If a success alert is generated then start and failure alerts will be cleared. |
5.3.2.4.2 SEPPRollbackFailedAlert
Table 5-223 SEPPRollbackFailedAlert
Trigger Condition | Rest API trigger at failure of Rollback |
Severity | NA |
Alert details provided |
applicationname alertname servicename releasename namespace oid severity vendor sourcerelease targetrelease |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.8005 |
Metric Used | NA |
Resolution |
If a success alert is generated then start and failure alerts will be cleared. |
5.3.2.4.3 SEPPRollbackSuccessfulAlert
Table 5-224 SEPPRollbackSuccessfulAlert
Trigger Condition | Rest API trigger at success of Rollback |
Severity | NA |
Alert details provided |
applicationname alertname servicename releasename namespace oid severity vendor sourcerelease targetrelease |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.8006 |
Metric Used | NA |
Resolution | Cleared after DEFAULT_DURATION_FOR_ALERT_EXPIRY minutes |
5.3.2.5 Global Rate Limiting on Ingress Gateway of SEPP Alerts
5.3.2.5.1 IngressGlobalMessageDropAbovePointOnePercent
Table 5-225 IngressGlobalMessageDropAbovePointOnePercent
Trigger Condition | Ingress Global Message Drop Rate detected greater than or equal to 0.1 Percent of Total Transactions. |
Severity | Warning |
Alert details provided | Summary: "Ingress Global Message Drop Rate detected above 0.1 Percent of Total Transactions" Expression: sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 0.1 < 1 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7002 |
Metric Used | oc_ingressgateway_global_ratelimit_total |
Resolution |
The alert is raised when the percentage of messages rejected by the Global Rate Limit is greater than or equal to 0.1% and less than 1% of the total messages received. It is cleared once the percentage of rejected messages falls below 0.1% or reaches 1% or more. |
5.3.2.5.2 IngressGlobalMessageDropAbove1Percent
Table 5-226 IngressGlobalMessageDropAbove1Percent
Trigger Condition | Ingress Global Message Drop Rate detected greater than or equal to 1 Percent of Total Transactions. |
Severity | Warning |
Alert details provided | Summary: "Ingress Global Message Drop Rate detected above 1 Percent of Total Transactions" Expression: sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 1 < 10 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7003 |
Metric Used | oc_ingressgateway_global_ratelimit_total |
Resolution |
The alert is raised when the percentage of messages rejected by Global Rate Limiting is greater than or equal to 1% of the total messages received. It is cleared once the percentage of rejected messages falls below 1% or reaches 10% or more. |
5.3.2.5.3 IngressGlobalMessageDropAbove10Percent
Table 5-227 IngressGlobalMessageDropAbove10Percent
Trigger Condition | Ingress Global Message Drop Rate detected greater than or equal to 10 Percent of Total Transactions |
Severity | Minor |
Alert details provided | Summary: "Ingress Global Message Drop Rate detected above 10 Percent of Total Transactions" Expression: sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 10 < 25 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7004 |
Metric Used | oc_ingressgateway_global_ratelimit_total |
Resolution |
The alert is raised when the percentage of messages rejected by Global Rate Limiting is greater than or equal to 10% of the total messages received. It is cleared once the percentage of rejected messages falls below 10% or reaches 25% or more. |
5.3.2.5.4 IngressGlobalMessageDropAbove25Percent
Table 5-228 IngressGlobalMessageDropAbove25Percent
Trigger Condition | Ingress Global Message Drop Rate detected greater than or equal to 25 Percent of Total Transactions |
Severity | Major |
Alert details provided | Summary: "Ingress Global Message Drop Rate detected above 25 Percent of Total Transactions" Expression: sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 25 < 50 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7005 |
Metric Used | oc_ingressgateway_global_ratelimit_total |
Resolution |
The alert is raised when the percentage of messages rejected by Global Rate Limiting is greater than or equal to 25% of the total messages received. It is cleared once the percentage of rejected messages falls below 25% or reaches 50% or more. |
5.3.2.5.5 IngressGlobalMessageDropAbove50Percent
Table 5-229 IngressGlobalMessageDropAbove50Percent
Trigger Condition | Ingress Global Message Drop Rate detected greater than or equal to 50 Percent of Total Transactions |
Severity | Critical |
Alert details provided | Summary: "Ingress Global Message Drop Rate detected above 50 Percent of Total Transactions" Expression: sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 50 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7006 |
Metric Used | oc_ingressgateway_global_ratelimit_total |
Resolution |
The alert is raised when the percentage of messages rejected by Global Rate Limiting is greater than or equal to 50% of the total messages received. It is cleared once the percentage of rejected messages falls below 50%. |
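The five Ingress Global Message Drop alerts use non-overlapping percentage bands, so at most one of them should be active for a namespace at a time. The following Python sketch restates the band boundaries from the expressions above. The function names and the idea of passing pre-computed rates are assumptions made for illustration; they are not part of SEPP or Prometheus.

```python
# Bands copied from the five alert expressions: (lower, upper, alert name, severity).
# An upper bound of None means the band is open-ended (>= 50 has no upper limit).
GLOBAL_DROP_BANDS = [
    (0.1, 1.0, "IngressGlobalMessageDropAbovePointOnePercent", "Warning"),
    (1.0, 10.0, "IngressGlobalMessageDropAbove1Percent", "Warning"),
    (10.0, 25.0, "IngressGlobalMessageDropAbove10Percent", "Minor"),
    (25.0, 50.0, "IngressGlobalMessageDropAbove25Percent", "Major"),
    (50.0, None, "IngressGlobalMessageDropAbove50Percent", "Critical"),
]


def drop_percent(dropped_rate, total_rate):
    """Mirror dropped / total * 100; return None when there is no traffic."""
    if total_rate <= 0:
        return None
    return dropped_rate / total_rate * 100


def active_global_drop_alert(percent):
    """Return (alert name, severity) for the band containing percent, or None."""
    if percent is None:
        return None
    for low, high, alert, severity in GLOBAL_DROP_BANDS:
        if percent >= low and (high is None or percent < high):
            return alert, severity
    return None
```

For example, `active_global_drop_alert(drop_percent(5, 1000))` evaluates a 0.5% drop rate, which falls in the 0.1% to 1% band.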
5.3.2.6 Topology Hiding Alerts
5.3.2.6.1 SEPPN32fTopologyOperationFailureAlert
Table 5-230 SEPPN32fTopologyOperationFailureAlert
Field | Details |
---|---|
Trigger Condition | Topology Hiding or Recovery Failure exceeded configured threshold (1%) |
Severity | Major |
Alert details provided | Summary: "Topology hiding/recovery operation failures reached more than configured threshold" Expression: delta(ocsepp_topology_header_failure_total[2m])>0 or (ocsepp_topology_header_failure_total unless ocsepp_topology_header_failure_total offset 2m) |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4004 |
Metric Used | ocsepp_topology_header_failure_total, ocsepp_topology_header_success_total |
Resolution |
This alert will be raised when the total Topology Hiding or Recovery failures reach more than 1%. Alert will be cleared when the error rate is below 1%. Possible Resolutions:
Note: The alert will be cleared only if the corresponding success metric is pegged. |
5.3.2.6.2 SEPPN32fTopologyBodyOperationFailureAlert
Table 5-231 SEPPN32fTopologyBodyOperationFailureAlert
Field | Details |
---|---|
Trigger Condition |
Topology Operation failed and exceeds defined threshold |
Severity | Major |
Alert details provided | Summary: "Topology Hiding/Recovery Operation failures reached more than configured threshold" Expression: delta(ocsepp_topology_body_failure_total[2m])>0 or (ocsepp_topology_body_failure_total unless ocsepp_topology_body_failure_total offset 2m) |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4006 |
Metric Used | ocsepp_topology_body_failure_total ocsepp_topology_body_success_total |
Resolution | This alert is raised when the total Topology Hiding or Recovery failures for the message body exceed 1%. The alert is cleared when the error rate falls below 1%. Possible Resolutions:
|
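Both topology alert expressions follow the same pattern: `delta(X[2m]) > 0 or (X unless X offset 2m)`. The first operand matches when the failure counter changed over the last two minutes; the second matches when the series exists now but did not exist two minutes ago, which covers the first failure ever recorded. The Python sketch below is a simplified reading of that logic for a single series. It ignores the extrapolation that Prometheus applies inside `delta()`, and the function name is hypothetical.

```python
def topology_failure_alert_fires(value_now, value_2m_ago):
    """Simplified single-series reading of the topology alert expression.

    value_now    -- current counter value, or None if the series is absent
    value_2m_ago -- counter value two minutes ago, or None if the series
                    did not exist at that time
    """
    if value_now is None:
        return False  # no series at all, nothing to evaluate
    if value_2m_ago is None:
        return True   # "X unless X offset 2m": the series has just appeared
    return value_now - value_2m_ago > 0  # "delta(X[2m]) > 0"
```

Read this way, the expression reacts to any new failure in the window rather than to a computed failure percentage.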
5.3.2.7 5G SBI Message Mediation Support Alerts
5.3.2.7.1 SEPPCN32fMediationFailure
Table 5-232 SEPPCN32fMediationFailure
Trigger Condition |
Mediation processing Failure |
Severity | Info |
Alert details provided | Summary: "Mediation processing Failure" Expression: increase(ocsepp_cn32f_mediation_response_failure{status_code!="504 GATEWAY_TIMEOUT"}[10m]) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4007 |
Metric Used | ocsepp_cn32f_mediation_response_failure |
Resolution |
This alert will be raised when Mediation microservice is unable to apply rules on the incoming request & response from SEPP. Possible Resolution:
|
5.3.2.7.2 SEPPCN32fMediationUnreachable
Table 5-233 SEPPCN32fMediationUnreachable
Trigger Condition |
Mediation service is not accessible |
Severity | Critical |
Alert details provided | Summary: "Mediation service is not accessible" Expression: increase(ocsepp_cn32f_mediation_response_failure {status_code="504 GATEWAY_TIMEOUT"}[10m]) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4008 |
Metric Used | ocsepp_cn32f_mediation_response_failure |
Resolution |
This alert will be raised when Mediation microservice is not accessible. Possible Resolution:
|
5.3.2.7.3 SEPPPN32fMediationFailure
Table 5-234 SEPPPN32fMediationFailure
Trigger Condition |
Mediation processing Failure |
Severity | Info |
Alert details provided | Summary: "Mediation processing Failure" Expression: increase(ocsepp_pn32f_mediation_response_failure {status_code!="504 GATEWAY_TIMEOUT"}[10m]) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4009 |
Metric Used | ocsepp_pn32f_mediation_response_failure |
Resolution |
This alert will be raised when Mediation microservice is unable to apply rules on the incoming request & response from SEPP. Possible Resolution:
|
5.3.2.7.4 SEPPPN32fMediationUnreachable
Table 5-235 SEPPPN32fMediationUnreachable
Trigger Condition |
Mediation service is not accessible |
Severity | Critical |
Alert details provided | Summary: "Mediation service is not accessible" Expression: increase(ocsepp_pn32f_mediation_response_failure {status_code="504 GATEWAY_TIMEOUT"}[10m]) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4010 |
Metric Used | ocsepp_pn32f_mediation_response_failure |
Resolution |
This alert will be raised when Mediation microservice is not accessible. Possible Resolution:
|
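The four mediation alerts differ only in the side that pegged the failure metric (CN32F or PN32F) and in whether the `status_code` label equals `504 GATEWAY_TIMEOUT`. The Python sketch below summarizes that split. The function name and the `side` argument are assumptions for illustration only.

```python
def mediation_alert(side, status_code):
    """Map a mediation failure to the alert it contributes to.

    side        -- "cn32f" or "pn32f", matching the metric that was pegged
    status_code -- value of the metric's status_code label
    """
    prefix = {"cn32f": "SEPPCN32f", "pn32f": "SEPPPN32f"}[side]
    if status_code == "504 GATEWAY_TIMEOUT":
        # A timeout means the Mediation service could not be reached.
        return prefix + "MediationUnreachable", "Critical"
    # Any other failure status means Mediation answered but processing failed.
    return prefix + "MediationFailure", "Info"
```

Because the two label matchers are complementary, a single failure response counts toward exactly one of the two alerts on its side.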
5.3.2.8 Overload Control Alerts
5.3.2.8.1 SEPPServiceOverload65Percent
Table 5-236 SEPPServiceOverload65Percent
Trigger Condition | CPU memory of pn32f-svc more than 65% |
Severity | Warning |
Alert details provided | Summary: Backend service is in overload with load level > 65% Expression: service_resource_overload_level == 1 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7007 |
Metric Used | service_resource_overload_level |
Resolution |
The alert will be cleared when CPU Memory for backend-svc goes below 60%. |
5.3.2.8.2 SEPPServiceOverload70Percent
Table 5-237 SEPPServiceOverload70Percent
Trigger Condition | CPU memory of pn32f-svc more than 70% |
Severity | Minor |
Alert details provided | Summary: Backend service is in overload with load level > 70% Expression: service_resource_overload_level == 2 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7008 |
Metric Used | service_resource_overload_level |
Resolution |
The alert will be cleared when CPU Memory for backend-svc goes below 70% |
5.3.2.8.3 SEPPServiceOverload80Percent
Table 5-238 SEPPServiceOverload80Percent
Trigger Condition | CPU memory of pn32f-svc more than 80% |
Severity | Major |
Alert details provided | Summary: Backend service is in overload with load level > 80% Expression: service_resource_overload_level == 3 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7009 |
Metric Used | service_resource_overload_level |
Resolution |
The alert will be cleared when CPU Memory for backend-svc goes below 80% |
5.3.2.8.4 SEPPServiceOverload90Percent
Table 5-239 SEPPServiceOverload90Percent
Trigger Condition | CPU memory of pn32f-svc more than 90% |
Severity | Critical |
Alert details provided | Summary: Backend service is in overload with load level > 90% Expression: service_resource_overload_level == 4 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7010 |
Metric Used | service_resource_overload_level |
Resolution |
The alert will be cleared when CPU Memory for backend-svc goes below 90% |
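Each overload alert tests the `service_resource_overload_level` metric for one exact value, so the metric value selects a single alert. The Python sketch below collects the four mappings from the tables above; the dictionary and function names are hypothetical.

```python
# Level values and severities as listed in the four overload alert tables.
OVERLOAD_ALERTS = {
    1: ("SEPPServiceOverload65Percent", "Warning"),
    2: ("SEPPServiceOverload70Percent", "Minor"),
    3: ("SEPPServiceOverload80Percent", "Major"),
    4: ("SEPPServiceOverload90Percent", "Critical"),
}


def overload_alert(level):
    """Return (alert name, severity) for an overload level, or None."""
    # Any level without a table entry (for example 0) raises no alert here.
    return OVERLOAD_ALERTS.get(level)
```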
5.3.2.9 Hosted SEPP Alerts
5.3.2.9.1 SEPPPn32fHSRoutingFailureAlert
Table 5-240 SEPPPn32fHSRoutingFailureAlert
Trigger Condition | When the routing failure rate at the Pn32f service is greater than 20 percent. |
Severity | Major |
Alert details provided | Allowed P-RSS Validation failure at Roaming Hub
Expression ((sum by(namespace, app, nfInstanceId, pod) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod) (ocsepp_pn32f_requests_total))) > 0.2 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4013 |
Metric Used | ocsepp_allowed_p_rss_routing_failure_total , ocsepp_pn32f_requests_total |
Resolution | The alert gets automatically cleared when the failure rate at pn32f microservice goes below 20 percent. |
5.3.2.9.2 SEPPCn32fHSRoutingFailureAlertMinor
Table 5-241 SEPPCn32fHSRoutingFailureAlertMinor
Field | Details |
---|---|
Trigger Condition | When the routing failure rate at the Cn32f service is greater than 50 percent. |
Severity | Minor |
Alert details provided | Allowed P-RSS Validation failure at Roaming Hub for Consumer SEPP.
Expression ((sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_cn32f_requests_total))) > 0.5 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4014 |
Metric Used | ocsepp_allowed_p_rss_routing_failure_total , ocsepp_cn32f_requests_total |
Resolution | The alert gets automatically cleared when the failure rate at cn32f microservice goes below 50 percent. |
5.3.2.9.3 SEPPCn32fHSRoutingFailureAlertMajor
Table 5-242 SEPPCn32fHSRoutingFailureAlertMajor
Field | Details |
---|---|
Trigger Condition | When the routing failure rate at the Cn32f service is greater than 60 percent. |
Severity | Major |
Alert details provided | Allowed P-RSS Validation failure at Roaming Hub for Consumer SEPP.
Expression ((sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_cn32f_requests_total))) > 0.6 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4015 |
Metric Used | ocsepp_allowed_p_rss_routing_failure_total, ocsepp_cn32f_requests_total |
Resolution | The alert gets automatically cleared when the failure rate at cn32f microservice goes below 60 percent. |
5.3.2.9.4 SEPPCn32fHSRoutingFailureAlertCritical
Table 5-243 SEPPCn32fHSRoutingFailureAlertCritical
Field | Details |
---|---|
Trigger Condition | When the routing failure rate at the Cn32f service is greater than 65 percent. |
Severity | Critical |
Alert details provided | Allowed P-RSS Validation failure at Roaming Hub for Consumer SEPP.
Expression ((sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_cn32f_requests_total))) > 0.65 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4016 |
Metric Used | ocsepp_allowed_p_rss_routing_failure_total, ocsepp_cn32f_requests_total |
Resolution | The alert gets automatically cleared when the failure rate at cn32f microservice goes below 65 percent. |
5.3.2.9.5 SEPPCn32fHSRoutingFailureAlertWarning
Table 5-244 SEPPCn32fHSRoutingFailureAlertWarning
Field | Details |
---|---|
Trigger Condition | When the routing failure rate at the Cn32f service is greater than 25 percent. |
Severity | Warning |
Alert details provided | Allowed P-RSS Validation failure at Roaming Hub for Consumer SEPP.
Expression ((sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_cn32f_requests_total))) > 0.25 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4017 |
Metric Used | ocsepp_allowed_p_rss_routing_failure_total, ocsepp_cn32f_requests_total |
Resolution | The alert gets automatically cleared when the failure rate at cn32f microservice goes below 25 percent. |
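Unlike the banded rate-limit alerts, the four Cn32f Hosted SEPP expressions have only a lower bound (`> 0.25`, `> 0.5`, `> 0.6`, `> 0.65`). Read literally, a high failure ratio satisfies several expressions at once. The Python sketch below lists every alert whose threshold is exceeded for a given ratio; the function name and inputs are assumptions for illustration.

```python
# Thresholds copied from the four Cn32f alert expressions, lowest first.
CN32F_HS_THRESHOLDS = [
    (0.25, "SEPPCn32fHSRoutingFailureAlertWarning"),
    (0.5, "SEPPCn32fHSRoutingFailureAlertMinor"),
    (0.6, "SEPPCn32fHSRoutingFailureAlertMajor"),
    (0.65, "SEPPCn32fHSRoutingFailureAlertCritical"),
]


def firing_cn32f_hs_alerts(failures, requests):
    """Return the names of all alerts whose threshold the ratio exceeds."""
    if requests <= 0:
        return []  # no requests, so the ratio is undefined and nothing fires
    ratio = failures / requests
    # Each expression uses a strict ">" comparison against its threshold.
    return [name for threshold, name in CN32F_HS_THRESHOLDS if ratio > threshold]
```

With 55 failures out of 100 requests, the sketch returns the Warning and Minor alerts. Whether the lower-severity alerts are suppressed in a given deployment depends on Alertmanager configuration that this section does not describe.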
5.3.2.10 SEPP Message Feed Alerts
5.3.2.10.1 DDUnreachableFromN32IGW
Table 5-245 DDUnreachableFromN32IGW
Trigger Condition | This alarm is raised when Data Director is not reachable from N32 Ingress Gateway. |
Severity | major |
Alert details provided | Summary (oc_ingressgateway_dd_unreachable{app="n32-ingress-gateway"} == 1) |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4018 |
Metric Used | oc_ingressgateway_dd_unreachable |
Resolution | Alert gets cleared automatically when the connection with Data Director is established. |
5.3.2.10.2 DDUnreachableFromPLMNIGW
Table 5-246 DDUnreachableFromPLMNIGW
Trigger Condition | This alarm is raised when Data Director is not reachable from PLMN Ingress Gateway. |
Severity | major |
Alert details provided | Summary (oc_ingressgateway_dd_unreachable{app="plmn-ingress-gateway"} == 1) |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4019 |
Metric Used | oc_ingressgateway_dd_unreachable |
Resolution | Alert gets cleared automatically when the connection with Data Director is established. |
5.3.2.10.3 DDUnreachableFromN32EGW
Table 5-247 DDUnreachableFromN32EGW
Trigger Condition | This alarm is raised when Data Director is not reachable from N32 Egress Gateway. |
Severity | major |
Alert details provided | Summary (oc_egressgateway_dd_unreachable{app="n32-egress-gateway"} == 1) |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4020 |
Metric Used | oc_egressgateway_dd_unreachable |
Resolution | Alert gets cleared automatically when the connection with Data Director is established. |
5.3.2.10.4 DDUnreachableFromPLMNEGW
Table 5-248 DDUnreachableFromPLMNEGW
Trigger Condition | This alarm is raised when Data Director is not reachable from PLMN Egress Gateway. |
Severity | major |
Alert details provided | Summary (oc_egressgateway_dd_unreachable{app="plmn-egress-gateway"} == 1) |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4021 |
Metric Used | oc_egressgateway_dd_unreachable |
Resolution | Alert gets cleared automatically when the connection with Data Director is established. |
5.3.2.11 Steering of Roaming (SOR) Alerts
5.3.2.11.1 SEPPPn32fSORFailureAlertPercent30to40
Table 5-249 SEPPPn32fSORFailureAlertPercent30to40
Field | Details |
---|---|
Trigger Condition | 30% to 40% of SOR traffic results in failure. |
Severity | Minor |
Alert details provided | Summary:
'namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}' Expression: sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)>=0.3 and sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)<0.4 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4022 |
Metric Used | ocsepp_pn32f_sor_failure_total and ocsepp_pn32f_sor_requests_total |
Resolution |
This alert is raised when the percentage of failed SOR responses is in the range 30% to 40% in the samples collected over the last 2 minutes. Possible Resolutions:
|
5.3.2.11.2 SEPPPn32fSORFailureAlertPercent40to50
Table 5-250 SEPPPn32fSORFailureAlertPercent40to50
Field | Details |
---|---|
Trigger Condition | 40% to 50% of SOR traffic results in failure. |
Severity | Major |
Alert details provided | Summary:
'namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}' Expression: sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)>=0.4 and sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)<0.5 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4023 |
Metric Used | ocsepp_pn32f_sor_failure_total and ocsepp_pn32f_sor_requests_total |
Resolution |
This alert is raised when the percentage of failed SOR responses is in the range 40% to 50% in the samples collected over the last 2 minutes. Possible Resolutions:
|
5.3.2.11.3 SEPPPn32fSORFailureAlertPercentAbove50
Table 5-251 SEPPPn32fSORFailureAlertPercentAbove50
Field | Details |
---|---|
Trigger Condition | 50% or more of SOR traffic results in failure. |
Severity | Critical |
Alert details provided | Summary:
'namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}' Expression: sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)>=0.5 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4024 |
Metric Used | ocsepp_pn32f_sor_failure_total and ocsepp_pn32f_sor_requests_total |
Resolution |
This alert is raised when the percentage of failed SOR responses is 50% or higher in the samples collected over the last 2 minutes. Possible Resolutions:
|
5.3.2.11.4 SEPPPn32fSORTimeoutFailureAlert
Table 5-252 SEPPPn32fSORTimeoutFailureAlert
Field | Details |
---|---|
Trigger Condition | Increase of more than five timeout errors in last two minutes for SOR. |
Severity | critical |
Alert details provided | Summary:
'namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}' Expression: idelta(ocsepp_pn32f_sor_timeout_failure_total[2m]) > 5 or (ocsepp_pn32f_sor_timeout_failure_total unless ocsepp_pn32f_sor_timeout_failure_total offset 2m) |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4025 |
Metric Used | ocsepp_pn32f_sor_timeout_failure_total |
Resolution |
This alert is raised when the responses received from the SOR server indicate that the server is either down or unreachable, and more than five such errors are counted in the samples collected over the last two minutes. Possible Resolutions:
|
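The three SOR failure-percentage alerts partition the failure ratio into the bands 30% to 40%, 40% to 50%, and 50% or higher. The Python sketch below restates those bands for a single (namespace, nf_instance_id, app) group. The function name is hypothetical, and the inputs stand in for the two `rate(...[2m])` sums used in the expressions.

```python
def sor_failure_alert(failure_rate, request_rate):
    """Return (alert name, severity) for the SOR failure ratio, or None."""
    if request_rate <= 0:
        return None  # no SOR requests in the window, so no ratio
    ratio = failure_rate / request_rate
    # Check the highest band first so each ratio maps to exactly one alert.
    if ratio >= 0.5:
        return "SEPPPn32fSORFailureAlertPercentAbove50", "Critical"
    if ratio >= 0.4:
        return "SEPPPn32fSORFailureAlertPercent40to50", "Major"
    if ratio >= 0.3:
        return "SEPPPn32fSORFailureAlertPercent30to40", "Minor"
    return None
```

The timeout alert is independent of these bands: it is driven by `ocsepp_pn32f_sor_timeout_failure_total` and can be active at the same time as any of them.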
5.3.2.12 Global Rate Limiting on Ingress Gateway of SEPP Alerts
5.3.2.12.1 Ingress RSS Rate Limit per RSS Message Drop Above Point one Percent Alert
Table 5-253 Ingress RSS Rate Limit per RSS Message Drop Above Point one Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate per RSS is detected above 0.1 percent of total transactions of that RSS, this metric will be pegged and corresponding alert will be raised. |
Severity | Warning |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate per RSS detected above 0.1 Percent of Total Transactions of that RSS Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 0.1 < 10 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7011 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.12.2 Ingress RSS Rate Limit per RSS Message Drop Above 10 Percent Alert
Table 5-254 Ingress RSS Rate Limit per RSS Message Drop Above 10 Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate per RSS is detected above 10 percent of total transactions of that RSS, this metric will be pegged and corresponding alert will be raised. |
Severity | Minor |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate per RSS detected above 10 Percent of Total Transactions of that RSS Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 10 < 25 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7012 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.12.3 Ingress RSS Rate Limit per RSS Message Drop Above 25 Percent Alert
Table 5-255 Ingress RSS Rate Limit per RSS Message Drop Above 25 Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate per RSS is detected above 25 percent of total transactions of that RSS, this metric will be pegged and corresponding alert will be raised. |
Severity | Major |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate per RSS detected above 25 Percent of Total Transactions of that RSS Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 25 < 50 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7013 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.12.4 Ingress RSS Rate Limit per RSS Message Drop Above 50 Percent Alert
Table 5-256 Ingress RSS Rate Limit per RSS Message Drop Above 50 Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate per RSS is detected above 50 percent of total transactions of that RSS, this metric will be pegged and corresponding alert will be raised. |
Severity | Critical |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate per RSS detected above 50 Percent of Total Transactions of that RSS Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 50 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7014 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.12.5 Ingress RSS Rate Limit Message Drop Above Point one Percent Alert
Table 5-257 Ingress RSS Rate Limit Message Drop Above Point one Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 0.1 percent of total transactions, this metric will be pegged and corresponding alert will be raised. |
Severity | Warning |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate detected above 0.1 Percent of Total Transactions Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (namespace) *100 >= 0.1 < 1 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7015 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.12.6 Ingress RSS Rate Limit Message Drop Above one Percent Alert
Table 5-258 Ingress RSS Rate Limit Message Drop Above one Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 1 percent of total transactions, this metric will be pegged and corresponding alert will be raised. |
Severity | Warning |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate detected above 1 Percent of Total Transactions Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (namespace) *100 >= 1 < 10 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7016 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.12.7 Ingress RSS Rate Limit Message Drop Above 10 Percent Alert
Table 5-259 Ingress RSS Rate Limit Message Drop Above 10 Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 10 percent of total transactions, this metric will be pegged and corresponding alert will be raised. |
Severity | Minor |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate detected above 10 Percent of Total Transactions. Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 10 < 25 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7017 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.12.8 Ingress RSS Rate Limit Message Drop Above 25 Percent Alert
Table 5-260 Ingress RSS Rate Limit Message Drop Above 25 Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 25 percent of total transactions, this metric will be pegged and corresponding alert will be raised. |
Severity | Major |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate detected above 25 Percent of Total Transactions Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 25 < 50 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7018 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.12.9 Ingress RSS Rate Limit Message Drop Above 50 Percent Alert
Table 5-261 Ingress RSS Rate Limit Message Drop Above 50 Percent Alert
Trigger Condition | If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 50 percent of total transactions, this metric will be pegged and corresponding alert will be raised. |
Severity | Critical |
Alert Details Provided |
Summary: Ingress RSS Based Rate Limiting Message Drop Rate detected above 50 Percent of Total Transactions Expression: sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 50 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.7019 |
Metric Name | oc_ingressgateway_rss_ratelimit_total |
Resolution | The alert gets cleared when the count goes down. |
5.3.2.13 Cat-0 SBI Message Schema Validation Alerts
5.3.2.13.1 SEPPN32fMessageValidationOnHeaderFailureMinorAlert
Table 5-262 SEPPN32fMessageValidationOnHeaderFailureMinorAlert
Field | Details |
---|---|
Trigger Condition | Message validation failed for request query parameters for at least 40% and less than 60% of the requests on which message validation was applied in the last 2 minutes. |
Severity | minor |
Alert Details Provided |
Summary: Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }} Expression: (sum(rate(ocsepp_message_validation_on_header_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 40 < 60 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4026 |
Metric Used | ocsepp_message_validation_on_header_failure_total |
Resolution | The alert gets cleared when the failure percentage is no longer between 40 and 60. |
5.3.2.13.2 SEPPN32fMessageValidationOnHeaderFailureMajorAlert
Table 5-263 SEPPN32fMessageValidationOnHeaderFailureMajorAlert
Field | Description |
---|---|
Trigger Condition | Message validation failed for request query parameters for at least 60% and less than 80% of the requests on which message validation was applied in the last 2 minutes. |
Severity | major |
Alert Details Provided |
Summary: Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }} Expression: (sum(rate(ocsepp_message_validation_on_header_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 60 < 80 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4027 |
Metric Name | ocsepp_message_validation_on_header_failure_total |
Resolution | The alert gets cleared when the failure percentage is no longer between 60 and 80. Possible Resolutions:
|
5.3.2.13.3 SEPPN32fMessageValidationOnHeaderFailureCriticalAlert
Table 5-264 SEPPN32fMessageValidationOnHeaderFailureCriticalAlert
Field | Description |
---|---|
Trigger Condition | Message validation failed for request query parameters for 80% or more of the requests on which message validation was applied in the last 2 minutes. |
Severity | critical |
Alert Details Provided |
Summary: Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }} Expression: (sum(rate(ocsepp_message_validation_on_header_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 80 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4028 |
Metric Name | ocsepp_message_validation_on_header_failure_total |
Resolution | The alert gets cleared when the failure percentage falls below 80. |
5.3.2.13.4 SEPPN32fMessageValidationOnBodyFailureMinorAlert
Table 5-265 SEPPN32fMessageValidationOnBodyFailureMinorAlert
Field | Description |
---|---|
Trigger Condition | Message validation failed for request body for at least 40% and less than 60% of the requests on which message validation was applied in the last 2 minutes. |
Severity | minor |
Alert Details Provided |
Summary: Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }} Expression: (sum(rate(ocsepp_message_validation_on_body_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 40 < 60 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4029 |
Metric Name | ocsepp_message_validation_on_body_failure_total |
Resolution | The alert gets cleared when the failure percentage is no longer between 40 and 60. |
5.3.2.13.5 SEPPN32fMessageValidationOnBodyFailureMajorAlert
Table 5-266 SEPPN32fMessageValidationOnBodyFailureMajorAlert
Field | Details |
---|---|
Trigger Condition | Message validation failed for request body for at least 60% and less than 80% of the requests on which message validation was applied in the last 2 minutes. |
Severity | major |
Alert Details Provided |
Summary: Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }} Expression: (sum(rate(ocsepp_message_validation_on_body_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 60 < 80 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4030 |
Metric Name | ocsepp_message_validation_on_body_failure_total |
Resolution | The alert gets cleared when the failure percentage is no longer between 60 and 80. |
5.3.2.13.6 SEPPN32fMessageValidationOnBodyFailureCriticalAlert
Table 5-267 SEPPN32fMessageValidationOnBodyFailureCriticalAlert
Field | Details |
---|---|
Trigger Condition | Message validation failed for the request body for 80% or more of requests (on which message validation was applied) in the last 2 minutes. |
Severity | critical |
Alert Details Provided |
Summary: Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }}
Expression: (sum(rate(ocsepp_message_validation_on_body_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 80 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4031 |
Metric Name | ocsepp_message_validation_on_body_failure_total |
Resolution | The alert is cleared when the failure percentage falls below 80. |
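The three body validation alerts split the failure percentage into bands: 40 to 60 (minor), 60 to 80 (major), and 80 or more (critical). The following is a minimal sketch of that banding, not part of the SEPP alert rules. The function name classify_body_failure and its two arguments (failed and applied request counts over the same window) are hypothetical and used only for illustration. The real alerts compare per-second rates over 2 minutes, which gives the same ratio when both counters cover the same window.

```shell
#!/bin/sh
# Hypothetical helper: map failed/applied counts to the severity band
# used by the body validation alerts (>= 40 minor, >= 60 major, >= 80 critical).
classify_body_failure() {
  awk -v failed="$1" -v applied="$2" 'BEGIN {
    # No validation applied means there is no ratio to evaluate.
    if (applied <= 0) { print "none"; exit }
    # Multiply before dividing to keep whole-number percentages exact.
    pct = (failed * 100) / applied
    if (pct >= 80)      print "critical"
    else if (pct >= 60) print "major"
    else if (pct >= 40) print "minor"
    else                print "none"
  }'
}

classify_body_failure 45 100   # prints: minor
```

Because each band has an upper bound, only one of the three alerts is active at any time for a given label set.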
5.3.2.14 Cat-1 Service API Validation Alerts
5.3.2.14.1 SEPPN32fServiceApiValidationFailureAlert
Table 5-268 SEPPN32fServiceApiValidationFailureAlert
Trigger Condition | Service API not in allowed list |
Severity | Major |
Alert details provided | Summary: N32f: Service API not in allowed list
Expression: delta(ocsepp_security_service_api_failure_total[2m]) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4005 |
Metric Used | ocsepp_security_service_api_failure_total |
Resolution 1 |
This alert is raised when there is a difference of at least 1 between the first and last data points in the samples collected in the last 2 minutes. The alert is cleared after 2 minutes. Possible Resolutions:
|
Resolution 2 |
The alert gets cleared when the N32C Handshake is established after successful TCP connection to remote SEPP. Steps: The failure reason is present in the alert. Possible Resolutions:
|
5.3.2.15 Cat-2 Network ID Validation Alerts
5.3.2.15.1 SEPPN32fNetworkIDValidationHeaderFailureAlert
Table 5-269 SEPPN32fNetworkIDValidationHeaderFailureAlert
Field | Details |
---|---|
Trigger Condition | If Network ID Validation for the header fails, this metric is pegged and the corresponding alert is raised. |
Severity | Major |
Alert details provided | Summary: 'namespace: {{ $labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Network ID Validation has failed because {{ $labels.cause }}'
Expression: sum(increase(ocsepp_network_id_validation_header_failure_total[2m]) >0 or (ocsepp_network_id_validation_header_failure_total unless ocsepp_network_id_validation_header_failure_total offset 2m )) by (namespace, remote_sepp_name, nf_instance_id, peer_fqdn, plmn_identifier, app, resource_uri, pod) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4011 |
Metric Used | ocsepp_network_id_validation_header_failure_total |
Resolution | The alert is cleared when the expression no longer returns a value greater than 0, that is, when no new failures are pegged in the 2-minute window. |
5.3.2.15.2 SEPPN32fNetworkIDValidationBodyIEFailureAlert
Table 5-270 SEPPN32fNetworkIDValidationBodyIEFailureAlert
Field | Details |
---|---|
Trigger Condition | If Network ID Validation for the body fails, this metric is pegged and the corresponding alert is raised. |
Severity | Major |
Alert details provided | Summary: 'namespace: {{ $labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Network ID Body Validation has failed because {{ $labels.cause }}'
Expression: sum(increase(ocsepp_network_id_validation_body_failure_total[2m]) >0 or (ocsepp_network_id_validation_body_failure_total unless ocsepp_network_id_validation_body_failure_total offset 2m )) by (namespace, remote_sepp_name, nf_instance_id, peer_fqdn, plmn_identifier, app, resource_uri, pod) > 0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4012 |
Metric Used | ocsepp_network_id_validation_body_failure_total |
Resolution | The alert is cleared when the expression no longer returns a value greater than 0, that is, when no new failures are pegged in the 2-minute window. |
5.3.2.16 Cat-3 Previous Location Check Alerts
5.3.2.16.1 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercent30to40
Table 5-271 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercent30to40
Trigger Condition | This alert is raised when the previous location check validation failure rate is at least 30 percent and less than 40 percent of total transactions. |
Severity | Minor |
Alert Details Provided |
Summary: Previous location check validation failure detected between 30 to 40 Percent of Total Transactions
Expression: sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.3 and sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)<0.4 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4032 |
Metric Name | ocsepp_previous_location_validation_failure_total |
Resolution | The alert is cleared when the previous location check validation failure rate is no longer between 30 and 40 percent of total transactions. |
5.3.2.16.2 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercent40to50
Table 5-272 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercent40to50
Trigger Condition | This alert is raised when the previous location check validation failure rate is at least 40 percent and less than 50 percent of total transactions. |
Severity | Major |
Alert Details Provided |
Summary: Previous location check validation failure detected between 40 to 50 Percent of Total Transactions
Expression: sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.4 and sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)<0.5 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4033 |
Metric Name | ocsepp_previous_location_validation_failure_total |
Resolution | The alert is cleared when the previous location check validation failure rate is no longer between 40 and 50 percent of total transactions. |
5.3.2.16.3 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercentAbove50
Table 5-273 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercentAbove50
Trigger Condition | This alert is raised when the previous location check validation failure rate is 50 percent or more of total transactions. |
Severity | Critical |
Alert Details Provided |
Summary: Previous location check validation failure detected above 50 Percent of Total Transactions
Expression: sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.5 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4034 |
Metric Name | ocsepp_previous_location_validation_failure_total |
Resolution | The alert is cleared when the previous location check validation failure rate falls below 50 percent of total transactions. |
5.3.2.16.4 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercent30to40
Table 5-274 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercent30to40
Trigger Condition | This alert is raised when the previous location check exception failure rate is at least 30 percent and less than 40 percent of total transactions. |
Severity | Minor |
Alert Details Provided |
Summary: Previous location check exception failure detected between 30 to 40 Percent of Total Transactions
Expression: sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.3 and sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)<0.4 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4035 |
Metric Name | ocsepp_previous_location_exception_failure_total |
Resolution | The alert is cleared when the previous location check exception failure rate is no longer between 30 and 40 percent of total transactions. |
5.3.2.16.5 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercent40to50
Table 5-275 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercent40to50
Trigger Condition | This alert is raised when the previous location check exception failure rate is at least 40 percent and less than 50 percent of total transactions. |
Severity | Major |
Alert Details Provided |
Summary: Previous location check exception failure detected between 40 to 50 Percent of Total Transactions
Expression: sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.4 and sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)<0.5 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4036 |
Metric Name | ocsepp_previous_location_exception_failure_total |
Resolution | The alert is cleared when the previous location check exception failure rate is no longer between 40 and 50 percent of total transactions. |
5.3.2.16.6 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercentAbove50
Table 5-276 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercentAbove50
Trigger Condition | This alert is raised when the previous location check exception failure rate is 50 percent or more of total transactions. |
Severity | Critical |
Alert Details Provided |
Summary: Previous location check exception failure detected above 50 Percent of Total Transactions
Expression: sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.5 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4037 |
Metric Name | ocsepp_previous_location_exception_failure_total |
Resolution | The alert is cleared when the previous location check exception failure rate falls below 50 percent of total transactions. |
5.3.2.17 Rate Limiting for Egress Roaming Signaling per PLMN Alerts
5.3.2.17.1 Egress Request Rate Limit per PLMN Message Drop Above 10 Percent Alert
Table 5-277 Egress Request Rate Limit per PLMN Message Drop Above 10 Percent Alert
Trigger Condition | If requests are dropped because the tokens in the bucket are exhausted and the drop rate per PLMN is at least 10 percent and less than 25 percent of the total transactions of that PLMN, the oc_ingressgateway_plmn_egress_ratelimit_total metric is pegged and the corresponding alert is raised. |
Severity | Minor |
Alert Details Provided |
Summary: Egress Rate Limiting Request Drop Rate detected per PLMN above 10 Percent of Total Transactions
Expression: sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[5m])) by (EgressRateLimitList,PLMN_ID,namespace)/sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total[5m])) by (EgressRateLimitList,PLMN_ID,namespace) *100 >= 10 < 25 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4039 |
Metric Name | oc_ingressgateway_plmn_egress_ratelimit_total |
Resolution | The alert is cleared when the drop rate is no longer between 10 and 25 percent of the total transactions of that PLMN. |
5.3.2.17.2 Egress Request Rate Limit per PLMN Message Drop Above 25 Percent Alert
Table 5-278 Egress Request Rate Limit per PLMN Message Drop Above 25 Percent Alert
Trigger Condition | If requests are dropped because the tokens in the bucket are exhausted and the drop rate per PLMN is at least 25 percent and less than 50 percent of the total transactions of that PLMN, the oc_ingressgateway_plmn_egress_ratelimit_total metric is pegged and the corresponding alert is raised. |
Severity | Major |
Alert Details Provided |
Summary: Egress Rate Limiting Request Drop Rate detected per PLMN above 25 Percent of Total Transactions
Expression: sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[5m])) by (EgressRateLimitList,PLMN_ID,namespace)/sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total[5m])) by (EgressRateLimitList,PLMN_ID,namespace) *100 >= 25 < 50 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4040 |
Metric Name | oc_ingressgateway_plmn_egress_ratelimit_total |
Resolution | The alert is cleared when the drop rate is no longer between 25 and 50 percent of the total transactions of that PLMN. |
5.3.2.17.3 Egress Request Rate Limit per PLMN Message Drop Above 50 Percent Alert
Table 5-279 Egress Request Rate Limit per PLMN Message Drop Above 50 Percent Alert
Trigger Condition | If requests are dropped because the tokens in the bucket are exhausted and the drop rate per PLMN is 50 percent or more of the total transactions of that PLMN, the oc_ingressgateway_plmn_egress_ratelimit_total metric is pegged and the corresponding alert is raised. |
Severity | Critical |
Alert Details Provided |
Summary: Egress Rate Limiting Request Drop Rate detected per PLMN above 50 Percent of Total Transactions
Expression: sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[5m])) by (EgressRateLimitList,PLMN_ID,namespace)/sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total[5m])) by (EgressRateLimitList,PLMN_ID,namespace) *100 >= 50 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4041 |
Metric Name | oc_ingressgateway_plmn_egress_ratelimit_total |
Resolution | The alert is cleared when the drop rate falls below 50 percent of the total transactions of that PLMN. |
5.3.2.18 Separate Port Configurations for N32c and N32f on the Egress Routes Alerts
5.3.2.18.1 EgressInterfaceConnectionFailure
Table 5-280 EgressInterfaceConnectionFailure
Field | Details |
---|---|
Trigger Condition | If the destination host and port mentioned in the Remote profile are unreachable or not available, then the alert will be raised. |
Severity | Major |
Alert Details Provided |
Summary: Egress connection failure on the interface
Expression: sum(increase(oc_egressgateway_connection_failure_total{app="n32-egress-gateway"}[5m])) by (namespace,app,Host,Port) >0 |
OID | 1.3.6.1.4.1.323.5.3.46.1.2.4042 |
Metric Name | oc_egressgateway_connection_failure_total |
Resolution | If the destination host and port are reachable, then the alert will be cleared. |
5.4 SEPP Alert Configuration
This section describes the measurement-based alert rules configuration for SEPP. The Alert Manager evaluates the conditions in the alert rules against the Prometheus measurement values reported by the microservices and triggers alerts accordingly.
5.4.1 Configuring SEPP Alerts in OCI
The following procedure describes how to configure the SEPP alerts for OCI. OCI supports metric expressions written in MQL (Metric Query Language) and therefore requires a new SEPP alert file for configuring alerts in the OCI observability platform.
The following are the steps:
- Run the following command to extract the .zip file:
  unzip ocsepp_oci_alertrules_<version>.zip
  The ocsepp_oci and ocsepp_oci_resources folders are available in the zip file.
  Note:
  The zip file is available in the Scripts folder of CSAR package.
- Open the ocsepp_oci folder. In the notifications.tf file, update the parameter endpoint with the email id of the user.
- Open the ocsepp_oci_resources folder. In the notifications.tf file, update the parameter endpoint with the email id of the user.
- Log in to the OCI Console.
  Note:
  For more details about logging in to the OCI, refer to Signing In to the OCI Console.
- Open the navigation menu and select Developer Services. The Developer Services window appears on the right pane.
- Under the Developer Services, select Resource Manager.
- Under Resource Manager, select Stacks. The Stacks window appears.
- Click Create Stack.
- Select the default My Configuration radio button.
- Under Stack configuration, select the folder radio button and upload the ocsepp_oci folder.
- Enter the Name and Description and select the compartment.
- Select the latest Terraform version from the Terraform version drop-down.
- Click Next. The Edit Stack screen appears.
- Enter the required inputs to create the SEPP alerts or alarms and click Save and Run Apply.
  The required inputs are:
  - Alarms Configuration
    - Compartment Name - Choose the name of the compartment from the drop-down.
    - Metric namespace - Metric namespace that the user provided while deploying OCI Adaptors.
    - Topic Name - Any user configurable name. Must contain fewer than 256 characters. Only alphanumeric characters plus hyphens (-) and underscores (_) are allowed.
    - Message Format - Keep it as ONS_OPTIMIZED. (This is pre-populated)
    - Alarm is_enabled - Keep it as True. (This is pre-populated)
- Verify that the alarms are created in the Alarm Definitions screen (OCI Console > Observability & Management > Monitoring > Alarm Definitions).
- Repeat steps 6 to 15 to upload the ocsepp_oci_resources folder. Here, Metric namespace will be pre-populated.
For more details, see Oracle Communications Cloud Native Core, OCI Adaptor Deployment Guide.
5.4.2 Configuring SEPP Alerts for OCCNE 1.8.x and Previous Versions
The following procedure describes how to configure the SEPP alerts for OCCNE version 1.8.x and previous versions:
- Run the following command to find the config map to configure alerts in the Prometheus server:
  kubectl get configmap -n <Namespace>
  where, <Namespace> is the Prometheus server namespace used in the helm install command.
- Run the following command to take a backup of the current config map of the Prometheus server:
  kubectl get configmaps <NAME>-server -o yaml -n <Namespace> > /tmp/tempConfig.yaml
  where, <Namespace> is the Prometheus server namespace used in the helm install command.
  For example, assuming the chart name is "prometheus-alert", "<NAME>-server" becomes "prometheus-alert-server". Run the following command to back up the config map:
  kubectl get configmaps prometheus-alert-server -o yaml -n prometheus-alert2 > /tmp/tempConfig.yaml
- Run the following command to check if alertssepp is present in the tempConfig.yaml file:
  cat /tmp/tempConfig.yaml | grep alertssepp
- Run the following command to delete the alertssepp entry from the tempConfig.yaml file, if alertssepp is present:
  sed -i '/etc\/config\/alertssepp/d' /tmp/tempConfig.yaml
- Run the following command to add the alertssepp entry to the tempConfig.yaml file, if alertssepp is not present:
  sed -i '/rule_files:/a\ \- /etc/config/alertssepp' /tmp/tempConfig.yaml
- Run the following command to reload the config map with the modified file:
  kubectl replace configmap <Name> -f /tmp/tempConfig.yaml
- Run the following command to add the seppAlertRules.yaml file to the Prometheus config map under the file name of the SEPP alert file:
  kubectl patch configmap <Name> -n <Namespace> --type merge --patch "$(cat <PATH>/seppAlertRules.yaml)"
- Restart the prometheus-server pod.
- Verify the alerts in the Prometheus GUI.
Note:
Prometheus takes about 20 seconds to apply the updated Config map.
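The delete and add steps above together ensure that the rule_files list holds exactly one alertssepp entry. The following is a minimal sketch of that idea on a scratch file, assuming a portable grep and awk combination in place of the sed commands from the procedure. The function name add_sepp_rule_file and the sample rule_files content are hypothetical stand-ins, not taken from a real Prometheus config map.

```shell
#!/bin/sh
# Scratch file with hypothetical rule_files content (illustration only).
cfg="$(mktemp)"
trap 'rm -f "$cfg" "$cfg.tmp"' EXIT
cat > "$cfg" <<'EOF'
    rule_files:
    - /etc/config/rules
    - /etc/config/alerts
EOF

# Hypothetical helper: remove any existing alertssepp entry, then add it
# once directly below the rule_files: key, reusing that key's indentation.
add_sepp_rule_file() {
  file="$1"
  grep -v '/etc/config/alertssepp' "$file" > "$file.tmp"
  awk '{ print }
       /rule_files:/ {
         match($0, /^ */)
         print substr($0, 1, RLENGTH) "- /etc/config/alertssepp"
       }' "$file.tmp" > "$file"
  rm -f "$file.tmp"
}

add_sepp_rule_file "$cfg"
add_sepp_rule_file "$cfg"        # running it again does not duplicate the entry
grep -c 'alertssepp' "$cfg"      # prints: 1
```

Removing before adding makes the edit safe to rerun, which is useful when the procedure is repeated after an upgrade.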
5.4.3 Configuring SEPP Alerts for OCCNE 1.9.x and Higher Versions
The following procedure describes how to configure the SEPP alerts for OCCNE 1.9.x and higher versions:
- Run the following command to apply the Prometheus rules Custom Resource Definition (CRD):
  $ kubectl apply -f <file_name> -n <sepp namespace>
  Where,
  - <file_name> is the SEPP alerts file
  - <sepp namespace> is the SEPP namespace
  Example:
  $ kubectl apply -f ocsepp_alerting_rules_promha.yaml -n seppsvc
- Run the following command to check if the SEPP alert file is added to Prometheus rules:
  $ kubectl get prometheusrules --namespace <namespace>
  Example:
  $ kubectl get prometheusrules --namespace seppsvc
- Log in to the Prometheus GUI and verify the alerts section.
Note:
The Prometheus server takes an updated config map that is automatically reloaded after approximately 60 seconds. Refresh the Prometheus GUI to confirm that the SEPP alerts have been reloaded.