5 SEPP Metrics, KPIs, and Alerts

5.1 SEPP Metrics

This section provides information about the SEPP metrics.

Table 5-1 Dimensions

Dimensions Details Values
apiUrl Resource URIs passing across Inter PLMN Via SEPP Example: /namf-comm/v1/subscriptions
app SEPP Service names

Examples:

  • cn32f-svc
  • cn32c-svc
  • pn32f-svc
  • pn32c-svc
  • Plmn Ingress Gateway
  • N32 Ingress Gateway
application application name, here, it is ocsepp. Example: ocsepp
cause Indicates the reason of failure

Examples:

  • Network ID is not present in PLMN ID List
  • UDR Response Is Not Success
  • UE is either not present or unable to extract UE from message
  • Previous Location Check Validation Failed
chart Indicates the SEPP microservice release names

Examples:

  • pn32f-svc-2.23.3-0
  • cn32f-svc-2.23.3-0
container Indicates the name of the container. It is part of each metrics. The app and container contains the same value.

Examples:

  • cn32f-svc
  • cn32c-svc
  • pn32f-svc
  • pn32c-svc
  • Plmn Ingress Gateway
  • N32 Ingress Gateway
DestinationHost Indicates the destination host for Jetty client on PN32F or CN32F

Examples:

  • ocsepp-release-plmn-egress-gateway
  • ocsepp-release-n32-egress-gateway
direction

Direction of the request or response.

In Gateway Metrics, the values are egress and egressOut.

In N32F Metrics the values are ingress and egress.

Examples:

  • Ingress
  • Egress
Egress Rate Limit List The list that contains the PLMN for applying Egress Rate Limit. Example: ERL1
engVersion The SEPP Release version

Example:

  • 23.3.0
error_action The action needs to be taken when there is a validation failure in SEPP.

Examples:

REJECT

FORWARD

ErrorOriginator Name of service that originates the error. Example: PN32F
event The event that occurred on request processing or completion.

Examples:

  • onBegin
  • onHeaders
  • onQueued
  • onContent
  • onCommit
  • onFailure
handshake_procedure The type of the handshake operation at cSEPP or pSEPP. Example: capability-exchange
header SBI Headers

Examples:

  • via
  • server
Host FQDN of the target host Example: ocsepp-release-n32-egress-gateway.
http_error_message Reason for failure response received.

Examples:

  • Context Not Established
  • Destination URI contain invalid PLMN ID
  • Message validation failed
  • N32fContext Not Found
  • org.springframework.web.reactive.function.client.WebClientRequestException: Connect Timeout
http_method HTTP Method Name

Examples:

GET

PUT

POST

PATCH

DELETE

http_status HTTP Status Code in response (404 NOT_FOUND, 429 TOO_MANY_REQUESTS, 200 OK)

Examples:

2xx, 4xx, 5xx

namespace Name of the Kubernetes namespace on which microservice is running. Example: seppsvc
nf_instance_id Unique identity of the NF Instance sending request to OCSEPP. Example: 9faf1bbc-6e4a-4454-a507-aef01a101a06
NfServiceType Name of target network function service Example: nausf-auth
NfType Name of target network function

Examples:

  • ausf
  • udm
  • nrf
node Name of the Kubernetes worker node on which microservice is running. Example: k8s-node-13.chase1.lab.in.oracle.com
peer_domain Domain of Remote SEPP

Example: svc.cluster.local

peer_fqdn FQDN of peer present in Remote SEPP
peer_plmn_id Supported PLMN list of Remote SEPP

Example: "[Plmn [mcc=123, mnc=456]]"

plmn_identifier In CAT 2 Network ID Validation feature, PLMN is extracted from this identifier.

Examples:

  • supi
  • addUeLocation
  • guamiList
pod Name of the pod of SEPP microservice Example: ocsepp-release-cn32f-svc-6fd6ccfd4b-hkgqb
Port Port number Example: 443
release Name of the SEPP release deployed. Example: ocsepp-release
releaseVersion Indicates the current release version of SEPP. Example: 23.4.0
remote_sepp_name Name of the SEPP from where message is received or destined to Example: SEPP-1
remote_sepp_set_name Name of the Remote SEPP Set from where message is received or destined to Example: RPS-3
request_path Resource URI as per defined in 3GPP specifications for 5G. Example: /nudm-sdm/v2/imsi-987654000000008
ruleApplied Rules Applied on Local SEPP or Remote SEPP.

Examples:

  • REMOTE
  • LOCAL
Scheme Indicates the HTTP Scheme

Examples:

  • HTTPS
  • HTTP
sepp_type SEPP that acts as Producer SEPP or Consumer SEPP

Examples:

  • Consumer
  • Producer
sourceRss only if Allowed P-RSS Validation is enabled Example:
Status The status of the feature or microservice.

Examples:

  • accepted
  • dropped
  • ratelimit not applied
vendor For OCSEPP, vendor Value must be set to "oracle"

Example: oracle

5.1.1 Configuring SEPP Metrics Dashboard in OCI

This section describes the steps to upload the ocsepp_oci_dashboard_<version>.json file on OCI Logging Analytics Dashboard. As OCI doesn't support Grafana, OCI uses the Logging Analytics Dashboard Service for visualizing the metrics and logs.

The steps are:
  1. Log in to OCI Console.

    Note:

    For more details about logging in to the OCI, refer to Signing In to the OCI Console.
  2. Open the navigation menu and click Observability & Management.
  3. Under Logging Analytics, click Dashboards. The Dashboards page appears.
  4. Choose the Compartment on the left pane.
  5. Click Import dashboards.
  6. User can select and upload the ocsepp_oci_dashboard_<version>.json file. The following three parameters of json file must be customized before uploading it:
    1. ##COMPARTMENT_ID: The OCID of the compartment.
    2. ##METRIC_NAMESPACE: The metrics namespace that the user provided while deploying OCI adaptor.
    3. ##K8_NAMESPACE: Kubernetes namespace where SEPP is deployed.
  7. Import dashboard page appears. Click Import button on the page.

    User can view the imported dashboard and can view the metrics in the dashboard.

    Note:

    SEPP has organized the panels or widgets in five dashboards to support the SEPP metrics and all the five dashboards have been clubbed into a single JSON file.
For more details, see Oracle Communications Cloud Native Core, OCI Adaptor Deployment Guide.

5.1.2 Common Metrics

5.1.2.1 cgroup_cpu_nanoseconds

Table 5-2 cgroup_cpu_nanoseconds

Field Details
Metric Details Total CPU time consumed by service in nanoseconds
Microservice Consumer N32f, Producer N32f
Type Hologram
Dimensions
  • app (Consumer, Producer)
  • chart
  • service_resource_overload_level
  • container
  • engVersion
  • exported_application
  • exported_microservice
  • exported_namespace
  • exported_pod
  • exported_vendor
  • microservice
  • mktgVersion
  • namespace
  • node
  • pod
  • security_istio_io_tlsMode
  • service_istio_io_canonical_name
  • service_istio_io_canonical_revision
  • vendor "Oracle"
5.1.2.2 cgroup_memory_bytes

Table 5-3 cgroup_memory_bytes

Field Details
Metric Details Total memory consumed by service in bytes
Microservice Consumer N32f, Producer N32f
Type Gauge
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • engVersion
  • exported_application
  • exported_microservice
  • exported_namespace
  • exported_pod
  • exported_vendor
  • microservice
  • mktgVersion
  • namespace
  • node
  • pod
  • security_istio_io_tlsMode
  • service_istio_io_canonical_name
  • service_istio_io_canonical_revision
  • vendor "Oracle"
5.1.2.3 oc_configclient_request_total

Table 5-4 oc_configclient_request_total

Metric Details This metric will be pegged whenever config client is polling for configuration update from common configuration server
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • app (Plmn Ingress Gateway, N32 Ingress Gateway)
  • application
  • chart
  • configVersion
  • container
  • engVersion
  • microservice
  • mktgVersion
  • namespace
  • pod
  • releaseVersion
  • security_istio_io_tlsMode
  • service_istio_io_canonical_name
  • service_istio_io_canonical_revision
  • vendor "Oracle"
5.1.2.4 oc_configclient_response_total

Table 5-5 oc_configclient_response_total

Metric Details This metrics will be pegged whenever config client receives response from common configuration server
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • app (Plmn Ingress Gateway, N32 Ingress Gateway)
  • application
  • chart
  • configVersion
  • container
  • engVersion
  • microservice
  • mktgVersion
  • namespace
  • pod
  • releaseVersion
  • security_istio_io_tlsMode
  • service_istio_io_canonical_name
  • service_istio_io_canonical_revision
  • vendor "Oracle"
5.1.2.5 oc_configserver_reachability

Table 5-6 oc_configserver_reachability

Metric Details Gauge metric to peg the reachability of config server
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Gauge
Dimensions
  • app (Plmn Ingress Gateway, N32 Ingress Gateway)
  • application
  • chart
  • container
  • endpoint
  • engVersion
  • microservice
  • mktgVersion
  • namespace
  • pod
  • releaseVersion
  • security_istio_io_tlsMode
  • service_istio_io_canonical_name
  • service_istio_io_canonical_revision
  • vendor "Oracle"
5.1.2.6 service_resource_overload_level

Table 5-7 service_resource_overload_level

Field Details
Metric Details Overload level value for Warning, Minor, Major, and Critical.

Microservice Performance
Type Gauge
Dimensions
  • namespace
  • app
  • value
  • metric

5.1.3 CN32F Common Metrics

5.1.3.1 ocsepp_cn32f_jetty_request_stat_metrics_total

Table 5-8 ocsepp_cn32f_jetty_request_stat_metrics_total

Metric Details This metric will be pegged for every event occurred when a request is sent to CN32F
Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • app (Consumer)
  • chart
  • client_type
  • container
  • DestinationHost
  • event
  • nf_instance_id
  • namespace
  • pod
5.1.3.2 ocsepp_cn32f_jetty_response_stat_metrics_total

Table 5-9 ocsepp_cn32f_jetty_response_stat_metrics_total

Field Details
Metric Details This metric will be pegged for every event occurred when a response is received from CN32F
Microservice Consumer N32f
Type Counter
Dimensions
  • app (Consumer)
  • chart
  • client_type
  • container
  • DestinationHost
  • event
  • nf_instance_id
  • namespace
  • pod
5.1.3.3 ocsepp_cn32f_connection_failure_total

Table 5-10 ocsepp_cn32f_connection_failure_total

Field Details
Metric Details This metric will be pegged in the customized Jetty Client as soon as it fails to connect to the destination service.
Microservice Consumer N32f
Type Counter
Dimensions
  • app (Consumer)
  • chart
  • container
  • direction
  • ErrorOriginator
  • http_error_message
  • Host
  • nf_instance_id
  • namespace
  • pod
  • Port
5.1.3.4 ocsepp_cn32f_requests_failure_total

Table 5-11 ocsepp_cn32f_requests_failure_total

Metric Details

Total number of requests failed to be sent from cn32f to remote SEPP.

Condition: When any error or exception occurs on cn32f side because of which request is not sent to pn32f.

Microservice Consumer N32f
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor "Oracle"
5.1.3.5 ocsepp_cn32f_response_failure_total

Table 5-12 ocsepp_cn32f_response_failure_total

Field Details
Metric Details

Total number of response failed to be sent from cn32f pod to NF.

Condition: When any error or exception occurs on cn32f and request is not sent to NF.

Microservice Consumer N32f
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor "Oracle"
5.1.3.6 ocsepp_cn32f_requests_total

Table 5-13 ocsepp_cn32f_requests_total

Metric Details

Total number of requests received from NF.

Condition:When a request is received on InboundInterface of cn32f.

Microservice Consumer N32f
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType

  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • sourceRss (only if Allowed P-RSS Validation is enabled)
  • vendor "Oracle"
5.1.3.7 ocsepp_cn32f_response_total

Table 5-14 ocsepp_cn32f_response_total

Field Details
Metric Details

Total number of response received from remote SEPP.

Condition: When a response is received on OutboundInterface of cn32f.

Microservice Consumer N32f
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType

  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • sourceRss (only if Allowed P-RSS Validation is enabled)
  • http_status
  • vendor "Oracle"
5.1.3.8 ocsepp_cn32f_latency_seconds_count

Table 5-15 ocsepp_cn32f_latency_seconds_count

Field Details
Metric Details

This metric is used to display the number of ingress requests processed at cn32f in a particular time span (in seconds).

Microservice Consumer N32f
Type Histogram
Dimensions
  • app(consumer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor "Oracle"
5.1.3.9 ocsepp_cn32f_latency_seconds_max

Table 5-16 ocsepp_cn32f_latency_seconds_max

Field Details
Metric Details This metrics is used to display the maximum of processing time of an ingress request at cn32f in seconds.
Microservice Consumer N32f
Type Histogram
Dimensions
  • app(consumer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor "Oracle"
5.1.3.10 ocsepp_cn32f_latency_seconds_sum

Table 5-17 ocsepp_cn32f_latency_seconds_sum

Field Details
Metric Details This metrics is used to display the average of processing time of all the ingress request at cn32f for a particular time.
Microservice Consumer N32f
Type Histogram
Dimensions
  • app(consumer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • nf_service_type
  • vendor "Oracle"
5.1.3.11 ocsepp_cn32f_outgoing_connections

Table 5-18 ocsepp_cn32f_outgoing_connections

Field Details
Metric Details Gauge metric that will peg active outgoing connections from CN32F to destination
Microservice Consumer N32f
Type Gauge
Dimensions
  • app (Consumer)
  • chart
  • container
  • direction
  • Host
  • nf_instance_id
  • namespace
  • pod
5.1.3.12 ocsepp_cn32f_server_latency

Table 5-19 ocsepp_cn32f_server_latency

Field Details
Metric Details This metric will be pegged in Jetty response listener that captures the amount of time taken for processing of the request by jetty client
Microservice Consumer N32f
Type Gauge
Dimensions
  • method
  • instanceIdentifier
  • host

5.1.4 PN32F Common Metrics

5.1.4.1 ocsepp_pn32f_requests_total

Table 5-20 ocsepp_pn32f_requests_total

Metric Details

Number of requests received from Peer SEPP.

Condition: When a request reaches pn32f from peer SEPP.

Microservice Producer N32f
Type Counter
Dimensions
  • app(producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor "Oracle"
5.1.4.2 ocsepp_pn32f_requests_failure_total

Table 5-21 ocsepp_pn32f_requests_failure_total

Metric Details Number of requests transmitted to NF.

Condition: When a request transmits a message to a NF.
Microservice Producer N32f
Type Counter
Dimensions
  • app(producer)
  • chart
  • container
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor
5.1.4.3 ocsepp_pn32f_responses_total

Table 5-22 ocsepp_pn32f_responses_total

Metric Details

Number of responses received from Egress Gateway.

Condition: When a response reaches pn32f from Egress Gateway.

Microservice Producer N32f
Type Counter
Dimensions
  • app(producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status (201 CREATED, 404 NOT_FOUND)
  • vendor "Oracle"
5.1.4.4 ocsepp_pn32f_responses_failure_total

Table 5-23 ocsepp_pn32f_responses_failure_total

Metric Details

Number of responses transmitted to Consumer SEPP (cSEPP).

Condition: When a response transmits a message to cSEPP.

Microservice Producer N32f
Type Counter
Dimensions
  • app (producer)
  • chart
  • container
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor
5.1.4.5 ocsepp_pn32f_latency_seconds_count

Table 5-24 ocsepp_pn32f_latency_seconds_count

Metric Details This metric is used to display the number of ingress requests processed at pn32f in a particular time span (in seconds).
Microservice Producer N32f
Type Histogram
Dimensions
  • app (producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor
5.1.4.6 ocsepp_pn32f_latency_seconds_sum

Table 5-25 ocsepp_pn32f_latency_seconds_sum

Metric Details This metrics is used to display the average processing time of all the ingress request at pn32f for a particular time.
Microservice Producer N32f
Type Histogram
Dimensions
  • app (producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
5.1.4.7 ocsepp_pn32f_latency_seconds_max

Table 5-26 ocsepp_pn32f_latency_seconds_max

Metric Details This metrics is used to display the maximum processing time of all the ingress request at pn32f for a particular time.
Microservice Producer N32f
Type Histogram
Dimensions
  • vendor
  • nfInstanceId
  • peer_fqdn
  • peer_domain
  • plmn_id
  • direction
  • message_type
  • namespace
  • remote_sepp_name
  • targetUrl
  • app
  • container
  • pod
  • release
5.1.4.8 ocsepp_pn32f_connection_failure_total

Table 5-27 ocsepp_pn32f_connection_failure_total

Metric Details This metric will be pegged in the customized Jetty Client as soon as it fails to connect to the destination service.
Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • host
  • port
  • direction
  • instanceIdentifier
  • errorReason
  • errorOriginator
5.1.4.9 ocsepp_pn32f_jetty_request_stat_metrics_total

Table 5-28 ocsepp_pn32f_jetty_request_stat_metrics_total

Field Details
Metric Details This metric will be pegged for every event occurred when a request is sent to PN32F
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • client_type
  • container
  • DestinationHost
  • event
  • nf_instance_id
  • namespace
  • pod
5.1.4.10 ocsepp_pn32f_jetty_response_stat_metrics_total

Table 5-29 ocsepp_pn32f_jetty_response_stat_metrics_total

Field Details
Metric Details This metric will be pegged for every event occurred when a response is received from PN32F
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • client_type
  • container
  • DestinationHost
  • event
  • nf_instance_id
  • namespace
  • pod
5.1.4.11 ocsepp_pn32f_outgoing_connections

Table 5-30 ocsepp_pn32f_outgoing_connections

Field Details
Metric Details Gauge metric that will peg active outgoing connections from PN32F to destination
Microservice Producer N32f
Type Gauge
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • Host
  • nf_instance_id
  • namespace
  • pod
5.1.4.12 ocsepp_pn32f_server_latency

Table 5-31 ocsepp_pn32f_server_latency

Field Details
Metric Details This metric will be pegged in Jetty response listener that captures the amount of time taken for processing of the request by jetty client
Microservice Producer N32f
Type Gauge
Dimensions
  • method
  • instanceIdentifier
  • host

5.1.5 N32C Handshake Procedure Metrics

5.1.5.1 ocsepp_n32c_handshake_failure_attempts_total

Table 5-32 ocsepp_n32c_handshake_failure_attempts_total

Field Details
Metric Details If N32c Handshake procedure fails, this metrics will be pegged and corresponding alarm will be raised.
Microservice Producer and Consumer N32c
Type Counter
Dimensions
  • app(consumer, producer)
  • chart
  • container
  • handshake_procedure
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor
5.1.5.2 ocsepp_n32c_handshake_reInitiation_failure

Table 5-33 ocsepp_n32c_handshake_reInitiation_failure

Field Details
Metric Details If N32c Handshake Reinitiation procedure fails, this metrics will be pegged and corresponding alarm will be raised.
Microservice Consumer N32c, Producer N32c
Type Gauge
Dimensions
  • app(consumer, producer)
  • chart
  • container
  • handshake_procedure
  • namespace
  • peer_domain (optional)
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor
5.1.5.3 ocsepp_cn32c_handshake_requests_total

Table 5-34 ocsepp_cn32c_handshake_requests_total

Field Details
Metric Details

Total number of requests sent over n32c for handshake procedure.

Condition: When SEPP initiates any handshake procedure requests towards peer SEPP.

Microservice Consumer N32c
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • handshake_procedure
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor
5.1.5.4 ocsepp_cn32c_handshake_response_total

Table 5-35 ocsepp_cn32c_handshake_response_total

Field Details
Metric Details

Total number of responses received over n32c for handshake procedure.

Condition: When SEPP receives any handshake procedure response from peer SEPP. It can be successful or failure based on response code.

Microservice Consumer N32c
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • handshake_procedure
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status (2xx,4xx,5xx)
  • vendor
5.1.5.5 ocsepp_cn32c_handshake_initiation_req_total

Table 5-36 ocsepp_cn32c_handshake_initiation_req_total

Field Details
Metric Details

Total number of Handshake initiation requests received from config-mgr.

Condition: When handshake initiation requests are received from config-mgr.

Microservice Consumer N32c
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor
5.1.5.6 ocsepp_cn32c_handshake_reinitiation_req_total

Table 5-37 ocsepp_cn32c_handshake_reinitiation_req_total

Field Details
Metric Details

Total number of Handshake ReInitiation requests received from config-mgr.

Condition: When handshake Reinitiation requests received from config-mgr.

Microservice Consumer N32C
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor
5.1.5.7 ocsepp_cn32c_handshake_delete_req_total

Table 5-38 ocsepp_cn32c_handshake_delete_req_total

Field Details
Metric Details

Total number of Handshake context delete requests received from config-mgr.

Condition: When handshake context delete requests are received from config-mgr.

Microservice Consumer N32c
Type Counter
Dimensions
  • app(consumer)
  • chart
  • container
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor
5.1.5.8 ocsepp_pn32c_handshake_requests_total

Table 5-39 ocsepp_pn32c_handshake_requests_total

Field Details
Metric Details

Total number of requests received over n32c for handshake procedure.

Condition: When any handshake procedure request is received from peer SEPP.

Microservice Producer N32c
Type Counter
Dimensions
  • app(producer)
  • chart
  • container
  • handshake_procedure
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor
5.1.5.9 ocsepp_pn32c_handshake_response_total

Table 5-40 ocsepp_pn32c_handshake_response_total

Field Details
Metric Details

Total number of responses sent over n32c for handshake procedure.

Condition: When SEPP sends response to handshake procedure received. It can be a success response or failure response based on success code.

Microservice Producer N32c
Type Counter
Dimensions
  • app(producer)
  • chart
  • container
  • handshake_procedure
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status (200 OK,4xx,5xx)
  • vendor

5.1.6 5G SBI Message Mediation Support Metrics

5.1.6.1 ocsepp_cn32f_mediation_requests_total

Table 5-41 ocsepp_cn32f_mediation_requests_total

Metric Details

Metric is common for both CN32F & PN32F.

Separation happens based on "app" tag.

Number of requests in which Trigger Rule Applied at SEPP end for Mediation, based on configuration.

Microservice Consumer N32f Producer N32f
Type Counter
Dimensions
  • peer_fqdn
  • peer_domain
  • plmn_id
  • statusCode
  • direction
  • method
  • NfType
  • NfServiceType
  • vendor
  • nfInstanceId
  • ruleApplied
  • requestType
5.1.6.2 ocsepp_n32f_mediation_not_applied_total

Table 5-42 ocsepp_n32f_mediation_not_applied_total

Field Details
Metric Details

Metric is common for both CN32F and PN32F.

Separation happens based on "app" tag.

Number of requests for which Trigger Rule do not match at SEPP and request is not forwarded to Mediation. For Match all configurations, the trigger points will be matched.

Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • message_type
  • ruleApplied
  • vendor "Oracle"
5.1.6.3 ocsepp_cn32f_mediation_response_total

Table 5-43 ocsepp_cn32f_mediation_response_total

Metric Details Number of requests in which CN32F service of SEPP get Response from Mediation Service.

Microservice Consumer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • message_type
  • ruleApplied
  • http_status
  • vendor "Oracle"
5.1.6.4 ocsepp_cn32f_mediation_response_failure

Table 5-44 ocsepp_cn32f_mediation_response_failure

Field Details
Metric Details Number of requests in which CN32F service of SEPP get Failure Response from Mediation Service.

Microservice Consumer N32f
Type Counter
Dimensions
  • peer_fqdn
  • peer_domain
  • plmn_id
  • statusCode
  • direction
  • error_msg
  • vendor
  • nfInstanceId
  • ruleApplied
  • requestType
  • method
  • request_path
5.1.6.5 ocsepp_pn32f_mediation_response_total

Table 5-45 ocsepp_pn32f_mediation_response_total

Metric Details Number of requests in which PN32F service of SEPP get Response from Mediation Service.

Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • message_type
  • ruleApplied
  • http_status
  • vendor "Oracle"
5.1.6.6 ocsepp_pn32f_mediation_response_failure

Table 5-46 ocsepp_pn32f_mediation_response_failure

Field Details
Metric Details Number of requests in which PN32F service of SEPP get Failure Response from Mediation Service.
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_error_message
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • request_path
  • message_type
  • ruleApplied
  • http_status
  • vendor "Oracle"
5.1.6.7 ocsepp_cn32f_mediation_latency_seconds_count

Table 5-47 ocsepp_cn32f_mediation_latency_seconds_count

Field Details
Metric Details

This metric is used to display the number of ingress requests processed at cn32f in a particular time span (in seconds).

Microservice Consumer N32f
Type Histogram
Dimensions
  • app (Consumer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor "Oracle"
5.1.6.8 ocsepp_cn32f_mediation_latency_seconds_max

Table 5-48 ocsepp_cn32f_mediation_latency_seconds_max

Field Details
Metric Details Total time taken for processing a message (from sending a message to receiving the response).
Microservice Consumer N32f
Type Histogram
Dimensions
  • app (Consumer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor "Oracle"
5.1.6.9 ocsepp_cn32f_mediation_latency_seconds_sum

Table 5-49 ocsepp_cn32f_mediation_latency_seconds_sum

Metric Details Total time taken for processing a message (from sending a message to receiving the response).
Microservice Producer N32f
Type Histogram
Dimensions
  • app (Consumer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor "Oracle"
5.1.6.10 ocsepp_pn32f_mediation_latency_seconds_count

Table 5-50 ocsepp_pn32f_mediation_latency_seconds_count

Metric Details

Total time taken for processing a message (from sending a message to receiving the response).

Microservice Producer N32f
Type Histogram
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor
5.1.6.11 ocsepp_pn32f_mediation_latency_seconds_max

Table 5-51 ocsepp_pn32f_mediation_latency_seconds_max

Metric Details This metrics is used to display the maximum processing time of all the ingress request at pn32f for a particular time.
Microservice Producer N32f
Type Histogram
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor
5.1.6.12 ocsepp_pn32f_mediation_latency_seconds_sum

Table 5-52 ocsepp_pn32f_mediation_latency_seconds_sum

Metric Details This metrics is used to display the average of processing time of all the ingress request at pn32f for a particular time.
Microservice Producer N32f
Type Histogram
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor
5.1.6.13 ocsepp_pn32f_mediation_latency_seconds_sum

Table 5-53 ocsepp_pn32f_mediation_latency_seconds_sum

Field Details
Metric Details Time taken by Mediation Service to process request after getting called from PN32F service.
Microservice PN32f
Type Histogram
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor
5.1.6.14 ocsepp_n32f_mediation_requests_total

Table 5-54 ocsepp_n32f_mediation_requests_total

Field Details
Metric Details

Number of requests in which Trigger Rule Applied at SEPP end for Mediation, based on configuration.

Separation happens based on "app" tag.

Microservice Consumer N32F, Producer N32F
Type Counter
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • sourceRss
  • http_status
  • vendor "Oracle"

5.1.7 Hosted SEPP Metrics

5.1.7.1 ocsepp_allowed_p_rss_routing_failure_total

Table 5-55 ocsepp_allowed_p_rss_routing_failure_total

Metric Details Number of requests failing due to Hosted SEPP failure.
Microservice Consumer N32F
Type Counter
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • http_error_message
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • sourceRss
  • http_status
  • vendor "Oracle"

5.1.8 Message Copy Metrics

5.1.8.1 oc_ingressgateway_msgcopy_requests_total

Table 5-56 oc_ingressgateway_msgcopy_requests_total

Field Details
Metric Details This is incremented whenever request message is sent or acknowledged from Data Director.
Microservice PLMN Ingress gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • app (PLMN Ingress gateway, N32 Ingress Gateway)
  • chart
  • container
  • namespace
  • pod
  • type
5.1.8.2 oc_ingressgateway_msgcopy_responses_total

Table 5-57 oc_ingressgateway_msgcopy_responses_total

Field Details
Metric Details This is incremented whenever response message is sent or acknowledged from DD.
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • app (PLMN Ingress gateway, N32 Ingress Gateway)
  • chart
  • container
  • namespace
  • pod
  • type
5.1.8.3 oc_ingressgateway_dd_unreachable

Table 5-58 oc_ingressgateway_dd_unreachable

Field Details
Metric Details

This indicates whether DD is reachable or not

0 - reachable, 1 - unreachable

Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Gauge
Dimensions
  • app
5.1.8.4 oc_egressgateway_msgcopy_requests_total

Table 5-59 oc_egressgateway_msgcopy_requests_total

Field Details
Metric Details This is incremented whenever request message is sent or acknowledged from DD.
Microservice Plmn Egress Gateway, N32 Egress Gateway
Type Counter
Dimensions
  • app (PLMN Ingress gateway, N32 Ingress Gateway)
  • chart
  • container
  • namespace
  • pod
  • type
5.1.8.5 oc_egressgateway_msgcopy_responses_total

Table 5-60 oc_egressgateway_msgcopy_responses_total

Field Details
Metric Details This is incremented whenever response message is sent or acknowledged from DD.
Microservice Plmn Egress Gateway, N32 Egress Gateway
Type Counter
Dimensions
  • app (PLMN Ingress gateway, N32 Ingress Gateway)
  • chart
  • container
  • namespace
  • pod
  • type
5.1.8.6 oc_egressgateway_dd_unreachable

Table 5-61 oc_egressgateway_dd_unreachable

Field Details
Metric Details

This indicates whether DD is reachable or not

0 - reachable, 1 - unreachable

Microservice Plmn Egress Gateway, N32 Egress Gateway
Type Gauge
Dimensions
  • app

5.1.9 SOR Metrics

5.1.9.1 ocsepp_pn32f_sor_requests_total

Table 5-62 ocsepp_pn32f_sor_requests_total

Metric Details Number of requests sent to SOR
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor "Oracle"
5.1.9.2 ocsepp_pn32f_sor_responses_total

Table 5-63 ocsepp_pn32f_sor_responses_total

Metric Details Number of responses received from SOR
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor "Oracle"
5.1.9.3 ocsepp_pn32f_sor_retry_to_producer_requests_total

Table 5-64 ocsepp_pn32f_sor_retry_to_producer_requests_total

Metric Details Number of requests sent to Producer based on 3gpp header
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor "Oracle"
5.1.9.4 ocsepp_pn32f_sor_back_to_consumer_responses_total

Table 5-65 ocsepp_pn32f_sor_back_to_consumer_responses_total

Metric Details Number of responses received from SoR sent back to Consumer
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor "Oracle"
5.1.9.5 ocsepp_pn32f_sor_failure_total

Table 5-66 ocsepp_pn32f_sor_failure_total

Metric Details Number of 4xx or 5xx responses received from SOR
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor "Oracle"
5.1.9.6 ocsepp_pn32f_sor_timeout_failure_total

Table 5-67 ocsepp_pn32f_sor_timeout_failure_total

Metric Details Number of requests which are request timeout while connecting to SOR
Microservice Producer N32f
Type Counter
Dimensions
  • app (Producer)
  • chart
  • container
  • direction
  • http_method
  • namespace
  • nf_instance_id
  • NfServiceType
  • NfType
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • http_status
  • vendor "Oracle"

5.1.10 Rate Limiting for Ingress Roaming Signaling per Remote SEPP Set Metrics

5.1.10.1 oc_ingressgateway_rss_ratelimit_total

Table 5-68 oc_ingressgateway_rss_ratelimit_total

Metric Details Number of request for which RSS based rate limiting was applied and request was successfully forwarded.
Microservice N32 Ingress Gateway
Type Counter
Dimensions
  • app (PLMN Ingress Gateway, N32 Ingress Gateway)
  • chart
  • container
  • ErrorOriginator
  • nf_instance_id
  • http_method
  • namespace
  • peer_plmn_id
  • pod
  • remote_sepp_set_name
  • Scheme
  • Status
Metric filter Status = accepted
5.1.10.2 oc_ingressgateway_rss_ratelimit_total

Table 5-69 oc_ingressgateway_rss_ratelimit_total

Field Details
Metric Details Number of request for which RSS based rate limiting was not applied.
Microservice N32 Ingress Gateway
Type Counter
Dimensions
  • Method
  • Status
  • Scheme
  • InstanceIdentifier
  • ErrorOriginator
Metric filter Status = ratelimit not applied
5.1.10.3 ocsepp_configmgr_routefailure_total

Table 5-70 ocsepp_configmgr_routefailure_total

Metric Details Metric pegged due to route sync issue in SEPP.
Microservice Config Manager
Type Counter
Dimensions
  • errorCode
  • app
5.1.10.4 oc_ingressgateway_rss_ratelimit_total

Table 5-71 oc_ingressgateway_rss_ratelimit_total

Field Details
Metric Details Number of request for which RSS based rate limiting was applied but request had to be dropped.
Microservice N32 Ingress Gateway
Type Counter
Dimensions
  • app (PLMN Ingress Gateway, N32 Ingress Gateway)
  • chart
  • container
  • ErrorOriginator
  • nf_instance_id
  • http_method
  • namespace
  • peer_plmn_id
  • pod
  • remote_sepp_set_name
  • Scheme
  • Status
Metric filter Status = dropped

5.1.11 Topology Hiding Metrics

5.1.11.1 ocsepp_topology_latency_seconds_count

Table 5-72 ocsepp_topology_latency_seconds_count

Field Details
Metric Details

This metric is used to display the number of ingress requests processed at cn32f and pn32f in a particular time span (in seconds).

Microservice Consumer N32f, Producer N32f
Type Histogram
Dimensions
  • app (consumer, producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor "Oracle"
5.1.11.2 ocsepp_topology_latency_seconds_max

Table 5-73 ocsepp_topology_latency_seconds_max

Field Details
Metric Details This metrics is used to display the maximum processing time of an ingress request at cn32f and pn32f in seconds.
Microservice Consumer N32f, Producer N32f
Type Histogram
Dimensions
  • app (consumer, producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor "Oracle"
5.1.11.3 ocsepp_topology_latency_seconds_sum

Table 5-74 ocsepp_topology_latency_seconds_sum

Field Details
Metric Details This metrics is used to display the average processing time of all the ingress request at cn32f and pn32f for a particular time.
Microservice Consumer N32f, Producer N32f
Type Histogram
Dimensions
  • app (consumer, producer)
  • chart
  • container
  • direction
  • message_type
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • nf_type
  • vendor "Oracle"
5.1.11.4 ocsepp_topology_header_success_total

Table 5-75 ocsepp_topology_header_success_total

Field Details
Metric Details Count of headers for which topology hiding and recovery was successful
Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • app (consumer, producer)
  • chart
  • container
  • header
  • namespace
  • nf_instance_id
  • pod
  • vendor "Oracle"
5.1.11.5 ocsepp_topology_header_failure_total

Table 5-76 ocsepp_topology_header_failure_total

Field Details
Metric Details Count of headers for which topology hiding and recovery failed
Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • app (consumer, producer)
  • chart
  • container
  • header
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor "Oracle"
5.1.11.6 ocsepp_topology_body_success_total

Table 5-77 ocsepp_topology_body_success_total

Field Details
Metric Details

Count of body attributes for which topology hiding and recovery was successful.

Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • vendor "Oracle"
5.1.11.7 ocsepp_topology_body_failure_total

Table 5-78 ocsepp_topology_body_failure_total

Metric Details

Count of body for which topology hiding and recovery failed

Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • request_path
  • app (Consumer, Producer)
  • chart
  • container
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • http_status
  • vendor "Oracle"
5.1.11.8 ocsepp_topology_success_total

Table 5-79 ocsepp_topology_success_total

Metric Details Count of messages for which topology hiding or recovery was successful
Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • app (consumer, producer)
  • chart
  • container
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • sourceRss
  • vendor "Oracle"
5.1.11.9 ocsepp_topology_invalid_header_regex_configured_total

Table 5-80 ocsepp_topology_invalid_header_regex_configured_total

Field Details
Metric Details If configured header regex pattern is invalid, this metric will be pegged.
Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • vendor
  • nfInstanceId
  • error_message
5.1.11.10 ocsepp_topology_header_regex_not_configured_total

Table 5-81 ocsepp_topology_header_regex_not_configured_total

Field Details
Metric Details If header regex pattern is not configured, this metric will be pegged.
Microservice Consumer N32f, Producer N32f
Type Counter
Dimensions
  • vendor
  • nfInstanceId
  • error_message

5.1.12 Cat 0 - SBI Message Schema Validation Metrics

5.1.12.1 ocsepp_message_validation_applied_total

Table 5-82 ocsepp_message_validation_applied_total

Field Details
Metric Details Total number of requests for which message validation is applied.
Microservice Producer N32f, Consumer N32F
Type Counter
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • http_method
  • namespace
  • nf_instance_id
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • vendor "Oracle"

Note:

  • The dimension "peer_plmn_id" is applicable only for Consumer N32F.
  • An additional dimension "sourceRss" is applicable for Consumer N32F in Roaming Hub mode.
.
5.1.12.2 ocsepp_message_validation_on_body_failure_total

Table 5-83 ocsepp_message_validation_on_body_failure_total

Metric Details Number of requests in which message validation failed on body at SEPP end.
Microservice Producer N32f, Consumer N32F
Type Counter
Dimensions
  • app
  • chart
  • container
  • http_method
  • namespace
  • nf_instance_id
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • request_path
  • vendor "Oracle"
  • status_code

Note:

  • The dimension "peer_plmn_id" is applicable only for Consumer N32F.
  • An additional dimension "sourceRss" is applicable for Consumer N32F in Roaming Hub mode.
.
5.1.12.3 ocsepp_message_validation_on_header_failure_total

Table 5-84 ocsepp_message_validation_on_header_failure_total

Metric Details Number of requests for which message validation failed on query parameters at SEPP end.
Microservice Producer N32f, Consumer N32F
Type Counter
Dimensions
  • app
  • chart
  • container
  • http_method
  • namespace
  • nf_instance_id
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • request_path
  • vendor "Oracle"
  • status_code

Note:

  • The dimension "peer_plmn_id" is applicable only for Consumer N32F.
  • An additional dimension "sourceRss" is applicable for Consumer N32F in Roaming Hub mode.
.

5.1.13 Cat 1 - Service API Validation Metrics

5.1.13.1 ocsepp_security_service_api_failure_total

Table 5-85 ocsepp_security_service_api_failure_total

Field Details
Metric Details

Metric are common for both CN32F and PN32F.

Separation happens based on "app" tag.

Number of requests failed as Method and Resource URI were not Allowed

Microservice N32f
Type Counter
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • http_method
  • namespace
  • nf_instance_id
  • peer_fqdn
  • peer_plmn_id
  • pod
  • remote_sepp_name
  • request_path
  • http_status
  • vendor "Oracle"
  • nf_service_type
  • nf_type

5.1.14 Cat 2 - Network ID Validation Metrics

5.1.14.1 ocsepp_network_id_validation_body_failure_total

Table 5-86 ocsepp_network_id_validation_body_failure_total

Metric Details Number of request for which Network ID body validation feature were failed.
Microservice Producer N32f, Consumer N32F
Type Counter
Dimensions
  • app (Consumer, Producer)
  • cause
  • chart
  • container
  • error_action
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • plmn_identifier
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • request_path
  • sepp_type
  • vendor "Oracle"
5.1.14.2 ocsepp_network_id_validation_header_failure_total

Table 5-87 ocsepp_network_id_validation_header_failure_total

Metric Details Number of request for which Network ID header validation feature were failed.
Microservice Producer N32f, Consumer N32F
Type Counter
Dimensions
  • app (Consumer, Producer)
  • cause
  • chart
  • container
  • error_action
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • plmn_identifier
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • request_path
  • sepp_type
  • vendor "Oracle"

5.1.15 Cat 3 - Previous Location Check Metrics

5.1.15.1 ocsepp_previous_location_exception_failure_total

Table 5-88 ocsepp_previous_location_exception_failure_total

Metric Details Number of requests, for which previous location validation check failed due to exceptions.
Microservice Producer N32f
Type Counter
Dimensions
  • app (Consumer, Producer)
  • cause
  • chart
  • container
  • error_action
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • request_path
  • vendor "Oracle"
5.1.15.2 ocsepp_previous_location_validation_success_total

Table 5-89 ocsepp_previous_location_validation_success_total

Field Details
Metric Details Number of requests, for which previous location validation feature is successful.
Microservice Producer N32f
Type Counter
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • vendor "Oracle"
5.1.15.3 ocsepp_previous_location_validation_failure_total

Table 5-90 ocsepp_previous_location_validation_failure_total

Metric Details Number of requests, for which previous location validation check failed.
Microservice Producer N32f
Type Counter
Dimensions
  • app (Consumer, Producer)
  • cause
  • chart
  • container
  • error_action
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • request_path
  • vendor "Oracle"
5.1.15.4 ocsepp_previous_location_validation_requests_total

Table 5-91 ocsepp_previous_location_validation_requests_total

Field Details
Metric Details Number of requests, for which previous location validation feature is applied.
Microservice Producer N32f
Type Counter
Dimensions
  • app (Consumer, Producer)
  • chart
  • container
  • http_method
  • namespace
  • nf_instance_id
  • peer_domain
  • peer_fqdn
  • pod
  • remote_sepp_name
  • remote_sepp_set_name
  • vendor "Oracle"
5.1.15.5 ocsepp_pn32f_notification_total

Table 5-92 ocsepp_pn32f_notification_total

Field Details
Metric Details It is pegged every time the notification is received on pn32f from NRF for UDR profile change.
Microservice Producer N32f
Type Counter
Dimensions NA

5.1.16 Rate Limiting for Egress Roaming Signaling per PLMN Metrics

5.1.16.1 oc_ingressgateway_plmn_egress_ratelimit_total

Table 5-93 oc_ingressgateway_plmn_egress_ratelimit_total

Metric Details Number of requests for which Egress Rate Limiting was applied and request was successfully forwarded because tokens were available for the Egress Rate Limit List.
Microservice Ingress Gateway
Type Counter
Dimensions
  • Status
  • PLMN
  • Scheme
  • Egress Rate Limit List
  • InstanceIdentifier
  • ErrorOriginator
Status ERL_MATCH_TOKEN_AVAILABLE_FWD
5.1.16.2 oc_ingressgateway_plmn_egress_ratelimit_total

Table 5-94 oc_ingressgateway_plmn_egress_ratelimit_total

Metric Details Number of requests for which Egress Rate Limiting was applied, here tokens were not available to process the request, request was rejected as its priority was low (above than the configured cutoff).
Microservice Ingress Gateway
Type Counter
Dimensions
  • Status
  • PLMN
  • Scheme
  • Egress Rate Limit List
  • InstanceIdentifier
  • ErrorOriginator
Status ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT
5.1.16.3 oc_ingressgateway_plmn_egress_ratelimit_total

Table 5-95 oc_ingressgateway_plmn_egress_ratelimit_total

Metric Details

The number of requests for which egress rate limiting was applied, here tokens were not available to process the request, but the request was forwarded as its priority was high (less than the configured cutoff).

Microservice Ingress Gateway
Type Counter
Dimensions
  • Status
  • PLMN
  • Scheme
  • Egress Rate Limit List
  • InstanceIdentifier
  • ErrorOriginator
Status ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD
5.1.16.4 oc_ingressgateway_plmn_egress_ratelimit_total

Table 5-96 oc_ingressgateway_plmn_egress_ratelimit_total

Metric Details Number of requests for which rate limiting could not be applied as invalid PLMN ID was sent in the request. The request was forwarded.
Microservice Ingress Gateway
Type Counter
Dimensions
  • Status
  • PLMN
  • Scheme
  • Egress Rate Limit List
  • InstanceIdentifier
  • ErrorOriginator
Status ERROR_UNABLE_TO_EXTRACT_PLMN_FWD
5.1.16.5 oc_ingressgateway_plmn_egress_ratelimit_total

Table 5-97 oc_ingressgateway_plmn_egress_ratelimit_total

Metric Details Number of requests for which rate limiting could not be applied as none of the Egress Rate Limit List contains the corresponding PLMN ID. The request was forwarded.
Microservice Ingress Gateway
Type Counter
Dimensions
  • Status
  • PLMN
  • Scheme
  • Egress Rate Limit List
  • InstanceIdentifier
  • ErrorOriginator
Status ERL_NO_MATCH_FWD
5.1.16.6 oc_ingressgateway_plmn_egress_ratelimit_total

Table 5-98 oc_ingressgateway_plmn_egress_ratelimit_total

Metric Details Number of requests for which rate limiting could not be applied as some unexpected exception was raised during the execution of the rate limit filter for the request. The request was forwarded.
Microservice Ingress Gateway
Type Counter
Dimensions
  • Status
  • PLMN
  • Scheme
  • Egress Rate Limit List
  • InstanceIdentifier
  • ErrorOriginator
Status ERROR_INTERNAL_FWD

5.1.17 Config Manager Metrics

5.1.17.1 ocsepp_configmgr_rpp_config_failure_total

Table 5-99 ocsepp_configmgr_rpp_config_failure_total

Field Details
Metric Details This metrics is pegged whenever there is a Remote SEPP configuration failure.
Microservice Config Manager
Type Counter
Dimensions
  • app
  • http_error_message
  • http_method
  • http_status
  • vendor
  • chart
  • pod
  • namespace
  • container
5.1.17.2 ocsepp_configmgr_rpp_validation_failure_total

Table 5-100 ocsepp_configmgr_rpp_validation_failure_total

Field Details
Metric Details This metric is pegged whenever there is a change in mandatory parameter of Remote SEPP Profile or mandatory parameter is missing.
Microservice Config Manager
Type Counter
Dimensions
  • app
  • http_error_message
  • http_status
  • vendor
  • chart
  • pod
  • namespace
  • container
5.1.17.3 ocsepp_configmgr_routeupdate_total

Table 5-101 ocsepp_configmgr_routeupdate_total

Field Details
Metric Details This metric shows the total number of times config-mgr microservice has tried to update the route details for Egress Gateway microservices.
Microservice Config Manager
Type Counter
Dimensions
  • app
  • vendor
  • chart
  • pod
  • namespace
  • container

5.1.18 Support for TLS 1.3 Metrics

5.1.18.1 oc_ingressgateway_incoming_tls_connections

Table 5-102 oc_ingressgateway_incoming_tls_connections

Field Details
Metric Details Number of TLS connections received on the Ingress Gateway and their negotiated versions. The version can be TLS 1.2 or TLS 1.3.
Microservice

PLMN Ingress Gateway

N32 Ingress Gateway

Type Gauge
Dimensions
  • NegotiatedTLSVersion
  • Host
  • Direction
  • InstanceIdentifier
5.1.18.2 oc_egressgateway_outgoing_tls_connections

Table 5-103 oc_egressgateway_outgoing_tls_connections

Field Details
Metric Details Number of TLS connections sent on the Egress Gateway and their negotiated versions. The version can be TLS 1.2 or TLS 1.3.
Microservice

PLMN Egress Gateway

N32 Egress Gateway

Type Gauge
Dimensions
  • NegotiatedTLSVersion
  • Host
  • Direction
  • InstanceIdentifier
5.1.18.3 security_cert_x509_expiration_seconds

Table 5-104 security_cert_x509_expiration_seconds

Metric Details Time to certificate expiry in epoch seconds.
Microservice

PLMN Ingress Gateway

N32 Ingress Gateway

PLMN Egress Gateway

N32 Egress Gateway

Type Histogram
Dimensions
  • app
  • chart
  • endpoint
  • container
  • namespace
  • pod
  • serialNumber
  • subject
  • CN (CommonName)
  • O (Organization)
  • L (Locality)
  • S (State or ProvinceName)
  • C (CountryName)

5.1.19 Ingress and Egress Gateway Metrics

Table 5-105 Ingress and Egress Gateway Dimensions

Dimension Details
NFType Name of the NF Type in path.

For Eg: Path is /nxxx-yyy/vz/.......

Where XXX(Upper Case) is NFType

UNKNOWN if unable to extract NFType from the path

NFServiceType Name of the Service with in the NF.

Example: Path is /nxxx-yyy/vz/.......

Where nxxx-yyy is NFServiceType

UNKNOWN if unable to extract NFServiceType from the path

receivedResponseCode (Pod readiness state metric) receivedResponseCode (Pod readiness state metric)
id (Pod readiness state metric) Servivce profile Id of the backend svc
uri (Pod readiness state metric) Service profile Uri of the backend svc
event This tag captures the lifecycle event processed during the jetty request processing with the back-end svc
Host (Ip or fqdn) : port of ingress gateway
DestinationHost Destination ip/fqdn
client_type client_type
HttpVersion Http protocol version
oc_ingressgateway_pod_resource_stress_Type The type of resource for which the pod protection threshold has reached.e.g. CPU, MEMORY, PENDING_REQUEST
XfccHeaderPresent XfccHeaderPresent
consumerNfType consumerNfType
Scheme Http protocol scheme
Path Path predicate that matched the current request
ClientCertIdentity Cerificate Identity of the client
content_available content_available
Route_Path Path predicate/Header predicate that matched the current request
InstanceIdentifier Prefix of the pod configured in helm when there are multiple instances in same deployment
jetty_request_timeout Tag to capture if a request at IGW failed due to request timeout at jetty level
Virtual Host The fqdn which requires alternate route svc resolution
error_reason Reason for failure response received. If message is sent in the response, then it is filled with the message otherwise exception class is filled. In case of successful response it is filled with "no-error"
ErrorOriginator This tag captures the ErrorOriginator
quantile This tag captures the latency values with ranges as 10ms, 20ms, 40ms, 80ms, 100ms, 200ms, 500ms, 1000ms and 5000ms
oc_ingressgateway_xfcc_header_validate_ServiceType Name of the Service with in the NF.
oc_ingressgateway_dns_resolution_Status oc_ingressgateway_dns_resolution_Status

oc_ingressgateway_global_ratelimit_Status

Request accepted or dropped
oc_ingressgateway_global_ratelimit_total_app Application at which traffic rejection occurs - n32-ingress-gateway or plmn-ingress-gateway
oc_ingressgateway_global_ratelimit_total_Method Request method received ( POST , PUT , GET , PATCH , DELETE)
oc_ingressgateway_connection_failure_Host destination ip/fqdn
oc_ingressgateway_connection_failure_Port destination port
oc_ingressgateway_connection_failure_Direction This tag determines the direction in which there is connection failure at IGW
oc_ingressgateway_xfcc_header_validate_Status oc_ingressgateway_xfcc_header_validate_Status
oc_ingressgateway_xfcc_header_validate_Cause This tag determines the validation cause for the xfcc header validation metric being pegged
oc_ingressgateway_incoming_pod_connections_rejected_Direction The incoming connections rejected at IGW handled in pod protection
oc_ingressgateway_xfcc_header_validate_CertsCompared This tag captures the total number of certificates compared in XFCC header at IGW during the header validation
oc_configclient_request_total_releaseVersion This tag indicates the current release version of ingress gateway
oc_configclient_request_total_configVersion This tag indicates the configuration version that ingress gateway is currently maintaining
oc_configclient_response_total_releaseVersion This tag indicates the configuration version that ingress gateway is currently maintaining
oc_configclient_response_total_updated This tag indicates whether the configuration was updated or not
oc_ingressgateway_incoming_connections_Direction This tag indicates the direction of connection established i.e, whether it is incoming or outgoing
oc_ingressgateway_incoming_connections_Host This tag indicates the remote address of client connected to ingress gateway
oc_ingressgateway_outgoing_connections_Direction This tag indicates the direction of connection established i.e, whether it is incoming or outgoing
oc_ingressgateway_going_connections_Host This tag indicates the address of destination
Proxy Value received for "x-custom-egress-proxy-header".
ConnectedHostIp This tag captures the IP of destination host to which EGW sends ping requests
ConnectedHostFqdn This tag captures the fqdn of destination host to which EGW sends ping requests
ConnectedHostPort This tag captures the port of destination host to which EGW sends ping requests
oc_egressgateway_connection_failure_Host destination ip/fqdn
oc_egressgateway_connection_failure_Port destination port
oc_egressgateway_incoming_connections_Direction This tag indicates the direction of connection established i.e, whether it is incoming or outgoing
oc_egressgateway_incoming_connections_Host This tag indicates the remote address of client connected to ingress gateway
oc_egressgateway_outgoing_connections_Direction This tag indicates the direction of connection established i.e, whether it is incoming or outgoing
oc_egressgateway_outgoing_connections_Host This tag indicates address of destination
EndpointName Request sent for
Reroute_Path

Path that matched the request to over corresponding route

Example : /nef/**

Attempt

Attempt number for scp re-route.

Example : 1 , 2 etc.,

5.1.19.1 Ingress Gateway Metrics

This section provides information about the Ingress Gateway metrics used in SEPP.

5.1.19.1.1 oc_ingressgateway_http_requests_total

Table 5-106 oc_ingressgateway_http_requests_total

Metric Details This metric will be pegged as soon as the request reaches the Ingress gateway in the first custom filter of the application.
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • Method
  • NFType
  • NFServiceType
  • Host
  • HttpVersion
  • Scheme
  • Route_path
  • InstanceIdentifier
  • ClientCertIdentity app
5.1.19.1.2 oc_ingressgateway_http_responses_total

Table 5-107 oc_ingressgateway_http_responses_total

Field Details
Metric Details This metric will be pegged in the last custom filter of the Ingress gateway while the response is being sent back to the consumer NF.
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • Status
  • Method
  • NFType
  • NFServiceType
  • Host
  • HttpVersion
  • Scheme
  • Route_path
  • InstanceIdentifier
  • ClientCertIdentity
5.1.19.1.3 oc_ingressgateway_request_latency_seconds

Table 5-108 oc_ingressgateway_request_latency_seconds

Field Details
Metric Details This metric will be pegged in the last custom filter of the Ingress gateway while the response is being sent back to the consumer NF. This metric tracks the amount of time taken for processing the request. It starts as soon the request reaches the first custom filter of the application and lasts till the response is sent back to the consumer NF from the last custom filter of the application.
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Histogram
Dimensions
  • quantile
  • InstanceIdentifier
  • Route_path
  • Method
5.1.19.1.4 oc_ingressgateway_request_latency_seconds_count

Table 5-109 oc_ingressgateway_request_latency_seconds_count

Field Details
Metric Details This metric is used to display the number of ingress requests processed in a particular time span (in seconds).
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Histogram
Dimensions
  • InstanceIdentifier
  • Route_path
  • Method
5.1.19.1.5 oc_ingressgateway_request_latency_seconds_sum

Table 5-110 oc_ingressgateway_request_latency_seconds_sum

Field Details
Metric Details This metrics is used to display the average of processing time of all the ingress request for a particular time.
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Histogram
Dimensions
  • InstanceIdentifier
  • Route_path
  • Method
5.1.19.1.6 oc_configclient_request_total

Table 5-111 oc_configclient_request_total

Metric Details This metric will be pegged whenever config client is polling for configuration update from common configuration server
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • app (Plmn Ingress Gateway, N32 Ingress Gateway)
  • application
  • chart
  • configVersion
  • container
  • engVersion
  • microservice
  • mktgVersion
  • namespace
  • pod
  • releaseVersion
  • security_istio_io_tlsMode
  • service_istio_io_canonical_name
  • service_istio_io_canonical_revision
  • vendor "Oracle"
5.1.19.1.7 oc_configclient_response_total

Table 5-112 oc_configclient_response_total

Metric Details This metrics will be pegged whenever config client receives response from common configuration server
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • app (Plmn Ingress Gateway, N32 Ingress Gateway)
  • application
  • chart
  • configVersion
  • container
  • engVersion
  • microservice
  • mktgVersion
  • namespace
  • pod
  • releaseVersion
  • security_istio_io_tlsMode
  • service_istio_io_canonical_name
  • service_istio_io_canonical_revision
  • vendor "Oracle"
5.1.19.1.8 oc_configserver_reachability

Table 5-113 oc_configserver_reachability

Metric Details Gauge metric to peg the reachability of config server
Microservice Plmn Ingress Gateway, N32 Ingress Gateway
Type Gauge
Dimensions
  • app (Plmn Ingress Gateway, N32 Ingress Gateway)
  • application
  • chart
  • container
  • endpoint
  • engVersion
  • microservice
  • mktgVersion
  • namespace
  • pod
  • releaseVersion
  • security_istio_io_tlsMode
  • service_istio_io_canonical_name
  • service_istio_io_canonical_revision
  • vendor "Oracle"
5.1.19.1.9 oc_ingressgateway_incoming_connections

Table 5-114 oc_ingressgateway_incoming_connections

Field Details
Metric Details Gauge metric that will peg active incoming connections from client to ingress gateway
Microservice PLMN Ingress Gateway, N32 Ingress Gateway
Type Gauge
Dimensions
  • host
  • direction
  • instanceIdentifier
5.1.19.1.10 oc_ingressgateway_outgoing_connections

Table 5-115 oc_ingressgateway_outgoing_connections

Field Details
Metric Details Gauge metric that will peg active outgoing connections from ingress gateway to destination
Microservice PLMN Ingress Gateway, N32 Ingress Gateway
Type Gauge
Dimensions
  • host
  • direction
  • instanceIdentifier
5.1.19.1.11 oc_ingressgateway_connection_failure_total

Table 5-116 oc_ingressgateway_connection_failure_total

Field Details
Metric Details

This metric will be pegged in the customized Jetty Client as soon as it fails to connect to the destination service with direction as ingressOut. Here in case of Ingress gateway, the destination service will be a backend microservice of the NF.

And TLS connection failure metrics when connecting to ingress with direction as ingress.

Microservice PLMN Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • host
  • port
  • direction
  • instanceIdentifier
  • errorReason
  • errorOriginator
5.1.19.1.12 oc_ingressgateway_global_ratelimit_total

Table 5-117 oc_ingressgateway_global_ratelimit_total

Field Details
Metric Details This metric will be pegged in the custom filter implemented to check the global rate limit conditions.
Microservice PLMN Ingress Gateway, N32 Ingress Gateway
Type Counter
Dimensions
  • Method
  • Route_path
  • Scheme
  • InstanceIdentifier
  • Status
5.1.19.1.13 oc_ingressgateway_request_content_metrics_total

Table 5-118 oc_ingressgateway_request_content_metrics_total

Field Details
Metric Details This metric will be pegged by default filter RequestContentMetrics. It pegs whether request has request body or not.
Microservice PLMN Egress Gateway, N32 Egress Gateway
Type Counter
Dimensions
  • method
  • content_available
  • InstanceIden tifier
5.1.19.1.14 oc_ingressgateway_request_processing_latency_seconds

Table 5-119 oc_ingressgateway_request_processing_latency_seconds

Field Details
Metric Details This metric will be pegged in the last custom filter of the Ingress gateway while the response is being sent back to the consumer NF. This metric captures the amount of time taken for processing of the request only within Ingress gateway. It starts as soon the request reaches the first custom filter of the application and lasts till the request is forwarded to the destination.
Microservice PLMN Egress Gateway, N32 Egress Gateway
Type Histogram
Dimensions
  • quantile
  • InstanceIdentifier
  • Route_path
  • Method
5.1.19.1.15 oc_ingressgateway_route_overloadcontrol_total

Table 5-120 oc_ingressgateway_route_overloadcontrol_total

Metric Details When overload is enabled , this metric is pegged for every incoming request and describes whether request is accepted or discarded.

Microservice N32 Ingress Gateway
Type Counter
Dimensions
  • namespace
  • app
  • DiscardAction
  • Status
5.1.19.2 Egress Gateway Metrics

This section provides information about the Egress Gateway metrics used in SEPP.

5.1.19.2.1 oc_egressgateway_http_requests_total

Table 5-121 oc_egressgateway_http_requests_total

Field Details
Metric Details This metric will be pegged as soon as the request reaches the Egress gateway in the first custom filter of the application with direction as egress. This will also be pegged when the request goes out of egress in Jetty Request Listener with direction as egressOut.
Microservice Plmn Egress GatewayN32 Egress Gateway
Type Counter
Dimensions
  • Method
  • NFType
  • NFServiceType
  • Host
  • HttpVersion
  • Scheme
  • Proxy
  • InstanceIdentifier
  • Direction
5.1.19.2.2 oc_egressgateway_http_responses_total

Table 5-122 oc_egressgateway_http_responses_total

Metric Details This metric will be pegged in the last custom filter of the Egress gateway while the response is being sent back to backend NF microservice with direction as egress.

This will also be pegged when the response is fetched in Jetty responseListener with direction as egressOut.

BlacklistedFqdn tag will be filled with BlacklistedFqdn when request is sent with blacklisted producer
Microservice Plmn Egress GatewayN32 Egress Gateway
Type Counter
Dimensions
  • Status
  • Method
  • NFType
  • NFServiceType
  • Host
  • HttpVersion
  • Scheme
  • Proxy
  • InstanceIdentifier
  • Direction
  • BlacklistedFqdn
5.1.19.2.3 oc_egressgateway_incoming_connections

Table 5-123 oc_egressgateway_incoming_connections

Field Details
Metric Details Gauge metric that will peg active incoming connections from client to egress gateway
Microservice PLMN Egress Gateway, N32 Egress Gateway
Type Gauge
Dimensions
  • host
  • direction
  • instanceIdentifier
5.1.19.2.4 oc_egressgateway_outgoing_connections

Table 5-124 oc_egressgateway_outgoing_connections

Field Details
Metric Details Gauge metric that will peg active outgoing connections from egress gateway to destination
Microservice PLMN Egress Gateway, N32 Egress Gateway
Type Gauge
Dimensions
  • host
  • direction
  • instanceIdentifier
5.1.19.2.5 oc_egressgateway_connection_failure_total

Table 5-125 oc_egressgateway_connection_failure_total

Field Details
Metric Details This metric will be pegged by jetty client when the destination is not reachable by egress gateway. Here the destination is producer NF.
Microservice PLMN Egress Gateway, N32 Egress Gateway
Type Counter
Dimensions
  • host
  • port
  • direction
  • instanceIdentifier
  • errorReason
  • errorOriginator
5.1.19.2.6 oc_egressgateway_sbiRouting_http_requests_total

Table 5-126 oc_egressgateway_sbiRouting_http_requests_total

Field Details
Metric Details This metric is pegged in the SBIRoutingFilter only when SBIRouting feature is enabled for a route to which request is sent to EGW.
Microservice Plmn Egress GatewayN32 Egress Gateway
Type Counter
Dimensions
  • Sbi_Fqdn
  • Reroute_Path
  • Response_Code (This would be populated as blank for requests)
  • Attempt
  • HttpVersion
  • Scheme
  • InstanceIdentifier
5.1.19.2.7 oc_egressgateway_sbiRouting_http_responses_total

Table 5-127 oc_egressgateway_sbiRouting_http_responses_total

Field Details
Metric Details This metric will be pegged in the SBIRoutingFilter only when SBI Routing feature is enabled for a route to which request is sent to EGW and when sbiRerouteEnabled is set to true and reroute mechanism is executed.
Microservice PLMN Egress Gateway, N32 Egress Gateway
Type Counter
Dimensions
  • Sbi_Fqdn
  • Reroute_Path
  • Status
  • Attempt
  • HttpVersion
  • Scheme
  • InstanceIdentifier
  • ErrorOriginator
5.1.19.2.8 oc_egressgateway_server_latency_seconds

Table 5-128 oc_egressgateway_server_latency_seconds

Field Details
Metric Details This metric will be pegged in Jetty response listener that captures the amount of time taken for processing of the request by jetty client.
Microservice PLMN Egress Gateway, N32 Egress Gateway
Type Histogram
Dimensions
  • quantile
  • InstanceIdentifier
  • Method
5.1.19.2.9 oc_fqdn_alternate_route_total

Table 5-129 oc_fqdn_alternate_route_total

Field Details
Metric Details

Tracks the number of registration, deregistration and GET calls received for a given scheme and FQDN.

Note: Registration does not reflect active registration numbers. It captured number of registration requests received.

Microservice Egress Gateway
Type Counter
Dimensions

type: Register/Deregister/GET

binding_value: <scheme>+<FQDN>

5.1.19.2.10 oc_dns_srv_lookup_total

Table 5-130 oc_dns_srv_lookup_total

Field Details
Metric Details

Track the number of times the DNS SRV lookup was done for a given scheme and FQDN.

Microservice Egress Gateway
Type Counter
Dimensions

binding_value: <scheme>+<FQDN>

5.1.19.2.11 oc_alternate_route_resultset

Table 5-131 oc_alternate_route_resultset

Field Details
Metric Details

Provides number of alternate routes known for a given scheme and FQDN. Whenever DNS SRV lookup or static configuration is done, this metric provides number of known alternate route for a given pair. For example, <"http", "abc.oracle.com">: 2.

Microservice Egress Gateway
Type Gauge
Dimensions

binding_value: <scheme>+<FQDN>

5.1.19.2.12 oc_configclient_request_total

Table 5-132 oc_configclient_request_total

Field Details
Metric Details

This metric is pegged whenever a polling request is made from config client to the server for configuration updates.

Microservice Egress Gateway
Type Counter
Dimensions
Tags: releaseVersion, configVersion.
  • releaseVersion tag indicates the current chart version of alternate route service deployed.
  • configVersion tag indicates the current configuration version of alternate route service.
5.1.19.2.13 oc_configclient_response_total

Table 5-133 oc_configclient_response_total

Field Details
Metric Details

This metric is pegged whenever a response is received from the server to client.

Microservice Egress Gateway
Type Counter
Dimensions

Tags: releaseVersion, configVersion, updated.

  • releaseVersion tag indicates the current chart version of alternate route service deployed.
  • configVersion tag indicates the current configuration version of alternate route service.
  • updated tag indicates whether there is a configuration update or not.

5.2 SEPP KPIs

This section provides information about the SEPP KPIs.

5.2.1 N32C Handshake Procedure KPIs

5.2.1.1 cn32c Handshake Requests Per Remote SEPP

Table 5-134 cn32c Handshake Requests Per Remote SEPP

KPI Detail Measures the cn32c handshake requests per remote SEPP.
Metric Used for KPI

"sum(ocsepp_cn32c_handshake_requests_total{namespace=~\"$Namespace\"})by(peer_domain, peer_fqdn, peer_plmn_id , remote_sepp_name)"

Service Operation n32c Handshake Request
Response Code NA
5.2.1.2 cn32c Handshake Success Rate

Table 5-135 cn32c Handshake Success Rate

KPI Detail Measures the cn32c handshake success rate.
Metric Used for KPI

(sum(ocsepp_cn32c_handshake_response_total{namespace=~"$Namespace",responseCode="200 OK"})/sum(ocsepp_cn32c_handshake_requests_total{namespace=~"$Namespace"}))*100

Service Operation n32c handshake success rate
Response Code 200 OK
5.2.1.3 cn32c Handshake Response Per Remote SEPP

Table 5-136 cn32c Handshake Response Per Remote SEPP

KPI Detail Measures the cn32c handshake response per remote SEPP.
Metric Used for KPI

"sum(ocsepp_cn32c_handshake_response_total{namespace=~\"$Namespace\"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name)"

Service Operation n32c Handshake
Response Code All
5.2.1.4 cn32c Handshake Failure Per Remote SEPP

Table 5-137 cn32c Handshake Failure Per Remote SEPP

KPI Detail Measures the cn32c handshake failure per remote SEPP
Metric Used for KPI "sum(ocsepp_n32c_handshake_failure_attempts_total{namespace=~\"$Namespace\",app=\"cn32c-svc\"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name)"
Service Operation n32c Handshake
Response Code 4xx and 5xx
5.2.1.5 pn32c Handshake Requests Total Per Remote SEPP

Table 5-138 pn32c Handshake Requests Total Per Remote SEPP

KPI Detail Measures the pn32c handshake requests total per remote SEPP
Metric Used for KPI "sum(ocsepp_pn32c_handshake_requests_total{namespace=~\"$Namespace\"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name)"
Service Operation n32c Handshake
Response Code All
5.2.1.6 pn32c Handshake Response Total Per Remote SEPP

Table 5-139 pn32c Handshake Response Total Per Remote SEPP

KPI Detail Measures the pn32c handshake response total per remote SEPP
Metric Used for KPI "sum(ocsepp_pn32c_handshake_response_total{namespace=~\"$Namespace\"})by(peer_domain, peer_fqdn, peer_plmn_id, remote_sepp_name)"
Service Operation n32c Handshake
Response Code All
5.2.1.7 pn32c Handshake Success rate

Table 5-140 pn32c Handshake Success rate

KPI Detail Measures the pn32c handshake success rate.
Metric Used for KPI

(sum(ocsepp_pn32c_handshake_response_total{namespace=~"$Namespace",responseCode="200 OK"})/sum(ocsepp_pn32c_handshake_requests_total{namespace=~"$Namespace"}))*100

Service Operation n32c Handshake
Response Code 200
5.2.1.8 pn32c Handshake Failure Per Remote SEPP

Table 5-141 pn32c Handshake Failure Per Remote SEPP

KPI Detail Measures the pn32c handshake failure total
Metric Used for KPI

sum(ocsepp_n32c_handshake_failure_attempts_total{namespace=~"$Namespace",app="pn32c-svc"})by(peer_domain, peer_fqdn, peer_plmn_id)

Service Operation n32c Handshake
Response Code 4xx and 5xx

5.2.2 SEPP Common KPIs

5.2.2.1 Memory Usage per POD

Table 5-142 Memory Usage per POD

KPI Detail Measures the memory usage per POD
Metric Used for KPI

sum(container_memory_usage_bytes{namespace=~"$Namespace",image!=""}/(1024*1024*1024)) by (pod)

Service Operation NA
Response Code NA
5.2.2.2 CPU Usage per POD

Table 5-143 CPU Usage per POD

KPI Detail Measures the CPU usage per POD
Metric Used for KPI sum(rate(container_cpu_usage_seconds_total{namespace=~"$Namespace",image!=""}[2m])) by (pod) * 1000

Service Operation N/A
Response Code N/A
5.2.2.3 Total Ingress gateway requests

Table 5-144 Total Ingress gateway requests

Field Details
KPI Detail Measures the total Ingress gateway requests
Metric Used for KPI sum((oc_ingressgateway_http_requests_total{namespace=~"$Namespace"}))by(app,HttpVersion,Direction)
5.2.2.4 Total Egress gateway requests

Table 5-145 Total Egress gateway requests

Field Details
KPI Detail Measures the total egress gateway requests
Metric Used for KPI sum((oc_egressgateway_http_requests_total{namespace=~"$Namespace"}))by(app,HttpVersion,Direction)
5.2.2.5 Total Ingress gateway responses

Table 5-146 Total Ingress gateway responses

Field Details
KPI Detail Measures the total Ingress gateway responses
Metric Used for KPI sum((oc_ingressgateway_http_responses_total{namespace=~"$Namespace"}))by(app,HttpVersion,Direction)
5.2.2.6 Total Egress gateway responses

Table 5-147 Total Egress gateway responses

Field Details
KPI Detail Measures the total Egress gateway responses
Metric Used for KPI sum((oc_egressgateway_http_responses_total{namespace=~"$Namespace"}))by(app,HttpVersion,Direction)
5.2.2.7 IGW Processing Time (ms)

Table 5-148 IGW Processing Time (ms)

Field Details
KPI Detail Measures the IGW Processing Time
Metric Used for KPI sum(irate(oc_ingressgateway_request_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(Method,app) /sum(irate(oc_ingressgateway_request_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(Method,app)
5.2.2.8 PercentageDiscard

Table 5-149 PercentageDiscard

KPI Detail Measures the number of Discard requests for Percentage based scheme
Metric Used for KPI

oc_ingressgateway_route_overloadcontrol_total

sum(irate(oc_ingressgateway_route_overloadcontrol_total{DiscardAction="PercentageBased",Status="DISCARDED",namespace=$NAMESPACE}[2m]))

5.2.2.9 PriorityDiscard

Table 5-150 PriorityDiscard

KPI Detail Measures the number of Discard requests for Priority based scheme
Metric Used for KPI

oc_ingressgateway_route_overloadcontrol_total

sum(irate(oc_ingressgateway_route_overloadcontrol_total{DiscardAction="PriorityBased",Status="DISCARDED",namespace=NAMESPACE}[2m]))

.

5.2.3 CN32F Common KPIs

5.2.3.1 cn32f Routing Success Rate

Table 5-151 cn32f Routing Success Rate

KPI Detail Measures the cn32f routing success rate.
Metric Used for KPI

(sum(ocsepp_cn32f_response_total{namespace=~"$Namespace"})/sum(ocsepp_cn32f_requests_total{namespace=~"$Namespace"}))*100

Service Operation n32f message forward
Response Code All
5.2.3.2 Total cn32f Requests

Table 5-152 Total cn32f Requests

KPI Detail Measures the cn32f requests rate per remote SEPP.
Metric Used for KPI

sum((ocsepp_cn32f_requests_total{namespace=~"$Namespace", direction="egress"}))by(PEER_DOMAIN, PEER_FQDN, PLMN_ID)

Service Operation n32f message forward
Response Code All
5.2.3.3 cn32f Processing Time (ms)

Table 5-153 cn32f Processing Time (ms)

KPI Detail Measures the cn32f processing time (ms)
Metric Used for KPI sum(irate(ocsepp_cn32f_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_cn32f_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name)
Service Operation n32f message forward
Response Code All
5.2.3.4 Total cn32f Responses

Table 5-154 Total cn32f Responses

KPI Detail Measures the cn32f response rate per remote SEPP
Metric Used for KPI sum((ocsepp_cn32f_response_total{namespace=~"$Namespace", direction="egress"})) by(PEER_DOMAIN, PEER_FQDN, PLMN_ID)

Service Operation n32f message forward
Response Code All
5.2.3.5 cn32f Failures

Table 5-155 cn32f Failures

KPI Detail Measures the total cn32f request failures.
Metric Used for KPI sum(ocsepp_cn32f_requests_failure_total{namespace=~"$Namespace"}) by (PEER_DOMAIN, PEER_FQDN, PLMN_ID, statusCode)
Service Operation n32f message forward
Response Code 5xxx

5.2.4 PN32F Common KPIs

5.2.4.1 Total pn32f Requests

Table 5-156 Total pn32f Requests

KPI Detail Measures the total pn32f requests
Metric Used for KPI sum((ocsepp_pn32f_requests_total{namespace=~"$Namespace", direction="egress"}))by(PEER_DOMAIN, PEER_FQDN, PLMN_ID)
Service Operation n32f message forward
Response Code All
5.2.4.2 Total pn32f Responses

Table 5-157 Total pn32f Responses

KPI Detail Measures the pn32f response rate per remote SEPP.
Metric Used for KPI sum((ocsepp_pn32f_responses_total{namespace=~"$Namespace", direction="egress"})) by(PEER_DOMAIN, PEER_FQDN, PLMN_ID)
Service Operation n32f message forward
Response Code All
5.2.4.3 pn32f Processing Time (ms)

Table 5-158 pn32f Processing Time (ms)

KPI Detail Measures the pn32f processing time in milli seconds
Metric Used for KPI

sum(irate(ocsepp_pn32f_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_pn32f_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name)

Service Operation n32f message forward
Response Code All
5.2.4.4 pn32f Failures

Table 5-159 pn32f Failures

KPI Detail Measures the pn32f request failures in total
Metric Used for KPI sum(ocsepp_pn32f_requests_failure_total{namespace=~"$Namespace"}) by (PEER_DOMAIN, PEER_FQDN, PLMN_ID)
Service Operation n32f message forward
Response Code 4xx and 5xx
5.2.4.5 pn32f Routing Success Rate

Table 5-160 pn32f Routing Success Rate

KPI Detail Measures the pn32f routing success rate
Metric Used for KPI (sum(ocsepp_pn32f_responses_total{namespace=~"$Namespace"})/sum(ocsepp_pn32f_requests_total{namespace=~"$Namespace"}))*100
Service Operation n32f message forward
Response Code All

5.2.5 Global Rate Limiting Feature KPIs

5.2.5.1 PLMN IGW Global Rate limit Traffic Rejected

Table 5-161 PLMN IGW Global Rate limit Traffic Rejected

KPI Detail Measures the PLMN IGW Global rate limit traffic rejected
Metric Used for KPI

sum(irate(oc_ingressgateway_global_ratelimit_total{namespace=~"$Namespace",app="plmn-ingress-gateway", Status="dropped"}[2m]))

No. of messages rejected for traffic initiated from producer side

5.2.5.2 N32 IGW Global Rate limit Traffic Rejected

Table 5-162 N32 IGW Global Rate limit Traffic Rejected

KPI Detail Measures the N32 IGW Global rate limit traffic rejected
Metric Used for KPI

sum(irate(oc_ingressgateway_global_ratelimit_total{namespace=~"$Namespace",app="n32-ingress-gateway", Status="dropped"}[2m]))

No. of messages rejected for traffic initiated from consumer side

5.2.6 Topology Hiding KPIs

5.2.6.1 CN32F Topology Egress Request Processing Time(ms)

Table 5-163 CN32F Topology Egress Request Processing Time(ms)

KPI Detail Measures the cn32f topology Egress request processing time
Metric Used for KPI sum(irate(ocsepp_topology_latency_seconds_sum{app="cn32f-svc",direction="egress",message_type="request",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_topology_latency_seconds_count{app="cn32f-svc",direction="egress",message_type="request",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name)
Service Operation n32f message forward
Response Code NA
5.2.6.2 CN32F Topology Ingress Response Processing Time

Table 5-164 CN32F Topology Ingress Response Processing Time

KPI Detail Measures the cn32f topology Egress response processing time
Metric Used for KPI sum(irate(ocsepp_topology_latency_seconds_sum{app="cn32f-svc",direction="ingress",message_type="response",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_topology_latency_seconds_count{app="cn32f-svc",direction="ingress",message_type="response",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name)
Service Operation n32f message forward
Response Code NA
5.2.6.3 CN32F Topology Hiding Success

Table 5-165 CN32F Topology Hiding Success

KPI Detail Measures the n32f topology success by messages
Metric Used for KPI sum(ocsepp_topology_success_total{app="cn32f-svc", namespace=~"$Namespace"})
Service Operation n32f message forward
Response Code NA
5.2.6.4 N32F Topology Success by headers

Table 5-166 N32F N32F Topology Success by headers

KPI Detail Measures N32F Topology success by headers
Metric Used for KPI

sum(ocsepp_topology_header_success_total{app="cn32f-svc"}) by(header)

Note : Update label app to "app=pn32f-svc" for PN32F microservice.

Service Operation n32f message forward
Response Code NA
5.2.6.5 CN32F Topology Hiding Missing Regex Configuration

Table 5-167 CN32F Topology Hiding Missing Regex Configuration

KPI Detail Measures the cn32f topology hiding missing regex configuration
Metric Used for KPI sum(ocsepp_topology_header_regex_not_configured_total{app="cn32f-svc", namespace=~"$Namespace"}) by(error_msg)
Service Operation n32f message forward
Response Code NA
5.2.6.6 CN32F Topology Hiding Invalid Header Regex

Table 5-168 CN32F Topology Hiding Invalid Header Regex

Field Details
KPI Detail Measures the cn32f topology hiding invalid header regex
Metric Used for KPI

sum(ocsepp_topology_invalid_header_regex_configured_total{app="cn32f-svc", namespace=~"$Namespace"}) by(error_msg)

Service Operation n32f message forward
Response Code NA
5.2.6.7 PN32F Topology Ingress Request Processing Time(ms)

Table 5-169 PN32F Topology Ingress Request Processing Time(ms)

KPI Detail Measures the pn32f topology Ingress request processing time
Metric Used for KPI sum(irate(ocsepp_topology_latency_seconds_sum{app="pn32f-svc",direction="ingress",message_type="request",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_topology_latency_seconds_count{app="pn32f-svc",direction="ingress",message_type="request",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name)
Service Operation n32f message forward
Response Code NA
5.2.6.8 PN32F Topology Egress Response Processing Time(ms)

Table 5-170 PN32F Topology Egress Response Processing Time(ms)

KPI Detail Measures the pn32f topology Egress response processing time
Metric Used for KPI sum(irate(ocsepp_topology_latency_seconds_sum{app="pn32f-svc",direction="egress",message_type="response",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_topology_latency_seconds_count{app="pn32f-svc",direction="egress",message_type="response",namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name)
Service Operation n32f message forward
Response Code NA
5.2.6.9 PN32F Topology Hiding Invalid Header Regex

Table 5-171 PN32F Topology Hiding Invalid Header Regex

Field Details
KPI Detail Measures the pn32f topology hiding invalid header Regex
Metric Used for KPI sum(ocsepp_topology_invalid_header_regex_configured_total{app="pn32f-svc", namespace=~"$Namespace"}) by(error_msg)
Service Operation n32f message forward
Response Code NA
5.2.6.10 PN32F Topology Hiding Missing Regex Configuration

Table 5-172 PN32F Topology Hiding Missing Regex Configuration

Field Details
KPI Detail Measures the pn32f topology hiding missing Regex configuration
Metric Used for KPI sum(ocsepp_topology_header_regex_not_configured_total{app="pn32f-svc", namespace=~"$Namespace"}) by(error_msg)
Service Operation n32f message forward
Response Code NA
5.2.6.11 PN32F Topology Hiding Success

Table 5-173 PN32F Topology Hiding Success

Field Details
KPI Detail Measures the pn32f topology hiding success
Metric Used for KPI sum(ocsepp_topology_success_total{app="pn32f-svc", namespace=~"$Namespace"})
Service Operation n32f message forward
Response Code NA

5.2.7 5G SBI Message Mediation Support KPIs

5.2.7.1 Mediation Requests Counters - N32 Egress Request

Table 5-174 Mediation Requests Counters - N32 Egress Request

Field Details
KPI Detail Measures the Mediation Requests Counters for N32 Egress Request
Metric Used for KPI sum(ocsepp_n32f_mediation_requests_total{direction="N32_Egress_Request", namespace=~"$Namespace"})
5.2.7.2 Mediation Requests Counters - N32 Ingress Response

Table 5-175 Mediation Requests Counters - N32 Ingress Response

Field Details
KPI Detail Measures the Mediation Requests Counters for N32 Ingress Response
Metric Used for KPI sum(ocsepp_n32f_mediation_requests_total{direction="N32_Ingress_Response", namespace=~"$Namespace"})
5.2.7.3 Mediation Requests Counters - N32 Ingress Request

Table 5-176 Mediation Requests Counters - N32 Ingress Request

Field Details
KPI Detail Measures the Mediation Requests Counters for N32 Ingress Request
Metric Used for KPI sum(ocsepp_n32f_mediation_requests_total{direction="N32_Ingress_Request", namespace=~"$Namespace"})
5.2.7.4 Mediation Requests Counters - N32 Egress Response

Table 5-177 Mediation Requests Counters - N32 Egress Response

Field Details
KPI Detail Measures the Mediation Requests Counters for N32 Egress Response
Metric Used for KPI sum(ocsepp_n32f_mediation_requests_total{direction="N32_Egress_Response", namespace=~"$Namespace"})
5.2.7.5 Mediation Response Counters - N32 Egress Request

Table 5-178 Mediation Response Counters - N32 Egress Request

Field Details
KPI Detail Measures the Mediation Response Counters for N32 Egress Request
Metric Used for KPI sum(ocsepp_n32f_mediation_response_total{direction="N32_Egress_Request", namespace=~"$Namespace"})
5.2.7.6 Mediation Response Counters - N32 Ingress Response

Table 5-179 Mediation Response Counters - N32 Ingress Response

Field Details
KPI Detail Measures the Mediation Response Counters for N32 Ingress Response
Metric Used for KPI sum(ocsepp_cn32f_mediation_response_total{direction="N32_Ingress_Response",namespace=~"$Namespace"})
5.2.7.7 Mediation Response Counters - N32 Ingress Request

Table 5-180 Mediation Response Counters - N32 Ingress Request

Field Details
KPI Detail Measures the Mediation Response Counters for N32 EIngress Request
Metric Used for KPI sum(ocsepp_pn32f_mediation_response_total{direction="N32_Ingress_Request",namespace=~"$Namespace"})
5.2.7.8 Mediation Response Counters - N32 Egress Response

Table 5-181 Mediation Response Counters - N32 Egress Response

Field Details
KPI Detail Measures the Mediation Response Counters for N32 Egress Response
Metric Used for KPI sum(ocsepp_pn32f_mediation_response_total{direction="N32_Egress_Response",namespace=~"$Namespace"})

5.2.7.9 Mediation Response Failure

Table 5-182 Mediation Response Failure

Field Details
KPI Detail Measures the Mediation Response Failure
Metric Used for KPI

sum(ocsepp_cn32f_mediation_response_failure{namespace=~"$Namespace"}) by (Direction, status_code)

sum(ocsepp_pn32f_mediation_response_failure{namespace=~"$Namespace"}) by (Direction, status_code)

5.2.7.10 Mediation Applied Total

Table 5-183 Mediation Applied Total

Field Details
KPI Detail Measures the Mediation Applied Total
Metric Used for KPI

(sum(ocsepp_n32f_mediation_requests_total{namespace=~"$Namespace"})*100)/(sum(ocsepp_n32f_mediation_not_applied_total{namespace=~"$Namespace"})+sum(ocsepp_n32f_mediation_requests_total{namespace=~"$Namespace"}))

5.2.7.11 Mediation Response Time At PN32F

Table 5-184 Mediation Response Time At PN32F

Field Details
KPI Detail Measures the Mediation Response Time at PN32F
Metric Used for KPI sum(irate(ocsepp_pn32f_mediation_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_pn32f_mediation_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name)
5.2.7.12 Mediation Response Time At CN32F

Table 5-185 Mediation Response Time At CN32F

Field Details
KPI Detail Measures the Mediation Response Time at CN32F
Metric Used for KPI

sum(irate(ocsepp_cn32f_mediation_latency_seconds_sum{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name) /sum(irate(ocsepp_cn32f_mediation_latency_seconds_count{namespace=~"$Namespace"}[2m])) by(peer_fqdn,remote_sepp_name)

5.2.8 Ingress Gateway Message Copy KPIs

5.2.8.1 Total Requests Data sent towards DD for Ingress Gateway

Table 5-186 Total Requests Data sent towards DD for Ingress Gateway

Field Details
KPI Detail Measures the total requests data sent towards DD for Ingress Gateway.
Metric Used for KPI

sum(irate(oc_egressgateway_msgcopy_requests_total{namespace="$Namespace",type="req"}[2m])) by(app)

5.2.8.2 Total Ack received from DD for Requests for Ingress Gateway

Table 5-187 Total Ack received from DD for Requests for Ingress Gateway

Field Details
KPI Detail Measures the total Ack received from DD for requests for Ingress Gateway.
Metric Used for KPI

sum(irate(oc_egressgateway_msgcopy_requests_total{namespace="$Namespace",type="ack"}[2m])) by(app)

5.2.9 Egress Gateway Message Copy KPIs

5.2.9.1 Total Requests Data sent towards DD for Egress Gateway

Table 5-188 Total Requests Data sent towards DD for Egress Gateway

Field Details
KPI Detail Measures the total Requests Data sent towards DD for Egress Gateway.
Metric Used for KPI

sum(irate(oc_egressgateway_msgcopy_requests_total{namespace="$Namespace",type="req"}[2m])) by(app)

5.2.9.2 Total Ack received from DD for Requests for Egress Gateway

Table 5-189 Total Ack received from DD for Requests for Egress Gateway

Field Details
KPI Detail Measures the total acknowledgement received from DD on Egress Gateway.
Metric Used for KPI

sum(irate(oc_egressgateway_msgcopy_requests_total{namespace="$Namespace",type="ack"}[2m])) by(app)

5.2.10 Hosted SEPP KPIs

5.2.10.1 CN32F Allowed P-RSS Validation Failure Count

Table 5-190 CN32F Allowed P-RSS Validation Failure Count

Field Details
KPI Detail Measures the number of messages failed due to incorrect routing rules configured at cn32f microservice.
Metric Used for KPI

sum(ocsepp_allowed_p_rss_routing_failure_total{app="cn32f-svc", namespace=~"$Namespace"}) by (app)

5.2.10.2 PN32F Allowed P-RSS Validation Failure Count

Table 5-191 PN32F Allowed P-RSS Validation Failure Count

Field Details
KPI Detail Measures the number of messages failed due to incorrect routing rules configured at pn32f microservice
Metric Used for KPI

sum(ocsepp_allowed_p_rss_routing_failure_total{app="pn32f-svc", namespace=~"$Namespace"}) by (app)

5.2.11 SoR KPIs

5.2.11.1 Pn32f to SoR Request count total

Table 5-192 Pn32f to SoR Request count total

Field Details
KPI Detail Number of messages sent to SOR from SEPP
Metric Used for KPI

sum(ocsepp_pn32f_sor_requests_total{direction="egress", namespace=~"$Namespace"})

5.2.11.2 SoR to Pn32f Response count total

Table 5-193 SoR to Pn32f Response count total

Field Details
KPI Detail Number of responses received from SOR to SEPP
Metric Used for KPI

sum(ocsepp_pn32f_sor_responses_total{direction="ingress", namespace=~"$Namespace"})

5.2.12 Rate Limiting for Ingress Roaming Signaling per Remote SEPP Set KPIs

5.2.12.1 Average No of messages discarded for a particular RSS

Table 5-194 Average No of messages discarded for a particular RSS

Field Details
KPI Detail Measures the average number of messages discarded for a particular RSS.
Metric used for KPI sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace=“namespace”,Remote_SEPP_Set=“<Remote SEPP Set name>", Status=“dropped”}[2m]))
5.2.12.2 Average No of messages accepted for a particular RSS

Table 5-195 Average No of messages accepted for a particular RSS

Field Details
KPI Detail Measures the average number of messages accepted for a particular RSS.
Metric used for KPI sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace=“namespace”,Remote_SEPP_Set=“<Remote SEPP Set name>”, Status=“accepted”}[2m]))
5.2.12.3 Average No of messages for which feature not applied

Table 5-196 Average No of messages for which feature not applied

Field Details
KPI Detail Measures the average number of messages for which feature not applied.
Metric used for KPI sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace=“namespace”,Status=“ratelimit not applied”}[2m]))
5.2.12.4 Average of all messages by Status

Table 5-197 Average of all messages by Status

Field Details
KPI Detail Measures the average of all messages by Status
Metric used for KPI sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace=“namespace”}[2m])) by (Status)
5.2.12.5 List of Average number of messages dropped for all RSS

Table 5-198 List of Average number of messages dropped for all RSS

Field Details
KPI Detail Lists the average number of messages dropped for all RSS
Metric used for KPI sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace=“namespace”, Status=“dropped”}[2m])) by (Remote_SEPP_Set)
5.2.12.6 List of Average number of messages accepted for all RSS

Table 5-199 List of Average number of messages accepted for all RSS

Field Details
KPI Detail Lists the average number of messages accepted for all RSS
Metric used for KPI sum(irate(oc_ingressgateway_rss_ratelimit_total{namespace=“namespace”, Status=“accepted”}[2m])) by (Remote_SEPP_Set)

5.2.13 Cat 0 - SBI Message Schema Validation KPIs

5.2.13.1 Message validation applied requests on cn32f

Table 5-200 Message validation applied requests on cn32f

Field Details
KPI Detail Measures the total number of requests at CN32F on which message validation has been applied by request path.
Metric Used for KPI sum(ocsepp_message_validation_applied_total{namespace=~"$Namespace",app="cn32f-svc"}) by (requestPath)
5.2.13.2 Cn32f message validation failure on request body

Table 5-201 Cn32f message validation failure on request body

Field Details
KPI Detail Measures the total number of message validation failure(s) on request body by request path.
Metric Used for KPI sum(ocsepp_message_validation_on_body_failure_total{namespace=~"$Namespace",app="cn32f-svc"}) by (request_path)
5.2.13.3 Cn32f message validation failures on request query parameter(s)

Table 5-202 Cn32f message validation failures on request query parameter(s)

Field Details
KPI Detail Measures the total number of message validation failures on request query parameter(s) by request path.
Metric Used for KPI sum(ocsepp_message_validation_on_header_failure_total{namespace=~"$Namespace",app="cn32f-svc"}) by (request_path)
5.2.13.4 Message validation applied requests on pn32f

Table 5-203 Message validation applied requests on pn32f

Field Details
KPI Detail Measures the total number of requests at pn32f on which message validation has been applied by request path.
Metric Used for KPI sum(ocsepp_message_validation_applied_total{namespace=~"$Namespace",app="pn32f-svc"}) by (requestPath)
5.2.13.5 Pn32f message validation failure on request body

Table 5-204 Pn32f message validation failure on request body

Field Details
KPI Detail Measures the total number of message validation failure(s) on request body by request path.
Metric Used for KPI sum(ocsepp_message_validation_on_body_failure_total{namespace=~"$Namespace",app="pn32f-svc"}) by (request_path)
5.2.13.6 Pn32f message validation failures on request query parameter(s)

Table 5-205 Pn32f message validation failures on request query parameter(s)

Field Details
KPI Detail Measures the total number of message validation failures on request query parameter(s) by request path.
Metric Used for KPI sum(ocsepp_message_validation_on_header_failure_total{namespace=~"$Namespace",app="pn32f-svc"}) by (request_path)

5.2.14 Rate Limiting for Egress Roaming Signaling per PLMN KPIs

5.2.14.1 Average Number of Messages Rejected for a Particular PLMN

Table 5-206 Average Number of Messages Rejected for a Particular PLMN

Field Details
KPI Detail Measures the average number of messages rejected for a particular PLMN
Metric used for KPI sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace=“namespace”,PLMN_ID="PLMN ID", Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[2m]))
5.2.14.2 Average Number of Messages Accepted for a Particular PLMN

Table 5-207 Average Number of Messages Accepted for a Particular PLMN

Field Details
KPI Detail Measures the average number of messages accepted for a particular PLMN
Metric used for KPI sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace=“namespace”,PLMN_ID="PLMN ID", Status=~"ERL_MATCH_TOKEN_AVAILABLE_FWD|ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD"}[2m])
5.2.14.3 Average Number of Messages for which Feature not Applied

Table 5-208 Average Number of Messages for which Feature not Applied

Field Details
KPI Detail Measures the average number of messages for which feature not applied
Metric used for KPI sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace=“namespace”,Status!~"ERL_MATCH_TOKEN_AVAILABLE_FWD|ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD|ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[2m]))
5.2.14.4 Average of all Messages by Status

Table 5-209 Average of all Messages by Status

Field Details
KPI Detail Measures the average of all messages by status
Metric used for KPI sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace=“namespace”}[2m])) by (Status)
5.2.14.5 Average Number of Messages Rejected per PLMN

Table 5-210 Average Number of Messages Rejected per PLMN

Field Details
KPI Detail Measures the average number of messages rejected per PLMN
Metric used for KPI sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace="namespace", Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[2m])) by (PLMN_ID)
5.2.14.6 Average Number of Messages Accepted per PLMN

Table 5-211 Average Number of Messages Accepted per PLMN

Field Details
KPI Detail Measures the average number of messages accepted per PLMN
Metric used for KPI sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{namespace="namespace", Status=~"ERL_MATCH_TOKEN_AVAILABLE_FWD|ERL_MATCH_NO_TOKEN_HIGH_PRI_FWD"}[2m])) by (PLMN_ID)

5.3 SEPP Alerts

This section provides information about the SEPP alerts and their configuration.

Note:

For CNE1.8.4 or earlier versions:

  • namespace: {{$labels.kubernetes_namespace}}
  • podname: {{$labels.kubernetes_pod_name}}

For CNE 1.9.x or later versions:

  • namespace: {{$labels.namespace}}
  • podname: {{$labels.pod}}

5.3.1 System Level Alerts

5.3.1.1 SEPPPodMemoryUsageAlert

Table 5-212 SEPPPodMemoryUsageAlert

Trigger Condition Pod memory usage is above the threshold (70% )
Severity Warning
Alert details provided Summary
'namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, 
timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: 
Memory usage is {{ $value | printf "%.2f" }} which is above 70% (current value is: {{ $value }})'
Expression:
(sum by(namespace,container) (container_memory_usage_bytes{container=~".*cn32c-svc.*|.*pn32c-svc.*|.*cn32f-svc.*|.*pn32f-svc.*|.*config-mgr-svc.*|.*n32-egress-gateway.*|.*n32-ingress-gateway.*|.*plmn-egress-gateway.*|.*plmn-ingress-gateway.*|.*nf-mediation.*"}) / sum by(namespace,container)(container_spec_memory_limit_bytes{container=~".*cn32c-svc.*|.*pn32c-svc.*|.*cn32f-svc.*|.*pn32f-svc.*|.*config-mgr-svc.*|.*n32-egress-gateway.*|.*n32-ingress-gateway.*|.*plmn-egress-gateway.*|.*plmn-ingress-gateway.*|.*nf-mediation.*"}) ) * 100 >= 70
OID 1.3.6.1.4.1.323.5.3.46.1.2.4003
Metric Used

container_memory_usage_bytes

Note: This is a Kubernetes metric used for instance availability monitoring. If the metric is not available, use the similar metric as exposed by the monitoring system.

Resolution

The alert gets cleared when the memory utilization falls below the critical threshold.

Note: The threshold is configurable in the SeppAlertrules.yaml file.

If guidance is required, contact My Oracle Support.

5.3.1.2 SEPPPodCpuUsageAlert

Table 5-213 SEPPPodCpuUsageAlert

Field Details
Trigger Condition Pod CPU usage is above the threshold ( 70% )
Severity Warning
Alert details provided Summary
'namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, 
timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: 
CPU usage is {{ $value | printf "%.2f" }}
 which is usage is above 70% (current value is: {{ $value }})'
Expression:
(sum by (namespace,container) (rate(container_cpu_usage_seconds_total{container=~".*cn32c-svc.*|.*pn32c-svc.*|.*cn32f-svc.*|.*pn32f-svc.*|.*config-mgr-svc.*|.*n32-egress-gateway.*|.*n32-ingress-gateway.*|.*plmn-egress-gateway.*|.*plmn-ingress-gateway.*|.*nf-mediation.*"}[2m])) ) / (sum by (container, namespace) (kube_pod_container_resource_limits{resource="cpu",container=~".*cn32c-svc.*|.*pn32c-svc.*|.*cn32f-svc.*|.*pn32f-svc.*|.*config-mgr-svc.*|.*n32-egress-gateway.*|.*n32-ingress-gateway.*|.*plmn-egress-gateway.*|.*plmn-ingress-gateway.*|.*nf-mediation.*"}) ) * 100 >= 70
OID 1.3.6.1.4.1.323.5.3.46.1.2.4002
Metric Used

container_cpu_usage_seconds_total

Note : This is a Kubernetes metric used for instance availability monitoring. If the metric is not available, use the similar metric as exposed by the monitoring system.

Resolution The alert gets cleared when the CPU utilization is below the critical threshold.

Note: The threshold is configurable in the SeppAlertrules.yaml file.

If guidance is required, contact My Oracle Support.

5.3.2 Application Level Alerts

5.3.2.1 Common Alerts
5.3.2.1.1 SEPPN32fRoutingFailure

Table 5-214 SEPPN32fRoutingFailure

Field Details
Trigger Condition N32f service not able to forward message
Severity Info
Alert details provided Summary
namespace: {{ $labels.namespace}}, podname: {{ $labels.pod}}, timestamp: {{ with query
      "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}
Expression:
idelta(ocsepp_cn32f_requests_failure_total[2m]) > 0 or (ocsepp_cn32f_requests_failure_total
      unless ocsepp_cn32f_requests_failure_total offset 2m)
OID 1.3.6.1.4.1.323.5.3.46.1.2.4001
Metric Used ocsepp_cn32f_requests_failure_total
Resolution

The alert gets cleared when Consumer SEPP accepts request only if producer NF domain and PLMN match the Remote SEPP configured.

Steps:

The failure reason is present in the alert.

Possible Resolutions :

  1. Check whether the Remote SEPP is present in database.
  2. Validate the Remote SEPP PLMN which is configured.
  3. Validate the handshake is completed with the remote SEPP and context is present in database.
  4. Validate the producer NF Domain.
  5. Check whether the Remote SEPP Set for required Remote SEPP is present in the database.
  6. Check whether the N32F route is present in database (common_configuration table).
5.3.2.1.2 SEPPConfigMgrRouteFailureAlert

Table 5-215 SEPPConfigMgrRouteFailureAlert

Trigger Condition When routing failure occurs while posting remote SEPP or roaming partner set, this alert will be raised.
Severity Major
Alert Details Provided

Summary

namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Route Failure has occurred because {{ $labels.errorReason }}

Expression

sum(increase(ocsepp_configmgr_routefailure_total{app="config-mgr-svc"}[5m]) >0 or (ocsepp_configmgr_routefailure_total{app="config-mgr-svc"} unless ocsepp_configmgr_routefailure_total{app="config-mgr-svc"} offset 5m )) by (namespace,errorCode) > 0
OID 1.3.6.1.4.1.323.5.3.46.1.2.4026
Metric Name Metric ocsepp_configmgr_routefailure_total
Resolution The alert is cleared if no new failures are observed in 5 minutes window.
5.3.2.1.3 EgressSbiErrorRateAbove1Percent

Table 5-216 EgressSbiErrorRateAbove1Percent

Trigger Condition Sbi Transaction Error Rate exceeded configured threshold
Severity Major
Alert details provided Summary
"Sbi Transaction Error Rate detected above 1 Percent of Total Sbi
    Transactions"
Expression
 sum(rate(oc_egressgateway_sbiRouting_http_responses_total{Status!~"2.*"}[24h]))
      by (app,pod, namespace) /sum(rate(oc_egressgateway_sbiRouting_http_responses_total[24h])) by
      (app,pod, namespace) *100 >= 1 
OID 1.3.6.1.4.1.323.5.3.46.1.2.7001
Metric Used oc_egressgateway_sbiRouting_http_responses_total
Resolution

This alert will be raised when the total SBI transaction error rate will be above 1% of the total transaction done during 24 hour time period. Metric will be cleared when the error rate will be below 1%

5.3.2.2 Handshake Alerts
5.3.2.2.1 SEPPCn32cHandshakeFailureAlert

Table 5-217 SEPPCn32cHandshakeFailureAlert

Trigger Condition Handshake procedure has failed on Consumer SEPP
Severity Major
Alert details provided Summary
'namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, 
timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}:
 Handshake procedure has failed on Consumer side because {{ $labels.reason }}'
Expression:
 sum(increase(ocsepp_n32c_handshake_failure_attempts_total{app="cn32c-svc"}[5m])
    >0 or (ocsepp_n32c_handshake_failure_attempts_total{app="cn32c-svc"}  unless
    ocsepp_n32c_handshake_failure_attempts_total{app="cn32c-svc"}  offset 5m )) by
    (namespace,remote_sepp_name,nfinstanceid,peer_fqdn,app)  > 0
OID 1.3.6.1.4.1.323.5.3.46.1.2.2001
Metric Used ocsepp_n32c_handshake_failure_attempts_total filtered by app=cn32-svc
Resolution 1 The alert gets cleared when the N32C Handshake is established after successful TCP connection to remote SEPP.

Failure reason: Release name used while helm installation is other than ocsepp-release.

Error Verification: Check the failure reason in the alert. If the failure reason is 404 –route not found or Route not found, follow the recovery steps:

  1. Run the following command to get pod details:

    $ kubectl get pods –n <namespace>

    Example:
    
    # kubectl get pods -n csepp
    NAME                                                 READY   STATUS                  RESTARTS   AGE
    ocsepp-release-appinfo-6cdc48fc47-c9gfv              1/1     Running                 0          8d
    ocsepp-release-cn32c-svc-6547db777d-76gwd            1/1     Running                 0          8d
    ocsepp-release-cn32f-svc-7cd54bdf68-czbnb            1/1     Running                 0          8d
    ocsepp-release-config-mgr-svc-79c95d4b9d-8stk7       1/1     Running                 0          8d
    ocsepp-release-n32-egress-gateway-54c658b947-s5f9m   0/2     Pending                 0          23h
    ocsepp-release-n32-egress-gateway-54c658b947-scvvp   2/2     Running                 0          7d23h
    ocsepp-release-n32-ingress-gateway-777c68cb9-8jsdc   0/2     Pending                 0          23h
    ocsepp-release-n32-ingress-gateway-777c68cb9-98t7x   0/2     Init:ImagePullBackOff   0          23h
    ocsepp-release-pn32c-svc-58bff857f-jmfdd             1/1     Running                 0          8d
    ocsepp-release-pn32f-svc-784d5c7568-rh24g            
    
  2. Run the following command to navigate to the pod:

    $ kubectl exec –it <config-mgr-pod name> –n <namespace> bash

    Example:

    $ kubectl exec -it ocsepp-release-config-mgr-svc-79c95d4b9d-8stk7 -n csepp bash
  3. Run the command to get the existing route details present on N32 Egress Gateway:

    curl -X GET http://<config-manager-service-name>:9090/sepp/nf-common-component/v1/egw/n32/routesconfiguration

    Example:

    curl -X GET http://ocsepp-release-config-mgr-svc:9090/sepp/nf-common-component/v1/egw/n32/routesconfiguration
  4. If this output is null, add the configuration details in config-mgr-svc deployment.

    For more information about the configuration details, see the Deployment Configuration for Config-mgr-svc section in Oracle Communications Cloud Native Core Security Edge Protection Proxy Installation Guide.

  5. After the config-mgr-svc pod is restarted, run the step1 to step3 again. After adding the configuration, rerun the curl command mentioned in step3 to get the route details.
  6. Delete and add the RemoteSepp and reinitiate the handshake.

    If the value is still null, contact My Oracle Support.
Resolution 2

The alert gets cleared when the N32C Handshake is established after successful TCP connection to remote SEPP.

Steps:

The failure reason is present in the alert.

Possible Resolutions:

  1. Disable the Remote SEPP.
  2. Delete the Remote SEPP.
  3. Update and reinitiate Handshake.
5.3.2.2.2 SEPPPn32cHandshakeFailureAlert

Table 5-218 SEPPPn32cHandshakeFailureAlert

Trigger Condition Handshake procedure has failed on Producer sepp
Severity Major
Alert details provided Summary
 'namespace: {{$labels.namespace}}, podname: {{$labels.pod}},
 timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}:
 Handshake procedure has failed on Producer side because {{ $labels.error_msg }}'
Expression:
sum(increase(ocsepp_n32c_handshake_failure_attempts_total{app="pn32c-svc"}[5m])
    >0 or (ocsepp_n32c_handshake_failure_attempts_total{app="pn32c-svc"}  unless
    ocsepp_n32c_handshake_failure_attempts_total{app="pn32c-svc"}  offset 5m )) by
    (namespace,remote_sepp_name,nfinstanceid,peer_fqdn,app)  > 0
OID 1.3.6.1.4.1.323.5.3.46.1.2.3001
Metric Used ocsepp_n32c_handshake_failure_attempts_total filtered by app=pn32-svc
Resolution

The alert gets cleared when the N32C Handshake is successful due to TCP connection success of Producer to consumer SEPP.

Steps:

The failure reason is present in the alert.

Possible Resolution:

Update and reinitiate the Handshake.
5.3.2.3 Upgrade Alerts
5.3.2.3.1 SEPPUpgradeStartedAlert

Table 5-219 SEPPUpgradeStartedAlert

Trigger Condition Rest API trigger at start of Upgrade
Severity NA
Alert details provided

applicationname

alertname

servicename

releasename

namespace

oid

severity

vendor

sourcerelease

targetrelease

OID 1.3.6.1.4.1.323.5.3.46.1.2.8001
Metric Used NA
Resolution

If a success alert is generated then start and failure alerts will be cleared.

5.3.2.3.2 SEPPUpgradeFailedAlert

Table 5-220 SEPPUpgradeFailedAlert

Trigger Condition Rest API trigger at failure of Upgrade
Severity NA
Alert details provided

applicationname

alertname

servicename

releasename

namespace

oid

severity

vendor

sourcerelease

targetrelease

OID 1.3.6.1.4.1.323.5.3.46.1.2.8002
Metric Used NA
Resolution

If a success alert is generated then start and failure alerts will be cleared.

5.3.2.3.3 SEPPUpgradeSuccessfulAlert

Table 5-221 SEPPUpgradeSuccessfulAlert

Trigger Condition Rest API trigger at success of Upgrade
Severity NA
Alert details provided

applicationname

alertname

servicename

releasename

namespace

oid

severity

vendor

sourcerelease

targetrelease

OID 1.3.6.1.4.1.323.5.3.46.1.2.8003
Metric Used NA
Resolution

If a success alert is generated then start and failure alerts will be cleared.

5.3.2.4 Rollback Alerts
5.3.2.4.1 SEPPRollbackStartedAlert

Table 5-222 SEPPRollbackStartedAlert

Trigger Condition Rest API trigger at start of Rollback
Severity NA
Alert details provided

applicationname

alertname

servicename

releasename

namespace

oid

severity

vendor

sourcerelease

targetrelease

OID 1.3.6.1.4.1.323.5.3.46.1.2.8004
Metric Used NA
Resolution

If a success alert is generated then start and failure alerts will be cleared.

5.3.2.4.2 SEPPRollbackFailedAlert

Table 5-223 SEPPRollbackFailedAlert

Trigger Condition Rest API trigger at failure of Rollback
Severity NA
Alert details provided

applicationname

alertname

servicename

releasename

namespace

oid

severity

vendor

sourcerelease

targetrelease

OID 1.3.6.1.4.1.323.5.3.46.1.2.8005
Metric Used NA
Resolution

If a success alert is generated then start and failure alerts will be cleared.

5.3.2.4.3 SEPPRollbackSuccessfulAlert

Table 5-224 SEPPRollbackSuccessfulAlert

Trigger Condition Rest API trigger at success of Rollback
Severity NA
Alert details provided

applicationname

alertname

servicename

releasename

namespace

oid

severity

vendor

sourcerelease

targetrelease

OID 1.3.6.1.4.1.323.5.3.46.1.2.8006
Metric Used NA
Resolution Cleared after DEFAULT_DURATION_FOR_ALERT_EXPIRY minutes
5.3.2.5 Global Rate Limiting on Ingress Gateway of SEPP Alerts
5.3.2.5.1 IngressGlobalMessageDropAbovePointOnePercent

Table 5-225 IngressGlobalMessageDropAbovePointOnePercent

Trigger Condition Ingress Global Message Drop Rate detected greater than or equal to 0.1 Percent of Total Transactions.
Severity Warning
Alert details provided Summary
"Ingress Global Message Drop Rate detected above 0.1 Percent of Total Transactions"
Expression
 sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by
      (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 0.1 <
      1  
OID 1.3.6.1.4.1.323.5.3.46.1.2.7002
Metric Used oc_ingressgateway_global_ratelimit_total
Resolution

The alert will be raised when the percentage of messages rejected for Global Rate Limit will be greater than or equal to 0.1% of the total messages received. This will get cleared once percentage of message rejected is below 0.1% or greater than or equal to 1%.

5.3.2.5.2 IngressGlobalMessageDropAbove1Percent

Table 5-226 IngressGlobalMessageDropAbove1Percent

Trigger Condition Ingress Global Message Drop Rate detected greater than or equal to 1 Percent of Total Transactions.
Severity Warning
Alert details provided Summary
"Ingress Global Message Drop Rate detected above 1 Percent of Total Transactions"
Expression
 sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by
      (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 1 <
      10 
OID 1.3.6.1.4.1.323.5.3.46.1.2.7003
Metric Used oc_ingressgateway_global_ratelimit_total
Resolution

The alert will be raised when the percentage of messages rejected for Global Rate Limit will be greater than or equal to 1% of the total messages received. This will get cleared once percentage of message rejected is below 1% greater than or equal to 10%.

5.3.2.5.3 IngressGlobalMessageDropAbove10Percent

Table 5-227 IngressGlobalMessageDropAbove10Percent

Trigger Condition Ingress Global Message Drop Rate detected greater than or equal to 10 Percent of Total Transactions
Severity Minor
Alert details provided Summary
"Ingress Global Message Drop Rate detected above 10 Percent of Total Transactions"
Expression
 sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by
      (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 10 <
      25 
OID 1.3.6.1.4.1.323.5.3.46.1.2.7004
Metric Used oc_ingressgateway_global_ratelimit_total
Resolution

The alert will be raised when the percentage of messages rejected for Global Rate Limit will be greater than or equal to 10% of the total messages received. This will get cleared once percentage of message rejected is below 10% or greater than or equal to 25% .

5.3.2.5.4 IngressGlobalMessageDropAbove25Percent

Table 5-228 IngressGlobalMessageDropAbove25Percent

Trigger Condition Ingress Global Message Drop Rate detected greater than or equal to 25 Percent of Total Transactions
Severity Major
Alert details provided Summary
"Ingress Global Message Drop Rate detected above 25 Percent of Total Transactions"
Expression
 sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by
      (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >= 25 <
      50 
OID 1.3.6.1.4.1.323.5.3.46.1.2.7005
Metric Used oc_ingressgateway_global_ratelimit_total
Resolution

The alert will be raised when the percentage of messages rejected for Global Rate Limit will be greater than or equal to 25% of the total messages received.This will get cleared once percentage of message rejected is below 25% or greater than or equal to 50%.

5.3.2.5.5 IngressGlobalMessageDropAbove50Percent

Table 5-229 IngressGlobalMessageDropAbove50Percent

Trigger Condition Ingress Global Message Drop Rate detected greater than or equal to 50 Percent of Total Transactions
Severity Critical
Alert details provided Summary
"Ingress Global Message Drop Rate detected above 50 Percent of Total Transactions"
Expression
 sum(rate(oc_ingressgateway_global_ratelimit_total{Status="dropped"}[5m])) by
      (namespace)/sum(rate(oc_ingressgateway_global_ratelimit_total[5m])) by (namespace) *100 >=
      50
OID 1.3.6.1.4.1.323.5.3.46.1.2.7006
Metric Used oc_ingressgateway_global_ratelimit_total
Resolution

The alert will be raised when the percentage of messages rejected for Global Rate Limit will be greater than or equal to 50% of the total messages received.This will get cleared once percentage of message rejected is below 50%.

5.3.2.6 Topology Hiding Alerts
5.3.2.6.1 SEPPN32fTopologyOperationFailureAlert

Table 5-230 SEPPN32fTopologyOperationFailureAlert

Field Details
Trigger Condition Topology Hiding or Recovery Failure exceeded configured threshold (1%)
Severity Major
Alert details provided Summary
"Topology hiding/recovery operation failres reached more than configured threshold"
Expression
 delta(ocsepp_topology_header_failure_total[2m])>0 or 
(ocsepp_topology_header_failure_total unless ocsepp_topology_header_failure_total offset 2m)
OID 1.3.6.1.4.1.323.5.3.46.1.2.4004
Metric Used ocsepp_topology_header_failure_total, ocsepp_topology_header_success_total
Resolution

This alert will be raised when the total Topology Hiding or Recovery failures reach more than 1%.

Alert will be cleared when the error rate is below 1%.

Possible Resolutions:

  1. Check the header for which alert is raised, header name present in alert label.
  2. Verify the error_msg using "ocsepp_topology_header_failure_total" metric and KPI.
  3. Fix or add configuration for the header.

Note: The alert will be cleared only if the corresponding success metric is pegged.

5.3.2.6.2 SEPPN32fTopologyBodyOperationFailureAlert

Table 5-231 SEPPN32fTopologyBodyOperationFailureAlert

Field Details
Trigger Condition

Topology Operation failed and exceeds defined threshold

Severity Major
Alert details provided Summary
"Topology Hiding/Recovery Operation failures reached more than configured
    threshold"
Expression:
delta(ocsepp_topology_body_failure_total[2m])>0 or 
(ocsepp_topology_body_failure_total unless ocsepp_topology_body_failure_total offset 2m)
OID 1.3.6.1.4.1.323.5.3.46.1.2.4006
Metric Used ocsepp_topology_body_failure_total

ocsepp_topology_body_success_total
Resolution This alert will be raised when the total Topology Hiding or Recovery for message body failures reach more than 1%.

Alert will be cleared when the error rate will be below 1%.

Possible Resolutions:
  1. Check the apiUrl, method for which alert is raised, apiUrl present in alert label.
  2. Verify the error_msg using "ocsepp_topology_body_failure_total" metric and KPI.
  3. Fix or add configuration for the body Identifiers.
Note: The alert will be cleared only if the corresponding success metric is pegged.
5.3.2.7 5G SBI Message Mediation Support Alerts
5.3.2.7.1 SEPPCN32fMediationFailure

Table 5-232 SEPPCN32fMediationFailure

Trigger Condition

Mediation processing Failure

Severity Info
Alert details provided Summary
 "Mediation processing Failure"
Expression:
increase(ocsepp_cn32f_mediation_response_failure{status_code!="504
      GATEWAY_TIMEOUT"}[10m]) > 0
OID 1.3.6.1.4.1.323.5.3.46.1.2.4007
Metric Used ocsepp_cn32f_mediation_response_failure
Resolution

This alert will be raised when Mediation microservice is unable to apply rules on the incoming request & response from SEPP.

Possible Resolution:
  1. Check if the Mediation Rules exist.
  2. Check the Agenda Group in the mediation rule is matching from the request and response sent from SEPP.
5.3.2.7.2 SEPPCN32fMediationUnreachable

Table 5-233 SEPPCN32fMediationUnreachable

Trigger Condition

Mediation service is not accessible

Severity Critical
Alert details provided Summary
"Mediation service is not accessible"
Expression:
 increase(ocsepp_cn32f_mediation_response_failure {status_code="504
      GATEWAY_TIMEOUT"}[10m]) > 0
OID 1.3.6.1.4.1.323.5.3.46.1.2.4008
Metric Used ocsepp_cn32f_mediation_response_failure
Resolution

This alert will be raised when Mediation microservice is not accessible.

Possible Resolution:
  1. Check if the Mediation microservice pod is up.
  2. Check if Mediation Service Name and servicePort number is correct.
5.3.2.7.3 SEPPPN32fMediationFailure

Table 5-234 SEPPPN32fMediationFailure

Trigger Condition

Mediation processing Failure

Severity Info
Alert details provided Summary
"Mediation processing Failure"
Expression:
increase(ocsepp_pn32f_mediation_response_failure {status_code!="504
      GATEWAY_TIMEOUT"}[10m]) > 0
OID 1.3.6.1.4.1.323.5.3.46.1.2.4009
Metric Used ocsepp_pn32f_mediation_response_failure
Resolution

This alert will be raised when Mediation microservice is unable to apply rules on the incoming request & response from SEPP.

Possible Resolution:
  1. Check if the Mediation Rules exist.
  2. Check the Agenda Group in the mediation rule is matching from the request and response sent from SEPP.
5.3.2.7.4 SEPPPN32fMediationUnreachable

Table 5-235 SEPPPN32fMediationUnreachable

Trigger Condition

Mediation service is not accessible

Severity Critical
Alert details provided Summary
"Mediation service is not accessible"
Expression:
increase(ocsepp_pn32f_mediation_response_failure {status_code="504
      GATEWAY_TIMEOUT"}[10m]) > 0
OID 1.3.6.1.4.1.323.5.3.46.1.2.4010
Metric Used ocsepp_pn32f_mediation_response_failure
Resolution

This alert will be raised when Mediation microservice is not accessible.

Possible Resolution:
  1. Check if the Mediation microservice pod is up.
  2. Check if Mediation Service Name and servicePort number is correct.
5.3.2.8 Overload Control Alerts
5.3.2.8.1 SEPPServiceOverload65Percent

Table 5-236 SEPPServiceOverload65Percent

Trigger Condition CPU memory of pn32f-svc more than 65%
Severity Warning
Alert details provided Summary
Backend service is in overload with load level > 65%
Expression
 service_resource_overload_level == 1 
OID 1.3.6.1.4.1.323.5.3.46.1.2.7007
Metric Used service_resource_overload_level
Resolution

The alert will be cleared when CPU Memory for backend-svc goes below 60%.

5.3.2.8.2 SEPPServiceOverload70Percent

Table 5-237 SEPPServiceOverload70Percent

Trigger Condition CPU memory of pn32f-svc more than 70%
Severity Minor
Alert details provided Summary
Backend service is in overload with load level > 70%
Expression
 service_resource_overload_level == 2 
OID 1.3.6.1.4.1.323.5.3.46.1.2.7008
Metric Used service_resource_overload_level
Resolution

The alert will be cleared when CPU Memory for backend-svc goes below 70%

5.3.2.8.3 SEPPServiceOverload80Percent

Table 5-238 SEPPServiceOverload80Percent

Trigger Condition CPU memory of pn32f-svc more than 80%
Severity Major
Alert details provided Summary
Backend service is in overload with load level > 80%
Expression
 service_resource_overload_level == 3
OID 1.3.6.1.4.1.323.5.3.46.1.2.7009
Metric Used service_resource_overload_level
Resolution

The alert will be cleared when CPU Memory for backend-svc goes below 80%

5.3.2.8.4 SEPPServiceOverload90Percent

Table 5-239 SEPPServiceOverload90Percent

Trigger Condition CPU memory of pn32f-svc more than 90%
Severity Critical
Alert details provided Summary
Backend service is in overload with load level > 90%
Expression
 service_resource_overload_level == 4
OID 1.3.6.1.4.1.323.5.3.46.1.2.7010
Metric Used service_resource_overload_level
Resolution

The alert will be cleared when CPU Memory for backend-svc goes below 90%

5.3.2.9 Hosted SEPP Alerts
5.3.2.9.1 SEPPPn32fHSRoutingFailureAlert

Table 5-240 SEPPPn32fHSRoutingFailureAlert

Trigger Condition When the routing failure rate at Pn32f service is greater than 20 percentage.
Severity Major
Alert details provided Allowed P-RSS Validation failure at Roaming Hub

Expression

((sum by(namespace, app, nfInstanceId, pod) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod) (ocsepp_pn32f_requests_total))) > 0.2

OID 1.3.6.1.4.1.323.5.3.46.1.2.4013
Metric Used ocsepp_allowed_p_rss_routing_failure_total , ocsepp_pn32f_requests_total
Resolution The alert gets automatically cleared when the failure rate at pn32f microservice goes below 20 percent.
5.3.2.9.2 SEPPCn32fHSRoutingFailureAlertMinor

Table 5-241 SEPPCn32fHSRoutingFailureAlertMinor

Field Details
Trigger Condition When the routing failure rate at Cn32f service is greater than 50 percentage.
Severity Minor
Alert details provided Allowed P-RSS Validation failure at Roaming Hub for Consumer SEPP.

Expression

((sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_cn32f_requests_total))) > 0.5

OID 1.3.6.1.4.1.323.5.3.46.1.2.4014
Metric Used ocsepp_allowed_p_rss_routing_failure_total , ocsepp_cn32f_requests_total
Resolution The alert gets automatically cleared when the failure rate at cn32f microservice goes below 50 percent.
5.3.2.9.3 SEPPCn32fHSRoutingFailureAlertMajor

Table 5-242 SEPPCn32fHSRoutingFailureAlertMajor

Field Details
Trigger Condition When the routing failure rate at Cn32f service is greater than 60 percentage.
Severity Major
Alert details provided Allowed P-RSS Validation failure at Roaming Hub for Consumer SEPP.

Expression

((sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_cn32f_requests_total))) > 0.6

OID 1.3.6.1.4.1.323.5.3.46.1.2.4015
Metric Used ocsepp_allowed_p_rss_routing_failure_total, ocsepp_cn32f_requests_total
Resolution The alert gets automatically cleared when the failure rate at cn32f microservice goes below 60 percent.
5.3.2.9.4 SEPPCn32fHSRoutingFailureAlertCritical

Table 5-243 SEPCn32fHSRoutingFailureAlertCritical

Field Details
Trigger Condition When the routing failure rate at Cn32f service is greater than 65 percentage.
Severity Critical
Alert details provided Allowed P-RSS Validation failure at Roaming Hub for Consumer SEPP.

Expression

((sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_cn32f_requests_total))) > 0.65

OID 1.3.6.1.4.1.323.5.3.46.1.2.4016
Metric Used ocsepp_allowed_p_rss_routing_failure_total, ocsepp_cn32f_requests_total
Resolution The alert gets automatically cleared when the failure rate at cn32f microservice goes below 65 percent.
5.3.2.9.5 SEPPCn32fHSRoutingFailureAlertWarning

Table 5-244 SEPCn32fHSRoutingFailureAlertWarning

Field Details
Trigger Condition When the routing failure rate at Cn32f service is greater than 25 percentage.
Severity Warning
Alert details provided Allowed P-RSS Validation failure at Roaming Hub for Consumer SEPP.

Expression

((sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_allowed_p_rss_routing_failure_total) ) / (sum by(namespace, app, nfInstanceId, pod, sourceRss) (ocsepp_cn32f_requests_total))) > 0.25

OID 1.3.6.1.4.1.323.5.3.46.1.2.4017
Metric Used ocsepp_allowed_p_rss_routing_failure_total, ocsepp_cn32f_requests_total
Resolution The alert gets automatically cleared when the failure rate at cn32f microservice goes below 25 percent.
5.3.2.10 SEPP Message Feed Alerts
5.3.2.10.1 DDUnreachableFromN32IGW

Table 5-245 DDUnreachableFromN32IGW

Trigger Condition This alarm is raised when Data Director is not reachable from N32 Ingress Gateway.
Severity major
Alert details provided Summary

(oc_ingressgateway_dd_unreachable{app="n32-ingress-gateway"} == 1)
OID 1.3.6.1.4.1.323.5.3.46.1.2.4018
Metric Used oc_ingressgateway_dd_unreachable
Resolution Alert gets cleared automatically when the connection with Data Director is established.
5.3.2.10.2 DDUnreachableFromPLMNIGW

Table 5-246 DDUnreachableFromPLMNIGW

Trigger Condition This alarm is raised when Data Director is not reachable from PLMN Ingress Gateway.
Severity major
Alert details provided Summary

(oc_ingressgateway_dd_unreachable{app="n32-ingress-gateway"} == 1)
OID 1.3.6.1.4.1.323.5.3.46.1.2.4019
Metric Used oc_ingressgateway_dd_unreachable
Resolution Alert gets cleared automatically when the connection with Data Director is established.
5.3.2.10.3 DDUnreachableFromN32EGW

Table 5-247 DDUnreachableFromN32EGW

Trigger Condition This alarm is raised when Data Director is not reachable from N32 Egress Gateway.
Severity major
Alert details provided Summary

(oc_egressgateway_dd_unreachable{app="n32-egress-gateway"} == 1)
OID 1.3.6.1.4.1.323.5.3.46.1.2.4020
Metric Used oc_egressgateway_dd_unreachable
Resolution Alert gets cleared automatically when the connection with Data Director is established.
5.3.2.10.4 DDUnreachableFromPLMNEGW

Table 5-248 DDUnreachableFromPLMNEGW

Trigger Condition This alarm is raised when Data Director is not reachable from PLMN Egress Gateway.
Severity major
Alert details provided Summary (oc_egressgateway_dd_unreachable{app="plmn-egress-gateway"} == 1)
OID 1.3.6.1.4.1.323.5.3.46.1.2.4021
Metric Used oc_egressgateway_dd_unreachable
Resolution Alert gets cleared automatically when the connection with Data Director is established.
5.3.2.11 Steering of Roaming (SOR) Alerts
5.3.2.11.1 SEPPPn32fSORFailureAlertPercent30to40

Table 5-249 SEPPPn32fSORFailureAlertPercent30to40

Field Details
Trigger Condition 30% to 40% of SOR traffic results in failure.
Severity Minor
Alert details provided Summary:

'namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}'

Expression:

sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)>=0.3 and sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)<0.4

OID 1.3.6.1.4.1.323.5.3.46.1.2.4022
Metric Used ocsepp_pn32f_sor_failure_total and ocsepp_pn32f_sor_requests_total
Resolution

This alert will be raised when the percentage failure of SOR responses is in the range 30%-40%, in the sample collected in last 2 min.

Possible Resolutions :

  1. Check the below headers in the response coming from SOR server. If any of these is missing, it will cause SOR Failure:
    1. Server Header
    2. Location Header
  2. Check if the redirection code (3xx) received from SOR should be the same as the one configured through CNC Console. This code can be viewed in the metricocsepp_pn32f_sor_failure_total.
  3. Check if the SOR Server is sending the response code 5xx and whether the code is not configured through CNC Console or retry to Producer NF is disabled. This code can be viewed in the metric ocsepp_pn32f_sor_failure_total.
  4. Check if any client error(4xx) is coming while connecting to SoR. This code can be viewed in the metricocsepp_pn32f_sor_failure_total.
5.3.2.11.2 SEPPPn32fSORFailureAlertPercent40to50

Table 5-250 SEPPPn32fSORFailureAlertPercent40to50

Field Details
Trigger Condition 40% to 50% of SOR traffic results in failure.
Severity Major
Alert details provided Summary:

'namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}'

Expression:

sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)>=0.4 and sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)<0.5

OID 1.3.6.1.4.1.323.5.3.46.1.2.4023
Metric Used ocsepp_pn32f_sor_failure_total

and

ocsepp_pn32f_sor_requests_total
Resolution

This alert will be raised when the percentage failure of SOR responses is in the range 40%-50%, in the sample collected in last 2 min.

Possible Resolutions :

  1. Check the below headers in the response coming from SoR server, if any of these is missing, it will cause SOR Failure:
    1. Server Header
    2. Location Header
  2. Check if the redirection code (3xx) received from SOR should be same as one configured through CNC Console. This code can be viewed in the metricocsepp_pn32f_sor_failure_total.
  3. Check if SOR Server is sending response code 5xx and the code is not configured through CNC Console or Retry to Producer NF is disabled. This code can be viewed in the metricocsepp_pn32f_sor_failure_total.
  4. Check if any client error (4xx) is coming while connecting to SOR. This code can be viewed in the metricocsepp_pn32f_sor_failure_total.
5.3.2.11.3 SEPPPn32fSORFailureAlertPercentAbove50

Table 5-251 SEPPPn32fSORFailureAlertPercentAbove50

Field Details
Trigger Condition 50% of SOR traffic results in failure
Severity Critical
Alert details provided Summary:

'namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}'

Expression:

sum(rate(ocsepp_pn32f_sor_failure_total[2m]))by(namespace,nf_instance_id,app)/sum(rate(ocsepp_pn32f_sor_requests_total[2m]))by(namespace,nf_instance_id,app)>=0.5

OID 1.3.6.1.4.1.323.5.3.46.1.2.4024
Metric Used ocsepp_pn32f_sor_failure_total

and

ocsepp_pn32f_sor_requests_total
Resolution

This alert will be raised when the percentage failure of SOR responses is above 50%, in the sample collected in last 2 min.

Possible Resolutions :

  1. Check the below headers in the response coming from SOR server, if any of these is missing, it will cause SOR Failure:
    1. Server Header
    2. Location Header
  2. Check if the redirection code(3xx) received from SOR should be same as one configured via CNC Console. This code can be viewed in the metricocsepp_pn32f_sor_failure_total.
  3. Check if SOR Server is sending response code 5xx and the code is not configured through CNC Console or retry to Producer NF is disabled. This code can be viewed in the metricocsepp_pn32f_sor_failure_total.
  4. Check if any client error(4xx) is coming while connecting to SOR. This code can be viewed in the metricocsepp_pn32f_sor_failure_total.
5.3.2.11.4 SEPPPn32fSORTimeoutFailureAlert

Table 5-252 SEPPPn32fSORTimeoutFailureAlert

Field Details
Trigger Condition Increase of more than five timeout errors in last two minutes for SOR.
Severity critical
Alert details provided Summary:

'namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}'

Expression:

idelta(ocsepp_pn32f_sor_timeout_failure_total[2m]) > 5 or (ocsepp_pn32f_sor_timeout_failure_total unless ocsepp_pn32f_sor_timeout_failure_total offset 2m)
OID 1.3.6.1.4.1.323.5.3.46.1.2.4025
Metric Used ocsepp_pn32f_sor_timeout_failure_total
Resolution

This alert will be raised when the response received from SOR Server suggests that server is either down or unreachable for more than five error counts in the sample collected in last two minutes.

Possible Resolutions :

  1. Check and fix if the SOR server is unreachable.
  2. Check and fix if the configuration made through CNC Console has wrong values for server. Check if the FQDN and port configured are correct.
  3. The scheme selected must be supported by SOR server.
5.3.2.12 Global Rate Limiting on Ingress Gateway of SEPP Alerts
5.3.2.12.1 Ingress RSS Rate Limit per RSS Message Drop Above Point one Percent Alert

Table 5-253 Ingress RSS Rate Limit per RSS Message Drop Above Point one Percent Alert

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate per RSS is detected above 0.1 percent of total transactions of that RSS, this metric will be pegged and corresponding alert will be raised.
Severity Warning
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate per RSS detected above 0.1 Percent of Total Transactions of that RSS

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 0.1 < 10
OID 1.3.6.1.4.1.323.5.3.46.1.2.7011
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.12.2 Ingress RSS Rate Limit per RSS Message Drop Above 10 Percent Alert

Table 5-254 Ingress RSS Rate Limit per RSS Message Drop Above 10 Percent Alert

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate per RSS is detected above 10 percent of total transactions of that RSS, this metric will be pegged and corresponding alert will be raised.
Severity Minor
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate per RSS detected above 10 Percent of Total Transactions of that RSS

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 10 < 25
OID 1.3.6.1.4.1.323.5.3.46.1.2.7012
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.12.3 Ingress RSS Rate Limit per RSS Message Drop Above 25 Percent Alert

Table 5-255 Ingress RSS Rate Limit per RSS Message Drop Above 25 Percent Alert:

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate per RSS is detected above 25 percent of total transactions of that RSS, this metric will be pegged and corresponding alert will be raised.
Severity Major
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate per RSS detected above 25 Percent of Total Transactions of that RSS

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 25 < 50
OID 1.3.6.1.4.1.323.5.3.46.1.2.7013
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.12.4 Ingress RSS Rate Limit per RSS Message Drop Above 50 Percent Alert

Table 5-256 Ingress RSS Rate Limit per RSS Message Drop Above 50 Percent Alert

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate per RSS is detected above 50 percent of total transactions of that RSS, this metric will be pegged and corresponding alert will be raised.
Severity Critical
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate per RSS detected above 50 Percent of Total Transactions of that RSS

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 50
OID 1.3.6.1.4.1.323.5.3.46.1.2.7014
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.12.5 Ingress RSS Rate Limit Message Drop Above Point one Percent Alert

Table 5-257 Ingress RSS Rate Limit Message Drop Above Point one Percent Alert

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 0.1 percent of total transactions, this metric will be pegged and corresponding alert will be raised.
Severity Warning
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate detected above 0.1 Percent of Total Transaction

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (namespace) *100 >= 0.1 < 1
OID 1.3.6.1.4.1.323.5.3.46.1.2.7015
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.12.6 Ingress RSS Rate Limit Message Drop Above one Percent Alert

Table 5-258 Ingress RSS Rate Limit Message Drop Above one Percent Alert:

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 1 percent of total transactions, this metric will be pegged and corresponding alert will be raised.
Severity Warning
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate detected above 1 Percent of Total Transactions

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (namespace) *100 >= 1 < 10
OID 1.3.6.1.4.1.323.5.3.46.1.2.7016
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.12.7 Ingress RSS Rate Limit Message Drop Above 10 Percent Alert

Table 5-259 Ingress RSS Rate Limit Message Drop Above 10 Percent Alert

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 10 percent of total transactions, this metric will be pegged and corresponding alert will be raised.
Severity Minor
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate detected above 10 Percent of Total Transactions.

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 10 < 25
OID 1.3.6.1.4.1.323.5.3.46.1.2.7017
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.12.8 Ingress RSS Rate Limit Message Drop Above 25 Percent Alert

Table 5-260 Ingress RSS Rate Limit Message Drop Above 25 Percent Alert

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 25 percent of total transactions, this metric will be pegged and corresponding alert will be raised.
Severity Major
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate detected above 25 Percent of Total Transactions

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 25 < 50
OID 1.3.6.1.4.1.323.5.3.46.1.2.7018
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.12.9 Ingress RSS Rate Limit Message Drop Above 50 Percent Alert

Table 5-261 Ingress RSS Rate Limit Message Drop Above 50 Percent Alert

Trigger Condition If a request has to be dropped when all the tokens in the bucket are exhausted and drop rate is detected above 50 percent of total transactions, this metric will be pegged andcorresponding alert will be raised.
Severity Critical
Alert Details Provided

Summary:

Ingress RSS Based Rate Limiting Message Drop Rate detected above 50 Percent of Total Transactions

Expression:

sum(rate(oc_ingressgateway_rss_ratelimit_total{Status="dropped"}[5m])) by (Remote_SEPP_Set,namespace)/sum(rate(oc_ingressgateway_rss_ratelimit_total[5m])) by (Remote_SEPP_Set,namespace) *100 >= 50
OID 1.3.6.1.4.1.323.5.3.46.1.2.7019
Metric Name oc_ingressgateway_rss_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.13 Cat-0 SBI Message Schema Validation Alerts
5.3.2.13.1 SEPPN32fMessageValidationOnHeaderFailureMinorAlert

Table 5-262 SEPPN32fMessageValidationOnHeaderFailureMinorAlert

Field Details
Trigger Condition Message validation failed for request query parameters for 40 % of requests (on which message validation was applied) in last 2 minutes.
Severity minor
Alert Details Provided

Summary:

Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }}

Expression:

(sum(rate(ocsepp_message_validation_on_header_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 40 < 60

OID 1.3.6.1.4.1.323.5.3.46.1.2.4026
Metric Used ocsepp_message_validation_on_header_failure_total
Resolution The alerts gets cleared when the count is not between 40 to 60.
5.3.2.13.2 SEPPN32fMessageValidationOnHeaderFailureMajorAlert

Table 5-263 SEPPN32fMessageValidationOnHeaderFailureMajorAlert

Field Description
Trigger Condition Message validation failed for request query parameters for 60 % of requests(on which message validation was applied) in last 2 minutes.
Severity major
Alert Details Provided

Summary:

Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }}

Expression:

(sum(rate(ocsepp_message_validation_on_header_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 60 < 80

OID 1.3.6.1.4.1.323.5.3.46.1.2.4027
Metric Name ocsepp_message_validation_on_header_failure_total
Resolution The alerts gets cleared when the count is not between 60 to 80.Possible Resolutions:
  1. Check Logs or Metrics:

    Review the following metrics for message validation failures:

    • ocsepp_message_validation_on_body_failure
    • ocsepp_message_validation_on_header_failure
  2. To identify the Failing Resource URI and HTTP Method, do the following:
    • For request body validation failures, search for the text: "Message validation failed for request body for request"
    • For query parameter validation failures, search for: "Message validation failed for request query parameter(s) for request"
    • For more detailed information about logs, refer to Oracle Communications Cloud Native Core, Security Edge Protection Proxy Troubleshooting Guide.
  3. In CNC Console GUI, navigate to SEPP and select Security Countermeasure from the left-hand menu.
    • Click Cat 0 - SBI Message Schema Validation to open the Message Validation List.
    • Search for the relevant resource URI to retrieve the corresponding schema.
    • Compare the request body or query parameters against the schema to ensure the request complies with the schema. If necessary, update the schema to reflect the correct structure.
5.3.2.13.3 SEPPN32fMessageValidationOnHeaderFailureCriticalAlert

Table 5-264 SEPPN32fMessageValidationOnHeaderFailureCriticalAlert

Field Description
Trigger Condition Message validation failed for request query parameters for 80 % of requests(on which message validation was applied) in last 2 minutes.
Severity critical
Alert Details Provided

Summary:

Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }}

Expression:

(sum(rate(ocsepp_message_validation_on_header_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 80
OID 1.3.6.1.4.1.323.5.3.46.1.2.4028
Metric Name ocsepp_message_validation_on_header_failure_total
Resolution The alerts gets cleared when the count is not between 80 to 100.
5.3.2.13.4 SEPPN32fMessageValidationOnBodyFailureMinorAlert

Table 5-265 SEPPN32fMessageValidationOnBodyFailureMinorAlert

Field Description
Trigger Condition Message validation failed for request body for 40 % of requests(on which message validation was applied) in last 2 minutes.
Severity minor
Alert Details Provided

Summary:

Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }}

Expression:

(sum(rate(ocsepp_message_validation_on_body_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 40 < 60

OID 1.3.6.1.4.1.323.5.3.46.1.2.4029
Metric Name ocsepp_message_validation_on_body_failure_total
Resolution The alerts gets cleared when the count is not between 60 to 100.
5.3.2.13.5 SEPPN32fMessageValidationOnBodyFailureMajorAlert

Table 5-266 SEPPN32fMessageValidationOnBodyFailureMajorAlert

Field Details
Trigger Condition Message validation failed for request body for 60 % of requests(on which message validation was applied) in last 2 minutes.
Severity major
Alert Details Provided

Summary:

Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }}

Expression:

(sum(rate(ocsepp_message_validation_on_body_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 60 < 80
OID 1.3.6.1.4.1.323.5.3.46.1.2.4030
Metric Name ocsepp_message_validation_on_body_failure_total
Resolution The alerts gets cleared when the count is not between 80 to 100.
5.3.2.13.6 SEPPN32fMessageValidationOnBodyFailureCriticalAlert

Table 5-267 SEPPN32fMessageValidationOnBodyFailureCriticalAlert

Field Details
Trigger Condition Message validation failed for request body for 80 % of requests(on which message validation was applied) in last 2 minutes.
Severity critical
Alert Details Provided

Summary:

Namespace: {{ $labels.kubernetes_namespace }}, Podname: {{$labels.kubernetes_pod_name}}, App: {{ $labels.app }}, Nfinstanceid: {{ $labels.nfInstanceId }}

Expression:(sum(rate(ocsepp_message_validation_on_body_failure_total[2m])) by (app, pod, namespace, nf_instance_id) /sum(rate(ocsepp_message_validation_applied_total[2m])) by (app, pod, namespace, nf_instance_id))*100 >= 80

OID 1.3.6.1.4.1.323.5.3.46.1.2.4031
Metric Name ocsepp_message_validation_on_body_failure_total
Resolution The alerts gets cleared when the count is not between 80 to 100.
5.3.2.14 Cat-1 Service API Validation Alerts
5.3.2.14.1 SEPPN32fServiceApiValidationFailureAlert

Table 5-268 SEPPN32fServiceApiValidationFailureAlert

Trigger Condition Service API not in allowed list
Severity Major
Alert details provided Summary
N32f : Service API not in allowed list
Expression:
delta(ocsepp_security_service_api_failure_total[2m]) > 0
OID 1.3.6.1.4.1.323.5.3.46.1.2.4005
Metric Used ocsepp_security_service_api_failure_total
Resolution 1

This alert will be raised when there is difference of at least 1 between first and last data point in sample collected in last 2 minutes. Alert will be cleared after 2 minutes.

Possible Resolutions:

  1. Check the Resource URI + Method for which alert is raised.
  2. Verify the error_msg using "ocsepp_security_service_api_failure_total" metric and KPI.
  3. Fix or add configuration for the Resource URI + Method in Service API's and Allowed List.
Resolution 2

The alert gets cleared when the N32C Handshake is established after successful TCP connection to remote SEPP.

Steps:

The failure reason is present in the alert.

Possible Resolutions:

  1. Disable the Remote SEPP.
  2. Delete the Remote SEPP.
  3. Update and reinitiate Handshake.
5.3.2.15 Cat-2 Network ID Validation Alerts
5.3.2.15.1 SEPPN32fNetworkIDValidationHeaderFailureAlert

Table 5-269 SEPPN32fNetworkIDValidationHeaderFailureAlert

Field Details
Trigger Condition If Network ID Validation for Header fails, this metrics will be pegged and corresponding alert will be raised.
Severity Major
Alert details provided Summary: 'namespace: {{ $labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Network ID Validation has failed because {{ $labels.cause }}'

Expression:

sum(increase(ocsepp_network_id_validation_header_failure_total[2m]) >0 or (ocsepp_network_id_validation_header_failure_total unless ocsepp_network_id_validation_header_failure_total offset 2m )) by (namespace, remote_sepp_name, nf_instance_id, peer_fqdn, plmn_identifier, app, resource_uri, pod) > 0

OID 1.3.6.1.4.1.323.5.3.46.1.2.4011
Metric Used ocsepp_network_id_validation_header_failure_total
Resolution The alerts gets cleared when the count goes below 0.
5.3.2.15.2 SEPPN32fNetworkIDValidationBodyIEFailureAlert

Table 5-270 SEPPN32fNetworkIDValidationBodyIEFailureAlert

Field Details
Trigger Condition If Network ID Validation for Body fails, this metrics will be pegged and corresponding alert will be raised.
Severity Major
Alert details provided Summary:

'namespace: {{ $labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Network ID Body Validation has failed because {{ $labels.cause }}'

Expression:

sum(increase(ocsepp_network_id_validation_body_failure_total[2m]) >0 or (ocsepp_network_id_validation_body_failure_total unless ocsepp_network_id_validation_body_failure_total offset 2m )) by (namespace, remote_sepp_name, nf_instance_id, peer_fqdn, plmn_identifier, app, resource_uri, pod) > 0

OID 1.3.6.1.4.1.323.5.3.46.1.2.4012
Metric Used ocsepp_network_id_validation_body_failure_total
Resolution The alerts gets cleared when the count goes below 0.
5.3.2.16 Cat-3 Previous Location Check Alerts
5.3.2.16.1 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercent30to40

Table 5-271 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercent30to40

Trigger Condition When previous location check validation failure error is detected between 30 to 40 Percent of Total Transactions , this alert will be raised.
Severity Minor
Alert Details Provided

Summary

Previous location check validation failure detected between 30 to 40 Percent of Total Transactions

Expression

sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.3 and sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)<0.4
OID 1.3.6.1.4.1.323.5.3.46.1.2.4032
Metric Name ocsepp_previous_location_validation_failure_total
Resolution The alerts gets cleared when the previous location check validation failure error does not lie between 30 to 40 percent of total transactions.
5.3.2.16.2 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercent40to50

Table 5-272 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercent40to50

Trigger Condition When previous location check validation failure error is detected between 40 to 50 Percent of Total Transactions , this alert will be raised.
Severity Major
Alert Details Provided

Summary

Previous location check validation failure detected between 40 to 50 Percent of Total Transactions

Expression

sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.4 and sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)<0.5
OID 1.3.6.1.4.1.323.5.3.46.1.2.4033
Metric Name ocsepp_previous_location_validation_failure_total
Resolution The alerts gets cleared when the previous location check validation failure error does not lie between 40 to 50 percent of total transactions.
5.3.2.16.3 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercentAbove50

Table 5-273 SEPPPn32fPreviousLocationCheckValidationFailureAlertPercentAbove50

Trigger Condition When previous location check validation failure error is detected above 50 Percent of Total Transactions , this alert will be raised.
Severity Critical
Alert Details Provided

Summary

Previous location check validation failure detected above 50 Percent of Total Transactions

Expression

sum(rate(ocsepp_previous_location_validation_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.5"
OID 1.3.6.1.4.1.323.5.3.46.1.2.4034
Metric Name ocsepp_previous_location_validation_failure_total
Resolution The alerts gets cleared when the previous location check validation failure error does not lie above 50 percent of total transactions.
5.3.2.16.4 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercent30to40

Table 5-274 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercent30to40

Trigger Condition When previous location check exception failure is detected between 30 to 40 Percent of Total Transactions , this alert will be raised.
Severity Minor
Alert Details Provided

Summary

Previous location check exception failure detected between 30 to 40 Percent of Total Transactions

Expression

sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.3 and sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)<0.4
OID 1.3.6.1.4.1.323.5.3.46.1.2.4035
Metric Name ocsepp_previous_location_exception_failure_total
Resolution The alerts gets cleared when the previous location check exception failure does not lie between 30 to 40 percent of total transactions.
5.3.2.16.5 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercent40to50

Table 5-275 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercent40to50

Trigger Condition When previous location check exception failure error is detected between 40 to 50 Percent of Total Transactions , this alert will be raised.
Severity Major
Alert Details Provided

Summary

Previous location check exception failure detected between 40 to 50 Percent of Total Transactions

Expression

sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.4 and sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)<0.5
OID 1.3.6.1.4.1.323.5.3.46.1.2.4036
Metric Name ocsepp_previous_location_exception_failure_total
Resolution The alerts gets cleared when the previous location check exception failure error does not lie between 40 to 50 percent of total transactions.
5.3.2.16.6 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercentAbove50

Table 5-276 SEPPPn32fPreviousLocationCheckExceptionFailureAlertPercentAbove50

Trigger Condition When previous location check exception failure error is detected above 50 Percent of Total Transactions , this alert will be raised.
Severity Critical
Alert Details Provided

Summary

Previous location check exception failure detected above 50 Percent of Total Transactions

Expression

sum(rate(ocsepp_previous_location_exception_failure_total[2m]))by(namespace)/sum(rate(ocsepp_previous_location_validation_requests_total[2m]))by(namespace)>=0.5
OID 1.3.6.1.4.1.323.5.3.46.1.2.4037
Metric Name ocsepp_previous_location_exception_failure_total
Resolution The alerts gets cleared when the previous location check exception failure error does not lie above 50 percent of total transactions.
5.3.2.17 Rate Limiting for Egress Roaming Signaling per PLMN Alerts
5.3.2.17.1 Egress Request Rate Limit per PLMN Message Drop Above 10 Percent Alert

Table 5-277 Egress Request Rate Limit per PLMN Message Drop Above 10 Percent Alert

Trigger Condition If a request is dropped due to the tokens in the bucket are exhausted and drop rate per PLMN is detected above 10 percent of total transactions of that PLMN, oc_ingressgateway_plmn_egress_ratelimit_total metric will be pegged and corresponding alert will be raised.
Severity Minor
Alert Details Provided

Summary

Egress Rate Limiting Request Drop Rate detected per PLMN above 10 Percent of Total Transactions

Expression

sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[5m])) by (EgressRateLimitList,PLMN_ID,namespace)/sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total[5m])) by (EgressRateLimitList,PLMN_ID,namespace) *100 >= 10 < 25
OID 1.3.6.1.4.1.323.5.3.46.1.2.4039
Metric Name oc_ingressgateway_plmn_egress_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.17.2 Egress Request Rate Limit per PLMN Message Drop Above 25 Percent Alert

Table 5-278 Egress Request Rate Limit per PLMN Message Drop Above 25 Percent Alert

Trigger Condition If a request is dropped due to the tokens in the bucket are exhausted and drop rate per PLMN is detected above 25 percent of total transactions of that PLMN, oc_ingressgateway_plmn_egress_ratelimit_total metric will be pegged and corresponding alert will be raised.
Severity Major
Alert Details Provided

Summary

Egress Rate Limiting Request Drop Rate detected per PLMN above 25 Percent of Total Transactions

Expression

sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[5m])) by (EgressRateLimitList,PLMN_ID,namespace)/sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total[5m])) by (EgressRateLimitList,PLMN_ID,namespace) *100 >= 10 < 25
OID 1.3.6.1.4.1.323.5.3.46.1.2.4040
Metric Name oc_ingressgateway_plmn_egress_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.17.3 Egress Request Rate Limit per PLMN Message Drop Above 50 Percent Alert

Table 5-279 Egress Request Rate Limit per PLMN Message Drop Above 50 Percent Alert

Trigger Condition If a request is dropped due to the tokens in the bucket are exhausted and the drop rate per PLMN is detected above 50 percent of total transactions of that PLMN, oc_ingressgateway_plmn_egress_ratelimit_total metric will be pegged and corresponding alert will be raised.
Severity Critical
Alert Details Provided

Summary

Egress Rate Limiting Request Drop Rate detected per PLMN above 50 Percent of Total Transactions

Expression

sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total{Status="ERL_MATCH_NO_TOKEN_LOW_PRI_REJECT"}[5m])) by (EgressRateLimitList,PLMN_ID,namespace)/sum(rate(oc_ingressgateway_plmn_egress_ratelimit_total[5m])) by (EgressRateLimitList,PLMN_ID,namespace) *100 >= 50
OID 1.3.6.1.4.1.323.5.3.46.1.2.4041
Metric Name oc_ingressgateway_plmn_egress_ratelimit_total
Resolution The alerts gets cleared when the count goes down.
5.3.2.18 Separate Port Configurations for N32c and N32f on the Egress Routes Alerts
5.3.2.18.1 EgressInterfaceConnectionFailure

Table 5-280 EgressInterfaceConnectionFailure

Field Details
Trigger Condition If the destination host and port mentioned in the Remote profile are unreachable or not available, then the alert will be raised.
Severity Major
Alert Details Provided

Summary:

Egress connection failure on the interface

Expression:

sum(increase(oc_egressgateway_connection_failure_total{app="n32-egress-gateway"}[5m])) by (namespace,app,Host,Port) >0
OID 1.3.6.1.4.1.323.5.3.46.1.2.4042
Metric Name oc_egressgateway_connection_failure_total
Resolution If the destination host and port are reachable, then the alert will be cleared.

5.4 SEPP Alert Configuration

This section describes the Measurement based Alert rules configuration for SEPP. The Alert Manager uses the Prometheus measurements values as reported by microservices in conditions under alert rules to trigger alerts.

5.4.1 Configuring SEPP Alerts in OCI

The following procedure describes how to configure the SEPP alerts for OCI. The OCI supports metric expressions written in MQL (Metric Query Language) and thus, requires a new SEPP alert file for configuring alerts in OCI observability platform.

The following are the steps:

  1. Run the following command to extract the .zip file:
    unzip ocsepp_oci_alertrules_<version>.zip
    The ocsepp_oci and ocsepp_oci_resources folders are available in the zip file.

    Note:

    The zip file is available in the Scripts folder of CSAR package.
  2. Open the ocsepp_oci folder, in the notifications.tf file, update the parameter endpoint with the email id of the user.
  3. Open the ocsepp_oci_resources folder, in the notifications.tf file, update the parameter endpoint with the email id of the user.
  4. Log in to the OCI Console.

    Note:

    For more details about logging in to the OCI, refer to Signing In to the OCI Console.
  5. Open the navigation menu and select Developer Services. The Developer Services window appears on the right pane.
  6. Under the Developer Services, select Resource Manager.
  7. Under Resource Manager, select Stacks. The Stacks window appears.
  8. Click Create Stack.
  9. Select the default My Configuration radio button.
  10. Under Stack configuration, select the folder radio button and upload the ocsepp_oci folder.
  11. Enter the Name and Description and select the compartment.
  12. Select the latest Terraform version from the Terraform version drop-down.
  13. Click Next. The Edit Stack screen appears.
  14. Enter the required inputs to create the SEPP alerts or alarms and click Save and Run Apply.
  15. Verify that the alarms are created in the Alarm Definitions screen (OCI Console> Observability & Management> Monitoring>Alarm Definitions) provided.

    The required inputs are:

    • Alarms Configuration
      • Compartment Name - Choose name of compartment from the drop-down
      • Metric namespace - Metric namespace that the user provided while deploying OCI Adaptors.
      • Topic Name - Any user configurable name. Must contain fewer than 256 characters. Only alphanumeric characters plus hyphens (-) and underscores (_) are allowed.
      • Message Format - Keep it as ONS_OPTIMIZED. (This is pre-populated)
      • Alarm is_enabled - Keep it as True. (This is pre-populated)
  16. The steps 6 to 15 must be repeated for uploading the ocsepp_oci_resources folder. Here, Metric namespace will be pre-populated.

For more details, see Oracle Communications Cloud Native Core, OCI Adaptor Deployment Guide.

5.4.2 Configuring SEPP Alerts for OCCNE 1.8.x and Previous Versions

The following procedure describes how to configure the SEPP alerts for OCCNE version 1.8.x and previous versions:

  1. Run the following command to find the config map to configure alerts in the Prometheus server:
    kubectl get configmap -n <Namespace>

    where, <Namespace> is the prometheus server namespace used in helm install command.

  2. Run the following command to take backup of current config map of prometheus server:
    kubectl get configmaps <NAME>-server -o yaml -n <Namespace> > /tmp/tempConfig.yaml
    where, <Namespace> is the prometheus server namespace used in helm install command.
    For example, assuming chart name is "prometheus-alert", so "_NAME_-server" becomes "prometheus-alert-server", run the following command to find the config map:
    kubectl get configmaps prometheus-alert-server -o yaml -n prometheus-alert2 > /tmp/tempConfig.yaml
  3. Run the following command to check if alertssepp is present in the tempConfig.yaml file:
    cat /tmp/t_mapConfig.yaml  | grep alertssepp
  4. Run the following command to delete the alertssepp entry from the t_mapConfig.yaml file, if the alertssepp is present :
    sed -i '/etc\/config\/alertssepp/d' /tmp/t_mapConfig.yaml
    
  5. Run the following command to add the alertssepp entry in the t_mapConfig.yaml file, if the alertssepp is not present :
    sed -i '/rule_files:/a\    \- /etc/config/alertssepp'  /tmp/t_mapConfig.yaml
  6. Run the following command to reload the config map with the modifed file:
    kubectl replace configmap <Name> -f /tmp/t_mapConfig.yaml
  7. Run the following command to add seppAlertRules.yaml file into prometheus config map under filename of SEPP alert file :
    kubectl patch configmap <Name> -n <Namespace> --type merge --patch
    "$(cat <PATH>/seppAlertRules.yaml)"
  8. Restart prometheus-server pod.
  9. Verify the alerts in prometheus GUI.

Note:

Prometheus takes about 20 seconds to apply the updated Config
    map.

5.4.3 Configuring SEPP Alerts for OCCNE 1.9.x and Higher Versions

The following procedure describes how to configure the SEPP alerts for OCCNE 1.9.x and higher versions:

  1. Run the following command to apply the Prometheus rules Custom Resource Definition (CRD):
    kubectl apply -f <file_name> -n <sepp namespace>
    Where,
    • <file_name> is the SEPP alerts file
    • <sepp namespace> is the SEPP namespace
    Example:
    $ kubectl apply -f ocsepp_alerting_rules_promha.yaml -n seppsvc
  2. Run the following command to check if SEPP alert file is added to Prometheus rules:
    $ kubectl get prometheusrules --namespace <namespace> 
    Example:
    $ kubectl get prometheusrules --namespace seppsvc
  3. Log in to Prometheus GUI and verify the alerts section.

    Note:

    The Prometheus server takes an updated config map that is automatically reloaded after approximately 60 seconds. Refresh the Prometheus GUI to confirm that the SEPP alerts have been reloaded.