5 NEF Alerts
This chapter includes information about the following NEF alerts:
Note:
- The performance and capacity of the NEF system may vary based on the call model, feature or interface configuration, and underlying CNE and hardware environment.
- Due to the unavailability of metrics and/or MQL queries, the following alerts are not supported for OCI:
- OcnefNfStatusUnavailable
- OcnefPodsRestart
- OcnefIngressGatewayServiceDown
- OcnefApiRouterServiceDown
- OcnefFiveGcAgentServiceDown
- OcnefMonitoringEventServiceDown
- OcnefCCFClientServiceDown
- OcnefExpiryAuditorServiceDown
- OcnefQOSServiceDown
- OcnefTIServiceDown
- OcnefDTServiceDown
- OcnefEgressGatewayServiceDown
- OcnefMemoryUsageCrossedMinorThreshold
- OcnefMemoryUsageCrossedMajorThreshold
- FiveGcInvalidConfiguration
- OcnefAllSiteStatus
- OcnefDBReplicationStatus
5.1 System Level Alerts
This section lists the system level alerts for NEF.
5.1.1 OcnefNfStatusUnavailable
Table 5-1 OcnefNfStatusUnavailable
Field | Details |
---|---|
Description | 'NEF services unavailable' |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : All NEF services are unavailable." |
Severity | Critical |
Condition | All the NEF services are unavailable because the NEF is either being deployed or purged. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions | The alert is cleared automatically when the NEF services restart.
Steps:
|
5.1.2 OcnefPodsRestart
Table 5-2 OcnefPodsRestart
Field | Details |
---|---|
Description | 'Pod <Pod Name> has restarted.' |
Summary | "kubernetes_namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : A Pod has restarted" |
Severity | Major |
Condition | A pod belonging to any of the NEF services has restarted. |
Metric Used | kube_pod_container_status_restarts_total |
Recommended Actions |
The alert is cleared automatically if the specific pod is up. Steps:
|
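As a rough illustration of the condition above, a restart can be detected as an increase in a pod's restart counter between two scrapes of kube_pod_container_status_restarts_total. The following sketch is not the shipped alert rule: the function name, the dictionary layout, and the pod names are hypothetical and used for illustration only.

```python
# Illustrative sketch only; pod names and data layout are hypothetical.

def restarted_pods(previous, current):
    """Return pods whose restart counter grew between two scrapes.

    previous, current: dict mapping pod name -> cumulative restart count.
    Pods that are absent from the earlier scrape are not reported.
    """
    return sorted(
        pod
        for pod, count in current.items()
        if count > previous.get(pod, count)
    )


earlier = {"nef-apirouter-0": 0, "nef-qos-0": 2}
later = {"nef-apirouter-0": 1, "nef-qos-0": 2, "nef-dt-0": 0}
print(restarted_pods(earlier, later))
```

Comparing counters rather than reading the absolute value avoids reporting pods that restarted long before the observation window.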
5.1.3 OcnefTotalExternalIngressTrafficRateAboveMinorThreshold
Table 5-3 OcnefTotalExternalIngressTrafficRateAboveMinorThreshold
Field | Details |
---|---|
Description | OCNEF External Ingress traffic rate is above the configured minor threshold i.e. 800 TPS (current value is: {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Traffic rate is above 80 percent of max TPS (1000)" |
Severity | Minor |
Condition | The total NEF External Ingress traffic rate has crossed the configured minor threshold of 800 TPS.
The default value of this alert trigger point in the NefAlertrules alert file is 80% of 1000 (maximum ingress request rate). |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7003 |
Metric Used | oc_ingressgateway_http_requests_total |
Recommended Actions | The alert is cleared either when the total External Ingress traffic rate falls below the minor threshold or when the total traffic rate crosses the major threshold, in which case the OcnefTotalExternalIngressTrafficRateAboveMajorThreshold alert is raised.
Note: The threshold is configurable in the NefAlertrules alert file. Reassess why the NEF is receiving additional traffic. If this alert is unexpected, contact My Oracle Support. Steps:
|
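The relationship between the maximum ingress request rate and the minor trigger point can be sketched as follows. This is a minimal illustration and not the expression used in the NefAlertrules alert file: the function names, the two-sample rate calculation, and the sample values are assumptions. Only the 1000 TPS maximum and the 80% factor come from the table above.

```python
# Illustrative sketch only; not the expression from the NefAlertrules file.
MAX_INGRESS_TPS = 1000   # maximum ingress request rate stated in the table
MINOR_PERCENT = 80       # default minor trigger point


def per_second_rate(first_count, last_count, window_seconds):
    """Average per-second increase of a cumulative counter, such as
    oc_ingressgateway_http_requests_total, over a time window."""
    if window_seconds <= 0:
        raise ValueError("window_seconds must be positive")
    return (last_count - first_count) / window_seconds


def above_minor_threshold(rate_tps):
    """True when the rate exceeds the minor trigger point (800 TPS by default)."""
    return rate_tps > MAX_INGRESS_TPS * MINOR_PERCENT / 100


# Hypothetical samples: 102,000 additional requests in 120 seconds.
rate = per_second_rate(1_000_000, 1_102_000, 120)
print(rate, above_minor_threshold(rate))
```

If the maximum TPS or the percentage is changed in the alert file, the effective trigger point changes accordingly, for example 80% of 2000 would be 1600 TPS.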
5.1.4 OcnefTotalFivegcIngressTrafficRateAboveMinorThreshold
Table 5-4 OcnefTotalFivegcIngressTrafficRateAboveMinorThreshold
Field | Details |
---|---|
Description | OCNEF Fivegc Ingress traffic rate is above the configured minor threshold i.e. 800 TPS (current value is: {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Traffic rate is above 80 percent of max TPS (1000)" |
Severity | Minor |
Condition | The total NEF Fivegc Ingress traffic rate has crossed the configured minor threshold of 800 TPS.
The default value of this alert trigger point in the NefAlertrules alert file is 80% of 1000 (maximum ingress request rate). |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7004 |
Metric Used | oc_ingressgateway_http_requests_total |
Recommended Actions | The alert is cleared either when the total Fivegc Ingress traffic rate falls below the minor threshold or when the total traffic rate crosses the major threshold, in which case the OcnefTotalFivegcIngressTrafficRateAboveMajorThreshold alert is raised.
Note: The threshold is configurable in the NefAlertrules alert file. Reassess why the NEF is receiving additional traffic. If this alert is unexpected, contact My Oracle Support. Steps:
|
5.1.5 OcnefTotalExternalIngressTrafficRateAboveMajorThreshold
Table 5-5 OcnefTotalExternalIngressTrafficRateAboveMajorThreshold
Field | Details |
---|---|
Description | OCNEF External Ingress traffic rate is above the configured major threshold i.e. 900 TPS (current value is: {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Traffic rate is above 90 percent of max TPS (1000)" |
Severity | Major |
Condition | The total NEF External Ingress traffic rate has crossed the configured major threshold of 900 TPS.
The default value of this alert trigger point in the NefAlertrules alert file is 90% of 1000 (maximum ingress request rate). |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7005 |
Metric Used | oc_ingressgateway_http_requests_total |
Recommended Actions | The alert is cleared either when the total External Ingress traffic rate falls below the major threshold or when the total traffic rate crosses the critical threshold, in which case the OcnefTotalExternalIngressTrafficRateAboveCriticalThreshold alert is raised.
Note: The threshold is configurable in the NefAlertrules alert file. Reassess why the NEF is receiving additional traffic. If this alert is unexpected, contact My Oracle Support. Steps:
|
5.1.6 OcnefTotalFivegcIngressTrafficRateAboveMajorThreshold
Table 5-6 OcnefTotalFivegcIngressTrafficRateAboveMajorThreshold
Field | Details |
---|---|
Description | OCNEF Fivegc Ingress traffic rate is above the configured major threshold i.e. 900 TPS (current value is: {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Traffic rate is above 90 percent of max TPS (1000)" |
Severity | Major |
Condition | The total NEF Fivegc Ingress traffic rate has crossed the configured major threshold of 900 TPS.
The default value of this alert trigger point in the NefAlertrules alert file is 90% of 1000 (maximum ingress request rate). |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7006 |
Metric Used | oc_ingressgateway_http_requests_total |
Recommended Actions | The alert is cleared either when the total Fivegc Ingress traffic rate falls below the major threshold or when the total traffic rate crosses the critical threshold, in which case the OcnefTotalFivegcIngressTrafficRateAboveCriticalThreshold alert is raised.
Note: The threshold is configurable in the NefAlertrules alert file. Reassess why the NEF is receiving additional traffic. If this alert is unexpected, contact My Oracle Support. Steps:
|
5.1.7 OcnefTotalExternalIngressTrafficRateAboveCriticalThreshold
Table 5-7 OcnefTotalExternalIngressTrafficRateAboveCriticalThreshold
Field | Details |
---|---|
Description | OCNEF External Ingress traffic rate is above the configured critical threshold i.e. 950 TPS (current value is: {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Traffic Rate is above 95 percent of max TPS (1000)" |
Severity | Critical |
Condition | The total NEF External Ingress traffic rate has crossed the configured critical threshold of 950 TPS.
The default value of this alert trigger point in the NefAlertrules alert file is 95% of 1000 (maximum ingress request rate). |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7007 |
Metric Used | oc_ingressgateway_http_requests_total |
Recommended Actions | The alert is cleared when the total External Ingress traffic rate falls below the critical threshold.
Note: The threshold is configurable in the NefAlertrules alert file. Reassess why the NEF is receiving additional traffic. If this alert is unexpected, contact My Oracle Support. Steps:
|
5.1.8 OcnefTotalFivegcIngressTrafficRateAboveCriticalThreshold
Table 5-8 OcnefTotalFivegcIngressTrafficRateAboveCriticalThreshold
Field | Details |
---|---|
Description | OCNEF Fivegc Ingress traffic rate is above the configured critical threshold i.e. 950 TPS (current value is: {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Traffic Rate is above 95 percent of max TPS (1000)" |
Severity | Critical |
Condition | The total NEF Fivegc Ingress traffic rate has crossed the configured critical threshold of 950 TPS.
The default value of this alert trigger point in the NefAlertrules alert file is 95% of 1000 (maximum ingress request rate). |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7008 |
Metric Used | oc_ingressgateway_http_requests_total |
Recommended Actions | The alert is cleared when the total Fivegc Ingress traffic rate falls below the critical threshold.
Note: The threshold is configurable in the NefAlertrules alert file. Reassess why the NEF is receiving additional traffic. If this alert is unexpected, contact My Oracle Support. Steps:
|
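Taken together, the minor, major, and critical traffic alerts behave as escalating tiers: a lower tier is cleared when the rate crosses the next threshold, so only one tier is active at a time. The sketch below illustrates this reading with the default values from the tables above. The function name and the strict "greater than" comparison are assumptions, not details taken from the NefAlertrules alert file.

```python
# Illustrative sketch only; thresholds mirror the defaults listed in the tables.
MAX_INGRESS_TPS = 1000

# (percent of maximum TPS, severity), highest tier first
TRAFFIC_TIERS = [
    (95, "Critical"),
    (90, "Major"),
    (80, "Minor"),
]


def traffic_severity(rate_tps, max_tps=MAX_INGRESS_TPS):
    """Return the severity of the single traffic-rate tier that is active,
    or None when the rate does not exceed the minor threshold."""
    for percent, severity in TRAFFIC_TIERS:
        if rate_tps > max_tps * percent / 100:
            return severity
    return None


for sample in (750, 850, 920, 980):
    print(sample, traffic_severity(sample))
```

Checking the highest tier first means a rate of 980 TPS is reported only as Critical, which matches the clearing behavior described in the Recommended Actions rows.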
5.1.9 OcnefExternalIngressTransactionErrorRateAboveZeroPointOnePercent
Table 5-9 OcnefExternalIngressTransactionErrorRateAboveZeroPointOnePercent
Field | Details |
---|---|
Description | External Ingress transaction Error rate is above 0.1 percent(current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction Error rate detected above 0.1 percent of total transactions" |
Severity | Warning |
Condition | The number of failed external ingress transactions is above 0.1 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7009 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed external ingress transactions is below 0.1 percent of the total transactions or when the number of failed transactions crosses the 1% threshold, in which case the OcnefExternalIngressTransactionErrorRateAbove1Percent alert is raised.
Steps:
|
5.1.10 OcnefFivegcIngressTransactionErrorRateAboveZeroPointOnePercent
Table 5-10 OcnefFivegcIngressTransactionErrorRateAboveZeroPointOnePercent
Field | Details |
---|---|
Description | Fivegc Ingress transaction error rate is above 0.1 percent of total transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 0.1 percent of total transactions" |
Severity | Warning |
Condition | The number of failed Fivegc ingress transactions is above 0.1 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7010 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed Fivegc ingress transactions is below 0.1 percent of the total transactions or when the number of failed transactions crosses the 1% threshold, in which case the OcnefFivegcIngressTransactionErrorRateAbove1Percent alert is raised.
Steps:
|
5.1.11 OcnefExternalIngressTransactionErrorRateAbove1Percent
Table 5-11 OcnefExternalIngressTransactionErrorRateAbove1Percent
Field | Details |
---|---|
Description | External Ingress transaction error rate is above 1 percent of total transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 1 percent of total transactions" |
Severity | Warning |
Condition | The number of failed External Ingress transactions is above 1 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7011 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed External Ingress transactions is below 1 percent of the total transactions or when the number of failed transactions crosses the 10% threshold, in which case the OcnefExternalIngressTransactionErrorRateAbove10Percent alert is raised.
Steps:
|
5.1.12 OcnefFivegcIngressTransactionErrorRateAbove1Percent
Table 5-12 OcnefFivegcIngressTransactionErrorRateAbove1Percent
Field | Details |
---|---|
Description | Fivegc Ingress transaction error rate is above 1 percent of total Fivegc Ingress transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 1 percent of total transactions" |
Severity | Warning |
Condition | The number of failed Fivegc Ingress transactions is above 1 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7012 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed Fivegc Ingress transactions is below 1 percent of the total transactions or when the number of failed transactions crosses the 10% threshold, in which case the OcnefFivegcIngressTransactionErrorRateAbove10Percent alert is raised.
Steps:
|
5.1.13 OcnefExternalIngressTransactionErrorRateAbove10Percent
Table 5-13 OcnefExternalIngressTransactionErrorRateAbove10Percent
Field | Details |
---|---|
Description | External Ingress transaction error rate is above 10 percent of total External Ingress transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 10 percent of total transactions" |
Severity | Minor |
Condition | The number of failed External Ingress transactions is above 10 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7013 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed External Ingress transactions is below 10 percent of the total transactions or when the number of failed transactions crosses the 25% threshold, in which case the OcnefExternalIngressTransactionErrorRateAbove25Percent alert is raised.
Steps:
|
5.1.14 OcnefFivegcIngressTransactionErrorRateAbove10Percent
Table 5-14 OcnefFivegcIngressTransactionErrorRateAbove10Percent
Field | Details |
---|---|
Description | Fivegc Ingress transaction error rate is above 10 percent of total Fivegc Ingress transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 10 percent of total transactions" |
Severity | Minor |
Condition | The number of failed Fivegc Ingress transactions is above 10 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7014 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed Fivegc Ingress transactions is below 10 percent of the total transactions or when the number of failed transactions crosses the 25% threshold, in which case the OcnefFivegcIngressTransactionErrorRateAbove25Percent alert is raised.
Steps:
|
5.1.15 OcnefExternalIngressTransactionErrorRateAbove25Percent
Table 5-15 OcnefExternalIngressTransactionErrorRateAbove25Percent
Field | Details |
---|---|
Description | External Ingress transaction error rate detected above 25 percent of total External Ingress transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 25 percent of total transactions" |
Severity | Major |
Condition | The number of failed External Ingress transactions is above 25 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7015 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed External Ingress transactions is below 25 percent of the total transactions or when the number of failed transactions crosses the 50% threshold, in which case the OcnefExternalIngressTransactionErrorRateAbove50Percent alert is raised.
Steps:
|
5.1.16 OcnefFivegcIngressTransactionErrorRateAbove25Percent
Table 5-16 OcnefFivegcIngressTransactionErrorRateAbove25Percent
Field | Details |
---|---|
Description | Fivegc Ingress transaction error rate detected above 25 percent of total Fivegc Ingress transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 25 percent of total transactions" |
Severity | Major |
Condition | The number of failed Fivegc Ingress transactions is above 25 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7016 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed Fivegc Ingress transactions is below 25 percent of the total transactions or when the number of failed transactions crosses the 50% threshold, in which case the OcnefFivegcIngressTransactionErrorRateAbove50Percent alert is raised.
Steps:
|
5.1.17 OcnefExternalIngressTransactionErrorRateAbove50Percent
Table 5-17 OcnefExternalIngressTransactionErrorRateAbove50Percent
Field | Details |
---|---|
Description | External Ingress transaction error rate detected above 50 percent of total External Ingress transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 50 percent of total transactions" |
Severity | Critical |
Condition | The number of failed External Ingress transactions is above 50 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7017 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed External Ingress transactions is below 50 percent of the total transactions.
Steps:
|
5.1.18 OcnefFivegcIngressTransactionErrorRateAbove50Percent
Table 5-18 OcnefFivegcIngressTransactionErrorRateAbove50Percent
Field | Details |
---|---|
Description | Fivegc Ingress transaction error rate detected above 50 percent of total Fivegc Ingress transactions (current value is {{ $value }}) |
Summary | "timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }}: Transaction error rate detected above 50 percent of total transactions" |
Severity | Critical |
Condition | The number of failed Fivegc Ingress transactions is above 50 percent of the total transactions. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7018 |
Metric Used | oc_ingressgateway_http_responses_total |
Recommended Actions | The alert is cleared when the number of failed Fivegc Ingress transactions is below 50 percent of the total transactions.
Steps:
|
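The transaction error rate alerts in sections 5.1.9 through 5.1.18 form a ladder of five tiers per interface. The sketch below shows one way to map a failed and total transaction count to the tier that would be active. It assumes the counts have already been derived from oc_ingressgateway_http_responses_total; how a response is classified as failed is not specified here, and the function name and strict comparison are illustrative assumptions.

```python
# Illustrative sketch only; tier values are taken from the alert tables above.
ERROR_RATE_TIERS = [
    (50.0, "Above50Percent", "Critical"),
    (25.0, "Above25Percent", "Major"),
    (10.0, "Above10Percent", "Minor"),
    (1.0, "Above1Percent", "Warning"),
    (0.1, "AboveZeroPointOnePercent", "Warning"),
]


def error_rate_alert(failed, total, interface="External"):
    """Return (alert name, severity) for the active error-rate tier, or None.

    interface is "External" or "Fivegc", matching the alert name prefixes.
    """
    if total <= 0:
        return None  # no traffic, so no error rate can be computed
    percent = 100.0 * failed / total
    for threshold, suffix, severity in ERROR_RATE_TIERS:
        if percent > threshold:
            name = f"Ocnef{interface}IngressTransactionErrorRate{suffix}"
            return name, severity
    return None


print(error_rate_alert(3, 1000))
print(error_rate_alert(300, 1000, interface="Fivegc"))
```

Guarding against a zero total avoids a division error during periods with no ingress traffic.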
5.1.19 OcnefEgressGatewayServiceDown
Table 5-19 OcnefEgressGatewayServiceDown
Field | Details |
---|---|
Description | "NEF Egress-Gateway service {{$labels.app_kubernetes_io_name}} is down" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, podname: {{$labels.kubernetes_pod_name}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Egress-Gateway service down" |
Severity | Critical |
Condition | None of the pods of the Egress Gateway microservice is available. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the Egress Gateway service is available. Note: The threshold is configurable in the NefAlertrules alert file. Steps:
|
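The condition "none of the pods is available" can be read as: no pod of the service reports a value of 1 for the 'up' metric. The following sketch illustrates that reading only. The function name, the dictionary layout, and the pod names are hypothetical, and treating a service with no samples at all as down is an assumption.

```python
# Illustrative sketch only; pod names and data layout are hypothetical.

def service_down(up_samples):
    """Return True when no pod of the service is reported as up.

    up_samples: dict mapping pod name -> latest value of the 'up' metric
    (1 when the last scrape succeeded, 0 when it failed).
    """
    return not any(value == 1 for value in up_samples.values())


print(service_down({"egress-gw-0": 0, "egress-gw-1": 0}))
print(service_down({"egress-gw-0": 0, "egress-gw-1": 1}))
```

With this reading, a single healthy pod is enough to keep the service-down alert from being raised.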
5.1.20 OcnefMemoryUsageCrossedMinorThreshold
Table 5-20 OcnefMemoryUsageCrossedMinorThreshold
Field | Details |
---|---|
Description | "NEF Memory Usage for pod {{ $labels.pod }} has crossed the configured minor threshold (50%) (value={{ $value }}) of its limit." |
Summary | "namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Memory Usage of pod exceeded 50% of its limit." |
Severity | Minor |
Condition | A pod has reached the configured minor threshold (50%) of its memory resource limits. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7020 |
Metric Used | 'container_memory_usage_bytes' 'container_spec_memory_limit_bytes'
Note: This is a Kubernetes metric used for instance availability monitoring. If the metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions | The alert is cleared when the memory utilization falls below the minor threshold or crosses the major threshold, in which case the OcnefMemoryUsageCrossedMajorThreshold alert is raised.
Note: The threshold is configurable in the NefAlertrules alert file. In case the issue persists, capture all the outputs for the above steps and contact My Oracle Support. Note: Use CNC NF Data Collector tool for capturing logs. For more information on how to collect logs using Data Collector tool, see Oracle Communications Cloud Native Core, Network Function Data Collector User Guide. |
5.1.21 OcnefMemoryUsageCrossedMajorThreshold
Table 5-21 OcnefMemoryUsageCrossedMajorThreshold
Field | Details |
---|---|
Description | "NEF Memory Usage for pod {{ $labels.pod }} has crossed the configured major threshold (60%) (value = {{ $value }}) of its limit." |
Summary | "namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Memory Usage of pod exceeded 60% of its limit." |
Severity | Major |
Condition | A pod has reached the configured major threshold (60%) of its memory resource limits. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7021 |
Metric Used | 'container_memory_usage_bytes' 'container_spec_memory_limit_bytes'
Note: This is a Kubernetes metric used for instance availability monitoring. If the metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions | The alert is cleared when the memory utilization falls below the major threshold or crosses the critical threshold, in which case the OcnefMemoryUsageCrossedCriticalThreshold alert is raised.
Note: The threshold is configurable in the NefAlertrules alert file. In case the issue persists, capture all the outputs for the above steps and contact My Oracle Support. Note: Use CNC NF Data Collector tool for capturing logs. For more information on how to collect logs using Data Collector tool, see Oracle Communications Cloud Native Core, Network Function Data Collector User Guide. |
5.1.22 OcnefMemoryUsageCrossedCriticalThreshold
Table 5-22 OcnefMemoryUsageCrossedCriticalThreshold
Field | Details |
---|---|
Description | "NEF Memory Usage for pod {{ $labels.pod }} has crossed the configured major threshold (70%) (value = {{ $value }}) of its limit." |
Summary | "namespace: {{$labels.namespace}}, podname: {{$labels.pod}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Memory Usage of pod exceeded 70% of its limit." |
Severity | Critical |
Condition | A pod has reached the configured critical threshold (70%) of its memory resource limits. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7022 |
Metric Used | 'container_memory_usage_bytes' 'container_spec_memory_limit_bytes'
Note: This is a Kubernetes metric used for instance availability monitoring. If the metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions | The alert is cleared when the memory utilization falls below the critical threshold.
Note: The threshold is configurable in the NefAlertrules alert file. In case the issue persists, capture all the outputs for the above steps and contact My Oracle Support. Note: Use CNC NF Data Collector tool for capturing logs. For more information on how to collect logs using Data Collector tool, see Oracle Communications Cloud Native Core, Network Function Data Collector User Guide. |
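The three memory alerts compare a pod's memory usage with its configured limit. A minimal sketch of that ratio and of the default 50%, 60%, and 70% tiers is shown below. The function names and the handling of a zero limit (treated here as "no limit configured", so no alert) are assumptions and not details taken from the NefAlertrules alert file.

```python
# Illustrative sketch only; tier values are the defaults listed in the tables.
MEMORY_TIERS = [
    (70, "Critical"),
    (60, "Major"),
    (50, "Minor"),
]


def memory_usage_percent(usage_bytes, limit_bytes):
    """Usage as a percentage of the limit, or None when no limit is set."""
    if limit_bytes <= 0:
        return None
    return 100.0 * usage_bytes / limit_bytes


def memory_severity(usage_bytes, limit_bytes):
    """Return the severity of the active memory tier, or None."""
    percent = memory_usage_percent(usage_bytes, limit_bytes)
    if percent is None:
        return None
    for threshold, severity in MEMORY_TIERS:
        if percent > threshold:
            return severity
    return None


GIB = 1024 ** 3
print(memory_severity(1.1 * GIB, 2 * GIB))   # 55% of the limit
print(memory_severity(1.5 * GIB, 2 * GIB))   # 75% of the limit
```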
5.1.23 OcnefIngressGatewayServiceDown
Table 5-23 OcnefIngressGatewayServiceDown
Field | Details |
---|---|
Description | "NEF Ingress-Gateway service {{$labels.app_kubernetes_io_name}} is down" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Ingress-gateway service down" |
Severity | Critical |
Condition | None of the pods of the Ingress-Gateway microservice is available. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the Ingress Gateway service is available. Steps:
|
5.1.24 OcnefApiRouterServiceDown
Table 5-24 OcnefApiRouterServiceDown
Field | Details |
---|---|
Description | "NEF API Router service {{$labels.app_kubernetes_io_name}} is down" |
Summary | "namespace: {{$labels.namespace}}, podname: {{$labels.kubernetes_pod_name}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : ApiRouter service down" |
Severity | Critical |
Condition | The API Router service is down. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the NEF API Router service is available. Steps:
|
5.1.25 OcnefFiveGcAgentServiceDown
Table 5-25 OcnefFiveGcAgentServiceDown
Field | Details |
---|---|
Description | "NEF FiveGc Agent service down {{$labels.app_kubernetes_io_name}} is down" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : FiveGc Agent service down" |
Severity | Critical |
Condition | The 5GC Agent service is down. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the NEF 5GC Agent service is available. Steps:
|
5.1.26 OcnefMonitoringEventServiceDown
Table 5-26 OcnefMonitoringEventServiceDown
Field | Details |
---|---|
Description | "NEF MonitoringEvent service {{$labels.app_kubernetes_io_name}} is down" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : MonitoringEvent service down" |
Severity | Critical |
Condition | The Monitoring Event (ME) service is down. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the NEF Monitoring Event (ME) service is available. Steps:
|
5.1.27 OcnefCCFClientServiceDown
Table 5-27 OcnefCCFClientServiceDown
Field | Details |
---|---|
Description | "NEF CCFClient service {{$labels.app_kubernetes_io_name}} is down" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : CCFClient service down" |
Severity | Critical |
Condition | The CCF Client service is down. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the NEF CCF Client service is available. Steps:
|
5.1.28 OcnefExpiryAuditorServiceDown
Table 5-28 OcnefExpiryAuditorServiceDown
Field | Details |
---|---|
Description | "NEF Expiry Auditor service {{$labels.app_kubernetes_io_name}} is down" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Expiry Auditor service down" |
Severity | Critical |
Condition | The expiry auditor service is down. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the NEF Expiry Auditor service is available. Steps:
|
5.1.29 OcnefQOSServiceDown
Table 5-29 OcnefQOSServiceDown
Field | Details |
---|---|
Description | "NEF QOS service {{$labels.app_kubernetes_io_name}} is down" |
Summary | namespace: {{$labels.namespace}}, podname: {{$labels.kubernetes_pod_name}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : QOS service down |
Severity | Critical |
Condition | The QoS service is down. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the NEF QoS service is available. Steps:
|
5.1.30 OcnefDTServiceDown
Table 5-30 OcnefDTServiceDown
Field | Details |
---|---|
Description | "OCNEF Device Trigger service {{$labels.app_kubernetes_io_name}} is down" |
Summary | "namespace: {{$labels.namespace}}, podname: {{$labels.kubernetes_pod_name}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : DT service down" |
Severity | Critical |
Condition | The Device Trigger service is down. |
Metric Used | 'up'
Note: This is a Prometheus metric used for instance availability monitoring. If this metric is not available, use a similar metric as exposed by the monitoring system. |
Recommended Actions |
The alert is cleared when the NEF DT service is available. Steps:
|
5.2 Application Level Alerts
This section lists the application level alerts for NEF.
5.2.1 MENotificationFailureRateCrossedThreshold
Table 5-31 MENotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | "Failure Rate of Delete ME Notifications Is Crossing the Threshold (10%)" |
Summary | "namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Failure Rate Of delete ME Subscriptions requests is above 10 percent of total requests." |
Severity | Error |
Condition | The failure rate of the DELETE Monitoring Event notification requests exceeds the threshold value (10%). |
Metric Used | ocnef_me_af_resp_total |
Recommended Actions |
The alert is cleared when the failure rate of DELETE ME notification requests is below the threshold. Steps:
|
5.2.2 MEDeleteSubscriptionFailureRateCrossedThreshold
Table 5-32 MEDeleteSubscriptionFailureRateCrossedThreshold
Field | Details |
---|---|
Description | "Failure Rate of Delete ME Subscriptions Is Crossing the Threshold (10%)" |
Summary | "namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Failure Rate Of delete ME Subscriptions requests is above 10 percent of total requests." |
Severity | Error |
Condition | The failure rate of the Monitoring Event subscription deletion requests exceeds the threshold value (10%). |
Metric Used | ocnef_me_af_resp_total |
Recommended Actions |
The alert is cleared when the failure rate of Monitoring Event subscription deletion requests is below the threshold. Steps:
|
5.2.3 MEAddSubscriptionFailureRateCrossedThreshold
Table 5-33 MEAddSubscriptionFailureRateCrossedThreshold
Field | Details |
---|---|
Description | "Failure Rate of ME Subscriptions Is Crossing the Threshold (10%)" |
Summary | "namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Failure Rate Of ME Subscriptions requests is above 10 percent of total requests." |
Severity | Error |
Condition | The failure rate of the Monitoring Event subscription requests exceeds the threshold value (10% of total requests). |
Metric Used | ocnef_me_af_resp_total |
Recommended Actions |
The alert is cleared when the failure rate of Monitoring Event subscription requests is below the threshold. Steps:
|
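The failure-rate alerts in this section compare failed responses with total responses. As a rough illustration only, the following Python sketch shows that arithmetic for a 10% threshold. It assumes the counts come from two readings of a cumulative counter such as ocnef_me_af_resp_total. The CounterSample type, its field names, and the way failed responses are identified are hypothetical and are not taken from the NEF alert rules.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CounterSample:
    """Hypothetical snapshot of cumulative response counts at one instant."""
    total: float   # all responses counted so far
    failed: float  # responses counted so far that indicate a failure


def failure_rate_percent(start: CounterSample, end: CounterSample) -> float:
    """Return failed responses as a percentage of all responses in the window."""
    total_delta = end.total - start.total
    failed_delta = end.failed - start.failed
    if total_delta <= 0:
        # No traffic in the window (or the counter was reset):
        # report 0 instead of dividing by zero.
        return 0.0
    return 100.0 * failed_delta / total_delta


def crosses_threshold(start: CounterSample, end: CounterSample,
                      threshold_percent: float = 10.0) -> bool:
    """True when the failure rate in the window is above the threshold."""
    return failure_rate_percent(start, end) > threshold_percent


if __name__ == "__main__":
    before = CounterSample(total=1000, failed=40)
    after = CounterSample(total=1200, failed=70)
    print(failure_rate_percent(before, after))
    print(crosses_threshold(before, after))
```

The sketch works on the difference between two readings because a cumulative counter only ever grows. A rate taken from the raw totals would include all traffic since startup rather than the recent window.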
5.2.4 AEFApiRouterOauthValidationFailureRateCrossedThreshold
Table 5-34 AEFApiRouterOauthValidationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | "Failure Rate of API Router Oauth Validation Is Crossing the Threshold (10%)" |
Summary | "{{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Failure Rate Of Oauth Validation is above 10 percent of total requests." |
Severity | Error |
Condition | The failure rate of the OAuth validations at API Router exceeds the threshold value (10% of total requests). |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7032 |
Metric Used | ocnef_aef_apirouter_resp_total |
Recommended Actions |
The alert is cleared when the failure rate of OAuth validations at API Router is below the threshold. Steps:
|
5.2.5 FiveGcInvalidConfiguration
Table 5-35 FiveGcInvalidConfiguration
Field | Details |
---|---|
Description | "Invalid Configuration For Five GC Service" |
Summary | "namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Invalid Configuration For Five GC Service." |
Severity | Error |
Condition | Invalid configuration of the 5GCAgent service. |
Metric Used | ocnef_5gc_invalid_config |
Recommended Actions |
The alert is cleared when the 5GCAgent service configuration is valid. Steps:
|
5.2.6 MENotificationFailureRateCrossedThreshold
Table 5-36 MENotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | "Failure Rate of Delete ME Notifications Is Crossing the Threshold (10%)" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Failure Rate Of delete ME Subscriptions requests is above 10 percent of total requests." |
Severity | Error |
Condition | The failure rate of the DELETE Monitoring Event notification requests exceeds the threshold value (10% of total requests). |
Metric Used | ocnef_efc_svc_me_notification_resp_total |
Recommended Actions |
The alert is cleared when the failure rate of DELETE ME notification requests is below the threshold. Steps:
|
5.2.7 MENotificationFailureRateCrossedThreshold
Table 5-37 MENotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | "Failure Rate of Delete ME Notifications Is Crossing the Threshold (10%)" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Failure Rate Of delete ME Subscriptions requests is above 10 percent of total requests." |
Severity | Error |
Condition | The failure rate of the DELETE Monitoring Event notification requests exceeds the threshold value (10% of total requests). |
Metric Used | ocnef_efc_svc_me_notification_resp_total |
Recommended Actions |
The alert is cleared when the failure rate of DELETE ME notification requests is below the threshold. Steps:
|
5.2.8 MENotificationFailureRateCrossedThreshold
Table 5-38 MENotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | "Failure Rate of Delete ME Notifications Is Crossing the Threshold (10%)" |
Summary | "kubernetes_namespace: {{$labels.kubernetes_namespace}}, timestamp: {{ with query \"time()\" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Failure Rate Of delete ME Subscriptions requests is above 10 percent of total requests." |
Severity | Error |
Condition | The failure rate of the DELETE Monitoring Event notification requests exceeds the threshold value (10% of total requests). |
Metric Used | ocnef_efc_svc_me_notification_resp_total |
Recommended Actions |
The alert is cleared when the failure rate of DELETE ME notification requests is below the threshold. Steps:
|
5.2.9 OcnefAllSiteStatus
Table 5-39 OcnefAllSiteStatus
Field | Details |
---|---|
Description | "Alert for any NEF sites status if SUSPENDED in Georedundant setup" |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Alert for any NEF sites status if SUSPENDED in Georedundant setup |
Severity | Error |
Condition | An NEF site of a georedundant deployment is in Suspended state. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7040 |
Metric Used | ocnef_all_site_status |
Recommended Actions |
The alert is cleared when all the sites in a georedundant deployment are UP.
|
5.2.10 OcnefDBReplicationStatus
Table 5-40 OcnefDBReplicationStatus
Field | Details |
---|---|
Description | "Alert for NEF sites status if DB Replication down in Georedundant setup" |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} : Alert for NEF sites status if DB Replication down in Georedundant setup |
Severity | Error |
Condition | The database replication channel status between the given site and the georedundant site(s) is inactive. The alert is raised per replication channel. The alarm is raised or cleared only if the georedundancy feature is enabled. |
OID | 1.3.6.1.4.1.323.5.3.39.1.2.7041 |
Metric Used | ocnef_db_replication_status |
Recommended Actions | The alert is cleared when the database channel replication status between the given site and the georedundant site(s) is UP. For more information on how to check the database replication status, see Oracle Communications Cloud Native Core, cnDBTier User Guide. |
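The condition above has two parts: the alert is evaluated per replication channel, and only when the georedundancy feature is enabled. The following Python sketch illustrates that logic under stated assumptions. The function name, the channel names, and the boolean active/inactive representation are hypothetical. The actual encoding of ocnef_db_replication_status is not described in this section.

```python
from typing import Dict, List


def channels_to_alert(georedundancy_enabled: bool,
                      channel_active: Dict[str, bool]) -> List[str]:
    """Return the replication channels that would raise an alert.

    channel_active maps a (hypothetical) channel name to True when the
    channel is active and False when it is inactive.
    """
    if not georedundancy_enabled:
        # Without georedundancy the alert is neither raised nor cleared.
        return []
    # One alert per inactive channel, sorted for a stable result.
    return sorted(name for name, active in channel_active.items() if not active)


if __name__ == "__main__":
    status = {"site1-to-site2": True, "site1-to-site3": False}
    print(channels_to_alert(True, status))
    print(channels_to_alert(False, status))
```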
5.2.11 CHFAddChargingDataRequestFailureRateCrossedErrorThreshold
Table 5-41 CHFAddChargingDataRequestFailureRateCrossedErrorThreshold
Field | Details |
---|---|
Description | Failure rate of CHF Create Charging Data request is crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of CHF Create Charging Data request is above 10 percent of total requests. |
Metric Used | ocnef_chf_qos_resp_total |
Recommended Actions | The alert is cleared when the failure rate of CHF requests is below the failure threshold.
Steps:
|
5.2.12 CHFAddChargingDataRequestFailureRateCrossedCriticalThreshold
Table 5-42 CHFAddChargingDataRequestFailureRateCrossedCriticalThreshold
Field | Details |
---|---|
Description | Failure rate of CHF Create Charging Data request is crossing the threshold (25%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | critical |
Condition | Failure rate of CHF Create Charging Data request is above 25 percent of total requests. |
Metric Used | ocnef_chf_qos_resp_total |
Recommended Actions | The alert is cleared when the failure rate of CHF requests is below the failure threshold.
Steps:
|
5.2.13 CHFAddChargingDataRequestFailureRateCrossedMinorThreshold
Table 5-43 CHFAddChargingDataRequestFailureRateCrossedMinorThreshold
Field | Details |
---|---|
Description | Failure rate of CHF Create Charging Data request is crossing the threshold (5%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | minor |
Condition | Failure rate of CHF Create Charging Data request is above 5 percent of total requests. |
Metric Used | ocnef_chf_qos_resp_total |
Recommended Actions | The alert is cleared when the failure rate of CHF requests is below the failure threshold.
Steps:
|
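The three CHF alerts above share one metric and differ only in threshold (5%, 10%, 25%) and severity. As an illustration, the Python sketch below maps a failure rate to the highest severity whose threshold it exceeds. The function and constant names are hypothetical. Whether the lower-severity alerts remain active at the same time depends on the alert rules, which this section does not show.

```python
from typing import Optional

# Thresholds and severities from the three CHF alert tables, highest first.
CHF_THRESHOLDS = (
    ("critical", 25.0),
    ("error", 10.0),
    ("minor", 5.0),
)


def highest_severity(failure_rate_percent: float) -> Optional[str]:
    """Return the highest severity whose threshold is exceeded, or None."""
    for severity, threshold in CHF_THRESHOLDS:
        # "Above N percent" is treated as a strict comparison.
        if failure_rate_percent > threshold:
            return severity
    return None


if __name__ == "__main__":
    for rate in (3.0, 7.0, 12.0, 30.0):
        print(rate, highest_severity(rate))
```

Listing the thresholds from highest to lowest lets the first match win, so a 30% failure rate is reported as critical rather than minor.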
5.2.14 MeEPCAddSubscriptionFailureRateCrossedThreshold
Table 5-44 MeEPCAddSubscriptionFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of ME subscriptions to EPC crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of ME EPC subscription requests is above 10 percent of total requests. |
Metric Used | ocnef_me_epc_sub_total |
Recommended Actions | The alert is cleared when the failure rate of ME EPC subscription requests is below the failure threshold.
Steps:
|
5.2.15 DiameterGwT6InvocationFailureRateCrossedThreshold
Table 5-45 DiameterGwT6InvocationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of Diameter Gateway T6x Invocation requests crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of Diameter Gateway T6x Invocation requests is above the threshold (10%). |
Metric Used | ocnef_diamgw_diam_resp_total |
Recommended Actions | The alert is cleared when the failure rate of Diameter Gateway T6x Invocation requests is below the failure threshold.
Steps:
|
5.2.16 DiameterGwT4InvocationFailureRateCrossedThreshold
Table 5-46 DiameterGwT4InvocationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of Diameter Gateway T4 Invocation requests crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of Diameter Gateway T4 Invocation requests is above the threshold (10%). |
Metric Used | ocnef_diamgw_diam_resp_total |
Recommended Actions | The alert is cleared when the failure rate of Diameter Gateway T4 Invocation requests is below the failure threshold.
Steps:
|
5.2.17 DiameterGwRxInvocationFailureRateCrossedThreshold
Table 5-47 DiameterGwRxInvocationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of Diameter Gateway Rx Invocation requests crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of Diameter Gateway Rx Invocation requests is above the threshold (10%). |
Metric Used | ocnef_diamgw_diam_resp_total |
Recommended Actions | The alert is cleared when the failure rate of Diameter Gateway Rx Invocation requests is below the failure threshold.
Steps:
|
5.2.18 DiameterGwSgdT4InvocationFailureRateCrossedThreshold
Table 5-48 DiameterGwSgdT4InvocationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of Diameter Gateway SgdT4 Invocation requests crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of Diameter Gateway SgdT4 Invocation requests is above the threshold (10%). |
Metric Used | ocnef_diamgw_diam_resp_total |
Recommended Actions | The alert is cleared when the failure rate of Diameter Gateway SgdT4 Invocation requests is below the failure threshold.
Steps:
|
5.2.19 DiameterGwT6NotificationFailureRateCrossedThreshold
Table 5-49 DiameterGwT6NotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of Diameter Gateway T6x Notification requests crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of Diameter Gateway T6x Notification requests is above the threshold (10%). |
Metric Used | ocnef_diamgw_diam_resp_total |
Recommended Actions | The alert is cleared when the failure rate of Diameter Gateway T6x Notification requests is below the failure threshold.
Steps:
|
5.2.20 DiameterGwT4NotificationFailureRateCrossedThreshold
Table 5-50 DiameterGwT4NotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of Diameter Gateway T4 Notification requests crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of Diameter Gateway T4 Notification requests is above the threshold (10%). |
Metric Used | ocnef_diamgw_diam_resp_total |
Recommended Actions | The alert is cleared when the failure rate of Diameter Gateway T4 Notification requests is below the failure threshold.
Steps:
|
5.2.21 DiameterGwRxNotificationFailureRateCrossedThreshold
Table 5-51 DiameterGwRxNotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of Diameter Gateway Rx Notification requests crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of Diameter Gateway Rx Notification requests is above the threshold (10%). |
Metric Used | ocnef_diamgw_diam_resp_total |
Recommended Actions | The alert is cleared when the failure rate of Diameter Gateway Rx Notification requests is below the failure threshold.
Steps:
|
5.2.22 DiameterGwSgdT4NotificationFailureRateCrossedThreshold
Table 5-52 DiameterGwSgdT4NotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of Diameter Gateway SgdT4 Notification requests crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure rate of Diameter Gateway SgdT4 Notification requests is above the threshold (10%). |
Metric Used | ocnef_diamgw_diam_resp_total |
Recommended Actions | The alert is cleared when the failure rate of Diameter Gateway SgdT4 Notification requests is below the failure threshold.
Steps:
|
5.2.23 DiameterGwT6TranslationFailureRateCrossedThreshold
Table 5-53 DiameterGwT6TranslationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure Rate of T6x Translations In Diameter GW is above 10 percent of total requests. |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure Rate of T6x Translations In Diameter GW is above 10 percent of total requests. |
Metric Used | ocnef_diamgw_translator_request_total |
Recommended Actions | The alert is cleared when the failure rate of T6x translations in Diameter GW is below the failure threshold.
Steps:
|
5.2.24 DiameterGwT4TranslationFailureRateCrossedThreshold
Table 5-54 DiameterGwT4TranslationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure Rate of T4 Translations In Diameter GW is above 10 percent of total requests. |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure Rate of T4 Translations In Diameter GW is above 10 percent of total requests. |
Metric Used | ocnef_diamgw_translator_request_total |
Recommended Actions | The alert is cleared when the failure rate of T4 translations in Diameter GW is below the failure threshold.
Steps:
|
5.2.25 DiameterGwRxTranslationFailureRateCrossedThreshold
Table 5-55 DiameterGwRxTranslationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure Rate of Rx Translations In Diameter GW is above 10 percent of total requests. |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure Rate of Rx Translations In Diameter GW is above 10 percent of total requests. |
Metric Used | ocnef_diamgw_translator_request_total |
Recommended Actions | The alert is cleared when the failure rate of Rx translations in Diameter GW is below the failure threshold.
Steps:
|
5.2.26 DiameterGwSgdT4TranslationFailureRateCrossedThreshold
Table 5-56 DiameterGwSgdT4TranslationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure Rate of SgdT4 Translations in Diameter GW is above 10 percent of total requests. |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | error |
Condition | Failure Rate of SgdT4 Translations in Diameter GW is above 10 percent of total requests. |
Metric Used | ocnef_diamgw_translator_request_total |
Recommended Actions | The alert is cleared when the failure rate of SgdT4 translations in Diameter GW is below the failure threshold.
Steps:
|
5.2.27 MSISDNLessMoSMSRequestFailureRateCrossedCriticalThreshold
Table 5-57 MSISDNLessMoSMSRequestFailureRateCrossedCriticalThreshold
Field | Details |
---|---|
Description | Failure rate of MSISDNLess MO SMS notification request is crossing the threshold (25%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | Critical |
Condition | Failure rate of MSISDNLess MO SMS request is above 25 percent of total requests. |
Metric Used | ocnef_msisdnless_mo_sms_diamgw_notify_resp_total |
Recommended Actions | The alert is cleared when the failure rate of notification requests is below the failure threshold.
Steps:
|
5.2.28 MSISDNLessMoSMSRequestFailureRateCrossedMajorThreshold
Table 5-58 MSISDNLessMoSMSRequestFailureRateCrossedMajorThreshold
Field | Details |
---|---|
Description | Failure rate of MSISDNLess MO SMS notification request is crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | Major |
Condition | Failure rate of MSISDNLess MO SMS request is above 10 percent of total requests. |
Metric Used | ocnef_msisdnless_mo_sms_diamgw_notify_resp_total |
Recommended Actions | The alert is cleared when the failure rate of notification requests is below the failure threshold.
Steps:
|
5.2.29 MSISDNLessMoSMSRequestFailureRateCrossedMinorThreshold
Table 5-59 MSISDNLessMoSMSRequestFailureRateCrossedMinorThreshold
Field | Details |
---|---|
Description | Failure rate of MSISDNLess MO SMS notification request is crossing the threshold (5%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | Minor |
Condition | Failure rate of MSISDNLess MO SMS request is above 5 percent of total requests. |
Metric Used | ocnef_msisdnless_mo_sms_diamgw_notify_resp_total |
Recommended Actions | The alert is cleared when the failure rate of notification requests is below the failure threshold.
Steps:
|
5.2.30 MSISDNLessMoSMSShortCodeConfigMatchFailure
Table 5-60 MSISDNLessMoSMSShortCodeConfigMatchFailure
Field | Details |
---|---|
Description | Failure when shortcode configured doesn't match the shortcode from incoming request. |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | Error |
Condition | Failure when shortcode configured doesn't match the shortcode from SMSSC. |
Metric Used | ocnef_diamgw_http_resp_total |
Recommended Actions | The alert is cleared when the failure rate of notification requests is below the failure threshold.
Steps:
|
5.2.31 QOSAddSubscriptionFailureRateCrossedThreshold
Table 5-61 QOSAddSubscriptionFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of QoS subscriptions is crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | Error |
Condition | Failure rate of QoS subscription requests is above 10 percent of total requests. |
Metric Used | ocnef_qos_af_resp_total |
Recommended Actions | The alert is cleared when the failure rate of subscription requests is below the failure threshold.
Steps:
|
5.2.32 QOSDeleteSubscriptionFailureRateCrossedThreshold
Table 5-62 QOSDeleteSubscriptionFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of delete QoS subscriptions is crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | Error |
Condition | Failure rate of QoS subscription deletion requests is above 10 percent of total requests. |
Metric Used | ocnef_qos_af_resp_total |
Recommended Actions | The alert is cleared when the failure rate of subscription deletion requests is below the failure threshold.
Steps:
|
5.2.33 QOSNotificationFailureRateCrossedThreshold
Table 5-63 QOSNotificationFailureRateCrossedThreshold
Field | Details |
---|---|
Description | Failure rate of QoS notifications is crossing the threshold (10%). |
Summary | namespace: {{$labels.namespace}}, timestamp: {{ with query "time()" }}{{ . | first | value | humanizeTimestamp }}{{ end }} |
Severity | Error |
Condition | Failure rate of QoS notification requests is above 10 percent of total requests. |
Metric Used | ocnef_qos_5g_resp_total |
Recommended Actions | The alert is cleared when the failure rate of notification requests is below the failure threshold.
Steps:
|