3.3.3 Configuring Service Communication Proxy Alert using the SCPAlertrules.yaml file
Note:
Default NameSpace is scpsvc for Service Communication Proxy. You can update the NameSpace as per the deployment.To access the scpAlertsrules_<scp release
number>.yaml file from the Scripts folder of
ocscp_csar_25_1_1_0_0_0.zip, download the SCP package from
My Oracle Support as described in "Downloading
the SCP Package " in Oracle Communications Cloud Native Core, Service Communication Proxy
Installation, Upgrade, and Fault Recovery Guide.
Alerts Details
Description and summary for alerts are added by the Prometheus alert manager.
Alerts are supported for three different resources/routing crosses
threshold.
- SCPIngress Traffic Rate
Above Threshold
- Has three threshold level Minor (above 9800 mps to 11200 mps), Major (11200 to 13300 mps), Critical (above 13300 mps). These values are configurable.
- In the description, information is presented similar to: "Ingress Traffic Rate at Locality: <Locality of scp> is above <threshold level (minor/major/critical> threshold (i.e. <value of threshold>)"
- In Summary: "Namespace: <Namespace of scp
deployment that Locality>, Pod: <SCP-worker Pod name>:
Current Ingress Traffic Rate is <Current rate of Ingress traffic
> mps which is above 70 Percent of Max MPS(<upper limit of
ingress traffic rate per pod>)"
Note:
Ingress traffic rate is per scp-worker pod in a namespace at particular SCP-Locality. Currently, 14000mps is the upper limit for per scp-worker pod.
- SCP Routing Failed For
Service
- It alerts for which NF Service Type and NF Type at particular locality, Routing failed
- Description: "Routing failed for service"
- Summary: "Routing failed for service: NFService
Type = <Message NF Service Type>, NFType = <Message NF Type>, Locality =
<SCP Locality where Routing Failed> and value = <Accumulated failure till
now, of such message for NFType and NFService Type>"
Note:
The value field currently does not provide the number of failures in particular time interval, instead it provides the total number of Routing failures.
- SCP Pod Memory Usage: Type of alert is
SCPWorkerPodMemoryUsage.
- Pod memory usage for SCP Pods (Soothsayer and Worker) deployed at a particular node instance is provided.
- The Soothsayer pod threshold is 8 GB
- The Worker pod threshold is 16 Gi
- Summary: Instance: "<Node Instance name>, NameSpace: <Namespace of SCP deployment>, Pod: <(Soothsayer/Worker) Pod name>: <Soothsayer/Worker> Pod High Memory usage detected"
- Summary: "Instance: "<Node Instance name>, Namespace: <Namespace of SCP deployment>, Pod: <(Soothsayer/Worker) Pod name>: Memory usage is above <threshold value>G (current value is: <current value of memory usage>)"