10 CNC Policy KPIs
This section provides information about Key Performance Indicators (KPIs) used for Cloud Native Core Policy.
Note:
Sample CNC Policy dashboard for Grafana is delivered to the customer through CNC Policy Custom Templates. The metrics and functions used to achieve KPIs are covered in CNC Policy Custom Templates as well.CPU and Memory Usage
KPI Name | KPI Details | Metric used for KPI |
---|---|---|
POD-Count | Measures the number of PCF SM Service pods available in the system. | count(container_memory_usage_bytes{container_name='pcf-smservice',namespace=\"$namespace\"}) |
POD-Count | Measures the number of PCF User Service pods available in the system. | count(container_memory_usage_bytes{container_name='pcf-userservice',namespace=\"$namespace\"}) |
POD-Count | Measures the number of PRE pods available in the system. | count(container_memory_usage_bytes{container_name='ocpm-pre',namespace=\"$namespace\"}) |
POD-Count | Measures the number of PCF AM Service pods available in the system. | count(container_memory_usage_bytes{container_name='pcf-amservice',namespace=\"$namespace\"}) |
POD-Count | Measures the number of NRF Client-NRF Discovery pods available in the system. | count(container_memory_usage_bytes{container_name='nrf-client-nfdiscovery',namespace=\"$namespace\"}) |
POD-Count | Measures the number of Ingress Gateway pods available in the system. | count(container_memory_usage_bytes{container_name='ingress-gateway',namespace=\"$namespace\"}) |
POD-Count | Measures the number of Egress Gateway pods available in the system. | count(container_memory_usage_bytes{container_name='egress-gateway',namespace=\"$namespace\"}) |
Total TPS | Measures the rate of (Ingress + Egress + Diameter) Gateway requests received at CNC Policy. |
sum(rate(occnp_oc_ingressgateway_http_requests_total{$namespacelbl="$namespace"}[2m])) + sum(rate(occnp_oc_egressgateway_http_requests_total{$namespacelbl="$namespace",Direction="egressOut"}[2m])) + sum(rate(occnp_diam_request_network_total{$namespacelbl="$namespace", appId!="0"}[2m])) |
Memory-Usage | Measures the current memory usage in bytes. | sum(container_memory_usage_bytes{image!='',namespace=\"$namespace\"}) |
Memory-Usage | Measures the memory usage (in bytes) for the top 16 memory users by each pod. | topk(16,sum(container_memory_usage_bytes{image!='',namespace=\"$namespace\"}) by (container_name)) |
CPU-Usage | Measures the number of cores being used by each pod. | sum(rate(container_cpu_usage_seconds_total{image!='',namespace=\"$namespace\",container_name!='POD'}[2m])) by (container_name) |
Session Management Service
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
SM Create Requests | Measures the rate of requests received at PCF to create SM policy association. | sum(rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"create\",dnn=\"dnn1\",snssai=\"11-abc123\"}[2m])) | Create | Not Applicable |
SM Create Success Response | Measures the rate of requests for which SM policy associations are created successfully. | sum
(rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"create\",response_code=\"2xx\"}[2m]))
Note: [2m] determines the rate interval at which the value needs to be calculated. This value is strictly used for calculation purpose and can be modified accordingly as per customer requirements. It can be changed either before or after adding the sample to Policy Grafana dashboard. |
Create | 2xx |
SM Create Failure Response | Measures the rate at which create requests for SM policy association are rejected by PCF due to certain errors. | sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"create\",response_code!=\"2xx\"}[2m])) | Create | 2xx |
SM Update Request | Measures the rate at which requests are received at PCF to update the SM Policy association data. | sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update\",dnn=\"dnn1\",snssai=\"11-abc123\"}[2m])) | Update | Not Applicable |
SM Update Success Response | Measures the rate at which requests for updating SM policy association (already existing) are updated successfully. | sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update\",response_code=\"2xx\"}[2m])) | Update | 2xx |
SM Update Failure Response | Measures the rate at which requests for updating SM policy association (already existing) are rejected by PCF due to certain errors at the consumer end. | sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update\",response_code=\"4xx\"}[2m])) | Update | 4xx |
SM Delete Requests | Measures the rate of requests received at PCF to delete (or deregister) SM policy association. | sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"delete\",dnn=\"dnn1\",snssai=\"11-abc123\"}[2m])) | Delete | Not Applicable |
SM Delete Success Responses | Measures the rate at which requests for deleting SM policy association (already existing) are deleted successfully. | sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"delete\",response_code=\"2xx\"}[2m])) | Delete | 2xx |
SM Delete Failure Responses | Measures the rate at which requests for deleting SM policy association (already existing) are rejected by PCF due to certain errors at the consumer end. | sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"delete\",response_code!=\"2xx\"}[2m])) | Delete | 2xx |
SM Update Notify Requests | Measures the rate at which requests are initiated by PCF towards SMF to update the PCC rules PDU session related policy context. | sum (rate(ocpm_egress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update_notify\"}[2m])) | Update Notify | Not Applicable |
SM Update Notify Success Response | Measures the rate at which update requests received at SMF (initiated by PCF) are updated successfully. | sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update_notify\",response_code=\"2xx\"}[2m])) | Update Notify | 2xx |
SM Update Notify Failure Response | Measures the rate at which update requests received at SMF (initiated by PCF) are rejected due to certain errors. | sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update_notify\",response_code!=\"2xx\"}[2m])) | Update Notify | 2xx |
SM Terminate Notify Requests | Measures the rate at which requests are initiated by PCF towards SMF to delete SM Policy association of a PDU session. | sum (rate(ocpm_egress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"terminate_notify\"}[2m])) | Terminate Notify | Not Applicable |
SM Terminate Notify Success Response | Measures the rate at which delete requests received at SMF (initiated by PCF) are processed successfully. | sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"terminate_notify\",response_code=\"2xx\"}[2m])) | Terminate Notify | 2xx |
SM Terminate Notify Failure Response | Measures the rate at which delete requests received at SMF (initiated by PCF) are rejected due to certain errors. | sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"terminate_notify\",response_code!=\"2xx\"}[2m])) | Terminate Notify | 2xx |
Diameter Gateway Request and Response
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
Rx AAR Initial Request | Measures the rate of Rx AAR initial messages received by Diameter Gateway. | sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"create\",servicename_3gpp=\"rx\"}[2m])) | Create | Not Applicable |
Rx AAA Initial Response Success | Measures the rate at which Diameter Gateway processes Rx AAR messages successfully by sending Rx AAA as initial response. | sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"create\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) | Create | 2xxx |
Rx AAR Update Request | Measures the rate of Rx AAR update requests received by Diameter Gateway. | sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"update\",servicename_3gpp=\"rx\"}[2m])) | Update | Not Applicable |
Rx AAR Update Response Success | Measures the rate at which Diameter Gateway processes Rx AAR update requests successfully. | sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"update\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) | Update | 2xxx |
Rx STR Request | Measures the rate of the Session-Termination-Request (STR) messages received by Diameter Gateway. | sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"terminate\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) | Terminate | 2xxx |
Rx STR Response Success | Measures the rate at which Diameter Gateway processes Rx STR messages successfully. | sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"terminate\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) | Terminate | 2xxx |
UDR Tracking Request and Response
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
Query SM Data UDR Request | Measures the number of query requests sent by PCF to UDR. | sum (rate(ocpm_udr_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",service_version=\"v1\",servicename_3gpp=\"nudr-dr\"}[2m])) | GET | Not Applicable |
Unsubscribe UDR Request | Measures the number of unsubscribe requests sent by PCF to UDR. | sum(rate(ocpm_udr_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nudr-dr\"}[2m])) | Unsubscribe | Not Applicable |
Subscribe Total UDR Request | Measures the number of subscribe requests sent by PCF to UDR. | sum (rate(ocpm_udr_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nudr-dr\"}[2m])) | Subscribe | Not Applicable |
Subscribe UDR Success | Measures the number of success responses received for subscribe requests sent by PCF to UDR. | sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nudr-dr\",response_code=\"2xx\"}[2m])) | Subscribe | 2xx |
Subscribe UDR Failed | Measures the number of unsuccessful responses received for subscribe requests sent by PCF to UDR. | sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nudr-dr\",response_code!=\"2xx\"}[2m])) | Subscribe | 2xx |
Query SM Data Success | Measures the number of success responses received for query requests sent by PCF to UDR. | sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",servicename_3gpp=\"nudr-dr\",response_code=\"2xx\"}[2m])) | GET | 2xx |
Query SM Data Failed | Measures the number of unsuccessful responses received for query requests sent by PCF to UDR. | sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",servicename_3gpp=\"nudr-dr\",response_code!=\"2xx\"}[2m])) | GET | 2xx |
Unsubscribe UDR Success | Measures the number of success responses received for unsubscribe requests sent by PCF to UDR. | sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nudr-dr\",response_code=\"2xx\"}[2m])) | Unsubscribe | 2xx |
Unsubscribe UDR Failed | Measures the number of unsuccessful responses received for unsubscribe requests sent by PCF to UDR. | sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nudr-dr\",response_code!=\"2xx\"}[2m])) | Unsubscribe | 2xx |
Unsubscribe UDR Timeout | Measures the number of unsubscribe requests not processed successfully at UDR due to timeout error. | sum (rate(ocpm_udr_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nudr-dr\"}[2m])) | Unsubscribe | Not Applicable |
Subscribe UDR Timeout | Measures the number of subscribe requests not processed successfully at UDR due to timeout error. | sum (rate(ocpm_udr_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nudr-dr\"}[2m])) | Subscribe | Not Applicable |
Query UDR Timeout | Measures the number of query requests not processed successfully at UDR due to timeout error. | sum (rate(ocpm_udr_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",servicename_3gpp=\"nudr-dr\"}[2m])) | GET | Not Applicable |
Diameter Egress Request and Response
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
Rx RAR Request | Measures the number of requests sent to external NFs through Egress Gateway. | sum (rate(ocpm_egress_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"update_notify\",servicename_3gpp=\"rx\"}[2m])) | Update Notify | Not Applicable |
Rx RAR Response Success | Measures the number of success responses received for requests sent to external NFs through Egress Gateway. | sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"update_notify\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) | Update Notify | 2xxx |
User Service Inbound
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
GET Request to User Service | Measures the number of query requests received at User Service. | sum (rate(ocpm_userservice_inbound_count_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",service_resource=\"user-service\"}[2m])) | GET | Not Applicable |
Delete Request to User Service | Measures the number of delete requests received at User Service. | sum (rate(ocpm_userservice_inbound_count_total{kubernetes_namespace=\"$namespace\",operation_type=\"delete\",service_resource=\"user-service\"}[2m])) | DELETE | Not Applicable |
Notify Request to User Service | Measures the number of notify requests received at User Service. | sum (rate(ocpm_userservice_inbound_count_total{kubernetes_namespace=\"$namespace\",operation_type=\"notify\",service_resource=\"user-service\"}[2m])) | Notify | Not Applicable |
Diameter Connector
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
Diameter Connector | Measures the total number of connections established towards Diameter Gateway. | sum (rate(occnp_diam_conn_network{kubernetes_namespace=\"$namespace\"}[2m])) | Not Applicable | Not Applicable |
Diameter Connector | Measures the total number of connections established towards SM service application. | sum (rate(occnp_diam_conn_app_network{kubernetes_namespace=\"$namespace\"}[2m])) | Not Applicable | Not Applicable |
Egress Request and Response
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
Egress Delete Request | Measures the number of delete requests sent to external NFs through Egress Gateway. | sum(rate(oc_egressgateway_http_requests_total{kubernetes_namespace=\"$namespace\",Method='DELETE'}[2m])) | DELETE | Not Applicable |
Egress Delete Response | Measures the number of responses to Delete requests – sent to external NFs through Egress Gateway. | sum(rate(oc_egressgateway_http_responses_total{kubernetes_namespace=\"$namespace\",Method='DELETE'}[2m])) | DELETE | Not Applicable |
Egress GET request | Measures the number of query requests sent to external NFs through Egress Gateway. | sum(rate(oc_egressgateway_http_requests_total{kubernetes_namespace=\"$namespace\",Method='GET'}[2m])) | GET | Not Applicable |
Egress GET response | Measures the number of responses to query requests – sent to external NFs through Egress Gateway. | sum(rate(oc_egressgateway_http_responses_total{kubernetes_namespace=\"$namespace\",Method='GET'}[2m])) | GET | Not Applicable |
Egress POST request | Measures the number of POST requests sent to external NFs through Egress Gateway. | sum(rate(oc_egressgateway_http_requests_total{kubernetes_namespace=\"$namespace\",Method='POST'}[2m])) | POST | Not Applicable |
Egress POST response | Measures the number of responses to POST requests – sent to external NFs through Egress Gateway. | sum(rate(oc_egressgateway_http_responses_total{kubernetes_namespace=\"$namespace\",Method='POST'}[2m])) | POST | Not Applicable |
CHF Tracking Request
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
CHF-Subscribe-request | Total number of subscribe requests (spending limit status information) from CHF server. | sum (rate(ocpm_chf_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\"}[2m])) | Subscribe | Not Applicable |
CHF-Unsubscribe-request | Total number of unsubscribe requests (spending limit status information) being sent to CHF server. | sum (rate(ocpm_chf_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\"}[2m])) | Unsubscribe | Not Applicable |
PolicyDS
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
Client Request Total Search | Measures the number of messages sent to LDAP gateway from PolicyDS for LDAP search requests. | sum (rate(client_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",workflow=\"LDAP\"}[2m])) | ||
Client Response Total Search | Measures the number of responses sent by LDAP gateway for LDAP search requests from PolicyDS. | sum (rate(client_response_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",workflow=\"LDAP\",response=\"200\"}[2m])) | ||
Server Request User Service | Measures the number of messages sent to LDAP gateway from PolicyDS for User service search requests. | sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",task=\"USER_SERVICE\"}[2m])) | ||
Server Request LDAP | Measures the number of messages sent to LDAP gateway from PolicyDS for LDAP retrieve requests. | sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"GET\",task=\"LDAP\"}[2m])) | ||
Server Request PRE | Measures the number of messages sent to PRE from PolicyDS for inserting the user data. | sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"INSERT\",task=\"PRE\"}[2m])) | ||
Server Response Success POST | Measures the number of success responses received for POST requests by PolicyDS. | sum (rate(server_response_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"POST\",response=\"200\"}[2m])) | POST | 200 |
CHF Tracking Response
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
Subscribe Success | Measures the total number of success responses for subscribe/fetch spendingLimitStatus requests at CHF connector. | sum (rate(ocpm_chf_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\",response_code=\"2xx\"}[2m])) | Subscribe | 2xx |
Subscribe Failure | Measures the total number of failed responses for subscribe/fetch spendingLimitStatus requests at CHF connector. | sum (rate(ocpm_chf_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\",response_code!=\"2xx\"}[2m])) | Subscribe | 2xx |
Unsubscribe Success | Measures the total number of success responses for unsubscribe spendingLimitStatus requests at CHF connector. | sum (rate(ocpm_chf_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\",response_code=\"2xx\"}[2m])) | Unsubscribe | 2xx |
Unsubscribe Failure | Measures the total number of failed responses for unsubscribe spendingLimitStatus requests at CHF connector. | sum (rate(ocpm_chf_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\",response_code!=\"2xx\"}[2m])) | Unsubscribe | 2xx |
Unsubscribe Timeout | Measures the total number of requests that got timed out at CHF Connector when trying to unsubscribe spendingLimitStatus. | sum (rate(ocpm_chf_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\"}[2m])) | Unsubscribe | Not Applicable |
Subscribe Timeout | Measures the total number of requests that got timed out at CHF Connector when trying to subscribe/fetch spendingLimitStatus. | sum (rate(ocpm_chf_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\"}[2m])) | Subscribe | Not Applicable |
LDAP
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
LDAP Total Requests | Measures the rate of total number of requests received at LDAP. | sum (rate(ldap_request_total{kubernetes_namespace=\"$namespace\",ReqType=\"GET\"}[2m])) | All | Not Applicable |
LDAP Response Success | Measures the rate of total number of successful responses sent to network NFs by LDAP. | sum (rate(ldap_response_total{kubernetes_namespace=\"$namespace\",ReqType=\"GET\",Code=\"2xx\"}[2m])) | All | 2xx |
LDAP Response Failure | Measures the rate of total number of requests that have been rejected by LDAP due to errors at the end of NF consumers. | sum (rate(ldap_response_total{kubernetes_namespace=\"$namespace\",ReqType=\"GET\",Code!=\"2xx\"}[2m])) | All | 2xx |
Policy DS
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
Client_request_total_search | Measures the sum of total number of requests policyds sends for LDAP Search. | sum (rate(client_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",workflow=\"LDAP\"}[2m])) | SEARCH | Not Applicable |
Client_response_total_search | Measures the sum of total number of success responses received by policyds for LDAP Search. | sum (rate(client_response_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",workflow=\"LDAP\",response=\"200\"}[2m])) | SEARCH | 200 |
Server_request_user_service | Measures the total requests received by PolicyDS from NF consumers for UserService search. | sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",task=\"USER_SERVICE\"}[2m])) | SEARCH | Not Applicable |
Server_request_LDAP | Measures the total number or requests policyds received for LDAP Get request. | sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"GET\",task=\"LDAP\"}[2m])) | GET | Not Applicable |
Server_request_PRE | Measures the total number or requests policyds received for PRE Insert request. | sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"INSERT\",task=\"PRE\"}[2m])) | INSERT | Not Applicable |
Server_response_sucess_POST | sum (rate(server_response_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"POST\",response=\"200\"}[2m])) | POST | 200 |
10.1 PCRF KPIs
This section provides information about Key Performance Indicators (KPIs) used for PCRF in CNC Policy.
Note:
Sample PCRF dashboard for Grafana is provided in the CNC Policy Custom Templates folder as part of the documentation.CPU, Memory, and Load Level
KPI Name | KPI Details | Metric used for KPI |
---|---|---|
Pod Health | Shows the health of pods available in the system. | appinfo_service_running{vendor=\"Oracle\", application=\"occnp\", namespace=\"$namespace\"} |
POD-Count | Shows the number of PCRF pods available in the system. | count(container_memory_usage_bytes{container='pcrf-core',namespace=\"$namespace\"}) |
POD-Count | Shows the number of Diameter Gateway pods available in the system. | count(container_memory_usage_bytes{container='diam-gateway',namespace=\"$namespace\"}) |
POD-Count | Shows the number of PRE pods available in the system. | count(container_memory_usage_bytes{container='pre-service',namespace=\"$namespace\"}) |
POD-Count | Shows the number of Policy Datasource pods available in the system. | count(container_memory_usage_bytes{container='policyds',namespace=\"$namespace\"}) |
POD-Count | Shows the number of LDAP Gateway pods available in the system. | count(container_memory_usage_bytes{container='ldap-gateway',namespace=\"$namespace\"}) |
POD-Count | Shows the number of CM service pods available in the system. | count(container_memory_usage_bytes{container='cm-service',namespace=\"$namespace\"}) |
POD-Count | Shows the number of Config Server pods available in the system. | count(container_memory_usage_bytes{container='config-server',namespace=\"$namespace\"}) |
Memory Usage | Shows the current memory usage in bytes. | sum(container_memory_usage_bytes{image!='',namespace=\"$namespace\"}) |
Memory Usage | Shows the memory usage (in bytes) for the top 10 memory users by each pod. | topk(10, sum(container_memory_usage_bytes{namespace=\"$namespace\"}) by (container)) |
Service Load Level | Shows the load level of each service. | avg(load_level) by (service) |
CPU Usage | Shows the CPU usage by each pod. | sum(rate(container_cpu_usage_seconds_total{image!='',namespace=\"$namespace\",container_name!='POD'}[2m])) by (container_name) |
PCRF Success Response
KPI Name | KPI Details | Metric used for KPI | Response Code |
---|---|---|---|
CCA_SUCCESS_COUNT | Shows the rate of success responses received at PCRF for CCA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"CCA.*\", responseCode=~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | 2xx |
AAA_SUCCESS_COUNT | Shows the rate of success responses received at PCRF for AAA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"AAA.*\", responseCode=~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | 2xx |
ASA_SUCCESS_COUNT | Shows the rate of success responses received at PCRF for ASA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"ASA\", responseCode=~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | 2xx |
RAA_SUCCESS_COUNT | Shows the rate of success responses received at PCRF for RAA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"RAA\", responseCode=~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | 2xx |
STA_SUCCESS_COUNT | Shows the rate of success responses received at PCRF for STA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"STA\", responseCode!=\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | 2xx |
PCRF Failure Response
KPI Name | KPI Details | Metric used for KPI | Response Code |
---|---|---|---|
CCA_FAIL_COUNT | Shows the rate of failure responses received at PCRF for CCA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"CCA.*\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | Other than 2xx |
AAA_FAIL_COUNT | Shows the rate of failure responses received at PCRF for AAA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"AAA.*\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | Other than 2xx |
ASA_FAIL_COUNT | Shows the rate of failure responses received at PCRF for ASA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"ASA\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | Other than 2xx |
RAA_FAIL_COUNT | Shows the rate of success responses received at PCRF for RAA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"RAA\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | Other than 2xx |
STA_FAIL_COUNT | Shows the rate of failure responses received at PCRF for STA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"STA\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) | Other than 2xx |
PCRF Timeout Failure
KPI Name | KPI Details | Metric used for KPI |
---|---|---|
ASA_TIMEOUT_COUNT | Shows the rate of failure responses due to timeout for ASA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"ASA\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) |
RAA_TIMEOUT_COUNT | Shows the rate of failure responses due to timeout for RAA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"RAA\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) |
Diameter Gateway Timeout Failure
KPI Name | KPI Details | Metric used for KPI |
---|---|---|
CCA_TIMEOUT_COUNT | Shows the rate of failure responses due to timeout received at Diameter Gateway for CCA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"CCA.*\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"diam-gateway\"}[5m])) |
AAA_TIMEOUT_COUNT | Shows the rate of failure responses due to timeout received at Diameter Gateway for AAA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"AAA.*\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"diam-gateway\"}[5m])) |
STA_TIMEOUT_COUNT | Shows the rate of failure responses due to timeout received at Diameter Gateway for STA message type. | sum(rate(occnp_diam_response_local_total{msgType=~\"STA\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"diam-gateway\"}[5m])) |
Diameter Gateway Response
KPI Name | KPI Details | Metric used for KPI | Service Operation | Response Code |
---|---|---|---|---|
FAILURE_RATE | Shows the percentage of total failure responses of all incoming messages (for Diameter Gateway and PCRF) per unit time. | sum(rate(occnp_diam_response_local_total{namespace=\"$namespace\",responseCode!~\"2.*\"}[5m])) / sum(rate(occnp_diam_response_local_total{namespace=\"$namespace\"}[5m]))*100 | Not Applicable | Other than 2xx |
SUCCESS_RATE | Shows the percentage of total success responses of all incoming messages (for Diameter Gateway and PCRF) per unit time. | sum(rate(occnp_diam_response_local_total{namespace=\"$namespace\",responseCode=~\"2.*\"}[5m])) / sum(rate(occnp_diam_response_local_total{namespace=\"$namespace\"}[5m]))*100 | Not Applicable | 2xx |