10 CNC Policy KPIs

This section provides information about Key Performance Indicators (KPIs) used for Cloud Native Core Policy.

Note:

Sample CNC Policy dashboard for Grafana is delivered to the customer through CNC Policy Custom Templates. The metrics and functions used to achieve KPIs are covered in CNC Policy Custom Templates as well.

CPU and Memory Usage

KPI Name KPI Details Metric used for KPI
POD-Count Measures the number of PCF SM Service pods available in the system. count(container_memory_usage_bytes{container_name='pcf-smservice',namespace=\"$namespace\"})
POD-Count Measures the number of PCF User Service pods available in the system. count(container_memory_usage_bytes{container_name='pcf-userservice',namespace=\"$namespace\"})
POD-Count Measures the number of PRE pods available in the system. count(container_memory_usage_bytes{container_name='ocpm-pre',namespace=\"$namespace\"})
POD-Count Measures the number of PCF AM Service pods available in the system. count(container_memory_usage_bytes{container_name='pcf-amservice',namespace=\"$namespace\"})
POD-Count Measures the number of NRF Client-NRF Discovery pods available in the system. count(container_memory_usage_bytes{container_name='nrf-client-nfdiscovery',namespace=\"$namespace\"})
POD-Count Measures the number of Ingress Gateway pods available in the system. count(container_memory_usage_bytes{container_name='ingress-gateway',namespace=\"$namespace\"})
POD-Count Measures the number of Egress Gateway pods available in the system. count(container_memory_usage_bytes{container_name='egress-gateway',namespace=\"$namespace\"})
Total TPS Measures the rate of (Ingress + Egress + Diameter) Gateway requests received at CNC Policy.

sum(rate(occnp_oc_ingressgateway_http_requests_total{$namespacelbl="$namespace"}[2m])) + sum(rate(occnp_oc_egressgateway_http_requests_total{$namespacelbl="$namespace",Direction="egressOut"}[2m])) + sum(rate(occnp_diam_request_network_total{$namespacelbl="$namespace", appId!="0"}[2m]))

Memory-Usage Measures the current memory usage in bytes. sum(container_memory_usage_bytes{image!='',namespace=\"$namespace\"})
Memory-Usage Measures the memory usage (in bytes) for the top 16 memory users by each pod. topk(16,sum(container_memory_usage_bytes{image!='',namespace=\"$namespace\"}) by (container_name))
CPU-Usage Measures the number of cores being used by each pod. sum(rate(container_cpu_usage_seconds_total{image!='',namespace=\"$namespace\",container_name!='POD'}[2m])) by (container_name)

Session Management Service

KPI Name KPI Details Metric used for KPI Service Operation Response Code
SM Create Requests Measures the rate of requests received at PCF to create SM policy association. sum(rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"create\",dnn=\"dnn1\",snssai=\"11-abc123\"}[2m])) Create Not Applicable
SM Create Success Response Measures the rate of requests for which SM policy associations are created successfully. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"create\",response_code=\"2xx\"}[2m]))

Note: [2m] determines the rate interval at which the value needs to be calculated. This value is strictly used for calculation purpose and can be modified accordingly as per customer requirements. It can be changed either before or after adding the sample to Policy Grafana dashboard.

Create 2xx
SM Create Failure Response Measures the rate at which create requests for SM policy association are rejected by PCF due to certain errors. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"create\",response_code!=\"2xx\"}[2m])) Create 2xx
SM Update Request Measures the rate at which requests are received at PCF to update the SM Policy association data. sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update\",dnn=\"dnn1\",snssai=\"11-abc123\"}[2m])) Update Not Applicable
SM Update Success Response Measures the rate at which requests for updating SM policy association (already existing) are updated successfully. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update\",response_code=\"2xx\"}[2m])) Update 2xx
SM Update Failure Response Measures the rate at which requests for updating SM policy association (already existing) are rejected by PCF due to certain errors at the consumer end. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update\",response_code=\"4xx\"}[2m])) Update 4xx
SM Delete Requests Measures the rate of requests received at PCF to delete (or deregister) SM policy association. sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"delete\",dnn=\"dnn1\",snssai=\"11-abc123\"}[2m])) Delete Not Applicable
SM Delete Success Responses Measures the rate at which requests for deleting SM policy association (already existing) are deleted successfully. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"delete\",response_code=\"2xx\"}[2m])) Delete 2xx
SM Delete Failure Responses Measures the rate at which requests for deleting SM policy association (already existing) are rejected by PCF due to certain errors at the consumer end. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"delete\",response_code!=\"2xx\"}[2m])) Delete 2xx
SM Update Notify Requests Measures the rate at which requests are initiated by PCF towards SMF to update the PCC rules PDU session related policy context. sum (rate(ocpm_egress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update_notify\"}[2m])) Update Notify Not Applicable
SM Update Notify Success Response Measures the rate at which update requests received at SMF (initiated by PCF) are updated successfully. sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update_notify\",response_code=\"2xx\"}[2m])) Update Notify 2xx
SM Update Notify Failure Response Measures the rate at which update requests received at SMF (initiated by PCF) are rejected due to certain errors. sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"update_notify\",response_code!=\"2xx\"}[2m])) Update Notify 2xx
SM Terminate Notify Requests Measures the rate at which requests are initiated by PCF towards SMF to delete SM Policy association of a PDU session. sum (rate(ocpm_egress_request_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"terminate_notify\"}[2m])) Terminate Notify Not Applicable
SM Terminate Notify Success Response Measures the rate at which delete requests received at SMF (initiated by PCF) are processed successfully. sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"terminate_notify\",response_code=\"2xx\"}[2m])) Terminate Notify 2xx
SM Terminate Notify Failure Response Measures the rate at which delete requests received at SMF (initiated by PCF) are rejected due to certain errors. sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",servicename_3gpp=\"npcf-smpolicycontrol\",operation_type=\"terminate_notify\",response_code!=\"2xx\"}[2m])) Terminate Notify 2xx

Diameter Gateway Request and Response

KPI Name KPI Details Metric used for KPI Service Operation Response Code
Rx AAR Initial Request Measures the rate of Rx AAR initial messages received by Diameter Gateway. sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"create\",servicename_3gpp=\"rx\"}[2m])) Create Not Applicable
Rx AAA Initial Response Success Measures the rate at which Diameter Gateway processes Rx AAR messages successfully by sending Rx AAA as initial response. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"create\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) Create 2xxx
Rx AAR Update Request Measures the rate of Rx AAR update requests received by Diameter Gateway. sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"update\",servicename_3gpp=\"rx\"}[2m])) Update Not Applicable
Rx AAR Update Response Success Measures the rate at which Diameter Gateway processes Rx AAR update requests successfully. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"update\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) Update 2xxx
Rx STR Request Measures the rate of the Session-Termination-Request (STR) messages received by Diameter Gateway. sum (rate(ocpm_ingress_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"terminate\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) Terminate 2xxx
Rx STR Response Success Measures the rate at which Diameter Gateway processes Rx STR messages successfully. sum (rate(ocpm_ingress_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"terminate\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) Terminate 2xxx

UDR Tracking Request and Response

KPI Name KPI Details Metric used for KPI Service Operation Response Code
Query SM Data UDR Request Measures the number of query requests sent by PCF to UDR. sum (rate(ocpm_udr_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",service_version=\"v1\",servicename_3gpp=\"nudr-dr\"}[2m])) GET Not Applicable
Unsubscribe UDR Request Measures the number of unsubscribe requests sent by PCF to UDR. sum(rate(ocpm_udr_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nudr-dr\"}[2m])) Unsubscribe Not Applicable
Subscribe Total UDR Request Measures the number of subscribe requests sent by PCF to UDR. sum (rate(ocpm_udr_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nudr-dr\"}[2m])) Subscribe Not Applicable
Subscribe UDR Success Measures the number of success responses received for subscribe requests sent by PCF to UDR. sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nudr-dr\",response_code=\"2xx\"}[2m])) Subscribe 2xx
Subscribe UDR Failed Measures the number of unsuccessful responses received for subscribe requests sent by PCF to UDR. sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nudr-dr\",response_code!=\"2xx\"}[2m])) Subscribe 2xx
Query SM Data Success Measures the number of success responses received for query requests sent by PCF to UDR. sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",servicename_3gpp=\"nudr-dr\",response_code=\"2xx\"}[2m])) GET 2xx
Query SM Data Failed Measures the number of unsuccessful responses received for query requests sent by PCF to UDR. sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",servicename_3gpp=\"nudr-dr\",response_code!=\"2xx\"}[2m])) GET 2xx
Unsubscribe UDR Success Measures the number of success responses received for unsubscribe requests sent by PCF to UDR. sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nudr-dr\",response_code=\"2xx\"}[2m])) Unsubscribe 2xx
Unsubscribe UDR Failed Measures the number of unsuccessful responses received for unsubscribe requests sent by PCF to UDR. sum (rate(ocpm_udr_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nudr-dr\",response_code!=\"2xx\"}[2m])) Unsubscribe 2xx
Unsubscribe UDR Timeout Measures the number of unsubscribe requests not processed successfully at UDR due to timeout error. sum (rate(ocpm_udr_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nudr-dr\"}[2m])) Unsubscribe Not Applicable
Subscribe UDR Timeout Measures the number of subscribe requests not processed successfully at UDR due to timeout error. sum (rate(ocpm_udr_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nudr-dr\"}[2m])) Subscribe Not Applicable
Query UDR Timeout Measures the number of query requests not processed successfully at UDR due to timeout error. sum (rate(ocpm_udr_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",servicename_3gpp=\"nudr-dr\"}[2m])) GET Not Applicable

Diameter Egress Request and Response

KPI Name KPI Details Metric used for KPI Service Operation Response Code
Rx RAR Request Measures the number of requests sent to external NFs through Egress Gateway. sum (rate(ocpm_egress_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"update_notify\",servicename_3gpp=\"rx\"}[2m])) Update Notify Not Applicable
Rx RAR Response Success Measures the number of success responses received for requests sent to external NFs through Egress Gateway. sum (rate(ocpm_egress_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"update_notify\",servicename_3gpp=\"rx\",response_code=\"2xxx\"}[2m])) Update Notify 2xxx

User Service Inbound

KPI Name KPI Details Metric used for KPI Service Operation Response Code
GET Request to User Service Measures the number of query requests received at User Service. sum (rate(ocpm_userservice_inbound_count_total{kubernetes_namespace=\"$namespace\",operation_type=\"get\",service_resource=\"user-service\"}[2m])) GET Not Applicable
Delete Request to User Service Measures the number of delete requests received at User Service. sum (rate(ocpm_userservice_inbound_count_total{kubernetes_namespace=\"$namespace\",operation_type=\"delete\",service_resource=\"user-service\"}[2m])) DELETE Not Applicable
Notify Request to User Service Measures the number of notify requests received at User Service. sum (rate(ocpm_userservice_inbound_count_total{kubernetes_namespace=\"$namespace\",operation_type=\"notify\",service_resource=\"user-service\"}[2m])) Notify Not Applicable

Diameter Connector

KPI Name KPI Details Metric used for KPI Service Operation Response Code
Diameter Connector Measures the total number of connections established towards Diameter Gateway. sum (rate(occnp_diam_conn_network{kubernetes_namespace=\"$namespace\"}[2m])) Not Applicable Not Applicable
Diameter Connector Measures the total number of connections established towards SM service application. sum (rate(occnp_diam_conn_app_network{kubernetes_namespace=\"$namespace\"}[2m])) Not Applicable Not Applicable

Egress Request and Response

KPI Name KPI Details Metric used for KPI Service Operation Response Code
Egress Delete Request Measures the number of delete requests sent to external NFs through Egress Gateway. sum(rate(oc_egressgateway_http_requests_total{kubernetes_namespace=\"$namespace\",Method='DELETE'}[2m])) DELETE Not Applicable
Egress Delete Response Measures the number of responses to Delete requests – sent to external NFs through Egress Gateway. sum(rate(oc_egressgateway_http_responses_total{kubernetes_namespace=\"$namespace\",Method='DELETE'}[2m])) DELETE Not Applicable
Egress GET request Measures the number of query requests sent to external NFs through Egress Gateway. sum(rate(oc_egressgateway_http_requests_total{kubernetes_namespace=\"$namespace\",Method='GET'}[2m])) GET Not Applicable
Egress GET response Measures the number of responses to query requests – sent to external NFs through Egress Gateway. sum(rate(oc_egressgateway_http_responses_total{kubernetes_namespace=\"$namespace\",Method='GET'}[2m])) GET Not Applicable
Egress POST request Measures the number of POST requests sent to external NFs through Egress Gateway. sum(rate(oc_egressgateway_http_requests_total{kubernetes_namespace=\"$namespace\",Method='POST'}[2m])) POST Not Applicable
Egress POST response Measures the number of responses to POST requests – sent to external NFs through Egress Gateway. sum(rate(oc_egressgateway_http_responses_total{kubernetes_namespace=\"$namespace\",Method='POST'}[2m])) POST Not Applicable

CHF Tracking Request

KPI Name KPI Details Metric used for KPI Service Operation Response Code
CHF-Subscribe-request Total number of subscribe requests (spending limit status information) from CHF server. sum (rate(ocpm_chf_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\"}[2m])) Subscribe Not Applicable
CHF-Unsubscribe-request Total number of unsubscribe requests (spending limit status information) being sent to CHF server. sum (rate(ocpm_chf_tracking_request_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\"}[2m])) Unsubscribe Not Applicable

PolicyDS

KPI Name KPI Details Metric used for KPI Service Operation Response Code
Client Request Total Search Measures the number of messages sent to LDAP gateway from PolicyDS for LDAP search requests. sum (rate(client_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",workflow=\"LDAP\"}[2m]))    
Client Response Total Search Measures the number of responses sent by LDAP gateway for LDAP search requests from PolicyDS. sum (rate(client_response_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",workflow=\"LDAP\",response=\"200\"}[2m]))    
Server Request User Service Measures the number of messages sent to LDAP gateway from PolicyDS for User service search requests. sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",task=\"USER_SERVICE\"}[2m]))    
Server Request LDAP Measures the number of messages sent to LDAP gateway from PolicyDS for LDAP retrieve requests. sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"GET\",task=\"LDAP\"}[2m]))    
Server Request PRE Measures the number of messages sent to PRE from PolicyDS for inserting the user data. sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"INSERT\",task=\"PRE\"}[2m]))    
Server Response Success POST Measures the number of success responses received for POST requests by PolicyDS. sum (rate(server_response_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"POST\",response=\"200\"}[2m])) POST 200

CHF Tracking Response

KPI Name KPI Details Metric used for KPI Service Operation Response Code
Subscribe Success Measures the total number of success responses for subscribe/fetch spendingLimitStatus requests at CHF connector. sum (rate(ocpm_chf_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\",response_code=\"2xx\"}[2m])) Subscribe 2xx
Subscribe Failure Measures the total number of failed responses for subscribe/fetch spendingLimitStatus requests at CHF connector. sum (rate(ocpm_chf_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\",response_code!=\"2xx\"}[2m])) Subscribe 2xx
Unsubscribe Success Measures the total number of success responses for unsubscribe spendingLimitStatus requests at CHF connector. sum (rate(ocpm_chf_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\",response_code=\"2xx\"}[2m])) Unsubscribe 2xx
Unsubscribe Failure Measures the total number of failed responses for unsubscribe spendingLimitStatus requests at CHF connector. sum (rate(ocpm_chf_tracking_response_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\",response_code!=\"2xx\"}[2m])) Unsubscribe 2xx
Unsubscribe Timeout Measures the total number of requests that got timed out at CHF Connector when trying to unsubscribe spendingLimitStatus. sum (rate(ocpm_chf_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"unsubscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\"}[2m])) Unsubscribe Not Applicable
Subscribe Timeout Measures the total number of requests that got timed out at CHF Connector when trying to subscribe/fetch spendingLimitStatus. sum (rate(ocpm_chf_tracking_request_timeout_total{kubernetes_namespace=\"$namespace\",operation_type=\"subscribe\",servicename_3gpp=\"nchf-spendinglimitcontrol\"}[2m])) Subscribe Not Applicable

LDAP

KPI Name KPI Details Metric used for KPI Service Operation Response Code
LDAP Total Requests Measures the rate of total number of requests received at LDAP. sum (rate(ldap_request_total{kubernetes_namespace=\"$namespace\",ReqType=\"GET\"}[2m])) All Not Applicable
LDAP Response Success Measures the rate of total number of successful responses sent to network NFs by LDAP. sum (rate(ldap_response_total{kubernetes_namespace=\"$namespace\",ReqType=\"GET\",Code=\"2xx\"}[2m])) All 2xx
LDAP Response Failure Measures the rate of total number of requests that have been rejected by LDAP due to errors at the end of NF consumers. sum (rate(ldap_response_total{kubernetes_namespace=\"$namespace\",ReqType=\"GET\",Code!=\"2xx\"}[2m])) All 2xx

Policy DS

KPI Name KPI Details Metric used for KPI Service Operation Response Code
Client_request_total_search Measures the sum of total number of requests policyds sends for LDAP Search. sum (rate(client_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",workflow=\"LDAP\"}[2m])) SEARCH Not Applicable
Client_response_total_search Measures the sum of total number of success responses received by policyds for LDAP Search. sum (rate(client_response_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",workflow=\"LDAP\",response=\"200\"}[2m])) SEARCH 200
Server_request_user_service Measures the total requests received by PolicyDS from NF consumers for UserService search. sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"SEARCH\",task=\"USER_SERVICE\"}[2m])) SEARCH Not Applicable
Server_request_LDAP Measures the total number or requests policyds received for LDAP Get request. sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"GET\",task=\"LDAP\"}[2m])) GET Not Applicable
Server_request_PRE Measures the total number or requests policyds received for PRE Insert request. sum (rate(server_request_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"INSERT\",task=\"PRE\"}[2m])) INSERT Not Applicable
Server_response_sucess_POST   sum (rate(server_response_total{kubernetes_namespace=\"$namespace\",application=\"policyds\",operation=\"POST\",response=\"200\"}[2m])) POST 200

10.1 PCRF KPIs

This section provides information about Key Performance Indicators (KPIs) used for PCRF in CNC Policy.

Note:

Sample PCRF dashboard for Grafana is provided in the CNC Policy Custom Templates folder as part of the documentation.

CPU, Memory, and Load Level

KPI Name KPI Details Metric used for KPI
Pod Health Shows the health of pods available in the system. appinfo_service_running{vendor=\"Oracle\", application=\"occnp\", namespace=\"$namespace\"}
POD-Count Shows the number of PCRF pods available in the system. count(container_memory_usage_bytes{container='pcrf-core',namespace=\"$namespace\"})
POD-Count Shows the number of Diameter Gateway pods available in the system. count(container_memory_usage_bytes{container='diam-gateway',namespace=\"$namespace\"})
POD-Count Shows the number of PRE pods available in the system. count(container_memory_usage_bytes{container='pre-service',namespace=\"$namespace\"})
POD-Count Shows the number of Policy Datasource pods available in the system. count(container_memory_usage_bytes{container='policyds',namespace=\"$namespace\"})
POD-Count Shows the number of LDAP Gateway pods available in the system. count(container_memory_usage_bytes{container='ldap-gateway',namespace=\"$namespace\"})
POD-Count Shows the number of CM service pods available in the system. count(container_memory_usage_bytes{container='cm-service',namespace=\"$namespace\"})
POD-Count Shows the number of Config Server pods available in the system. count(container_memory_usage_bytes{container='config-server',namespace=\"$namespace\"})
Memory Usage Shows the current memory usage in bytes. sum(container_memory_usage_bytes{image!='',namespace=\"$namespace\"})
Memory Usage Shows the memory usage (in bytes) for the top 10 memory users by each pod. topk(10, sum(container_memory_usage_bytes{namespace=\"$namespace\"}) by (container))
Service Load Level Shows the load level of each service. avg(load_level) by (service)
CPU Usage Shows the CPU usage by each pod. sum(rate(container_cpu_usage_seconds_total{image!='',namespace=\"$namespace\",container_name!='POD'}[2m])) by (container_name)

PCRF Success Response

KPI Name KPI Details Metric used for KPI Response Code
CCA_SUCCESS_COUNT Shows the rate of success responses received at PCRF for CCA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"CCA.*\", responseCode=~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) 2xx
AAA_SUCCESS_COUNT Shows the rate of success responses received at PCRF for AAA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"AAA.*\", responseCode=~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) 2xx
ASA_SUCCESS_COUNT Shows the rate of success responses received at PCRF for ASA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"ASA\", responseCode=~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) 2xx
RAA_SUCCESS_COUNT Shows the rate of success responses received at PCRF for RAA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"RAA\", responseCode=~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) 2xx
STA_SUCCESS_COUNT Shows the rate of success responses received at PCRF for STA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"STA\", responseCode!=\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) 2xx

PCRF Failure Response

KPI Name KPI Details Metric used for KPI Response Code
CCA_FAIL_COUNT Shows the rate of failure responses received at PCRF for CCA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"CCA.*\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) Other than 2xx
AAA_FAIL_COUNT Shows the rate of failure responses received at PCRF for AAA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"AAA.*\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) Other than 2xx
ASA_FAIL_COUNT Shows the rate of failure responses received at PCRF for ASA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"ASA\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) Other than 2xx
RAA_FAIL_COUNT Shows the rate of success responses received at PCRF for RAA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"RAA\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) Other than 2xx
STA_FAIL_COUNT Shows the rate of failure responses received at PCRF for STA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"STA\", responseCode!~\"2.*\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m])) Other than 2xx

PCRF Timeout Failure

KPI Name KPI Details Metric used for KPI
ASA_TIMEOUT_COUNT Shows the rate of failure responses due to timeout for ASA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"ASA\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m]))
RAA_TIMEOUT_COUNT Shows the rate of failure responses due to timeout for RAA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"RAA\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"pcrf-core\"}[5m]))

Diameter Gateway Timeout Failure

KPI Name KPI Details Metric used for KPI
CCA_TIMEOUT_COUNT Shows the rate of failure responses due to timeout received at Diameter Gateway for CCA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"CCA.*\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"diam-gateway\"}[5m]))
AAA_TIMEOUT_COUNT Shows the rate of failure responses due to timeout received at Diameter Gateway for AAA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"AAA.*\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"diam-gateway\"}[5m]))
STA_TIMEOUT_COUNT Shows the rate of failure responses due to timeout received at Diameter Gateway for STA message type. sum(rate(occnp_diam_response_local_total{msgType=~\"STA\", responseCode=\"timeout\", namespace=\"$namespace\",container=\"diam-gateway\"}[5m]))

Diameter Gateway Response

KPI Name KPI Details Metric used for KPI Service Operation Response Code
FAILURE_RATE Shows the percentage of total failure responses of all incoming messages (for Diameter Gateway and PCRF) per unit time. sum(rate(occnp_diam_response_local_total{namespace=\"$namespace\",responseCode!~\"2.*\"}[5m])) / sum(rate(occnp_diam_response_local_total{namespace=\"$namespace\"}[5m]))*100 Not Applicable Other than 2xx
SUCCESS_RATE Shows the percentage of total success responses of all incoming messages (for Diameter Gateway and PCRF) per unit time. sum(rate(occnp_diam_response_local_total{namespace=\"$namespace\",responseCode=~\"2.*\"}[5m])) / sum(rate(occnp_diam_response_local_total{namespace=\"$namespace\"}[5m]))*100 Not Applicable 2xx