Autoscaling

Learn about autoscaling for Oracle Communications Unified Assurance microservices.

Overview

Autoscaling provides an automated approach to increase or decrease the allocated computational resources, as the workload changes.

In Unified Assurance, Kubernetes Event-Driven Autoscaling (KEDA) provides real-time automatic scaling decisions for microservices. If the application has a high workload, Kubernetes assigns more work units as required to keep the application running optimally. When the workload decreases, Kubernetes scales down the work units to keep resource use optimal and efficient.

Default autoscaling configuration

Name	Value	Possible Values	Notes
enabled	true	true, false	Whether to enable or disable autoscaling. Enabled by default.
pollingInterval	5	Integer	The interval in seconds at which each metric value is checked against the threshold. If any metrics surpassing the threshold, replicas are scaled.
cooldownPeriod	300	Integer	The period in seconds to wait before scaling the resources back to the minimum number of replicas.
minReplicaCount	1	Integer	The minimum number of replicas when the resources are scaled down.
maxReplicaCount	20	Integer	The maximum number of replicas when the resources are scaled up.

Example of changing autoscaling configuration

Auto-scaling is enabled by default. To disable auto-scaling, pass the following during the installation command:

a1helm install ... --set autoscaling.enabled=false

To change multiple options, configuration can be passed in the following way in the installation command:

a1helm install ... --set autoscaling.cooldownPeriod=150 --set autoscaling.maxReplicaCount=10

Title and Copyright Information

Implementation Guide

F88952-01