Autoscaling
Learn about autoscaling for Oracle Communications Unified Assurance microservices.
Overview
Autoscaling provides an automated approach to increase or decrease the allocated computational resources, as the workload changes.
In Unified Assurance, Kubernetes Event-Driven Autoscaling (KEDA) provides real-time automatic scaling decisions for microservices. If the application has a high workload, Kubernetes assigns more work units as required to keep the application running optimally. When the workload decreases, Kubernetes scales down the work units to keep resource use optimal and efficient.
Default autoscaling configuration
Name | Value | Possible Values | Notes |
---|---|---|---|
enabled | true | true, false | Whether to enable or disable autoscaling. Enabled by default. |
pollingInterval | 5 | Integer | The interval in seconds at which each metric value is checked against the threshold. If any metrics surpassing the threshold, replicas are scaled. |
cooldownPeriod | 300 | Integer | The period in seconds to wait before scaling the resources back to the minimum number of replicas. |
minReplicaCount | 1 | Integer | The minimum number of replicas when the resources are scaled down. |
maxReplicaCount | 20 | Integer | The maximum number of replicas when the resources are scaled up. |
Example of changing autoscaling configuration
Auto-scaling is enabled by default. To disable auto-scaling, pass the following during the installation command:
a1helm install ... --set autoscaling.enabled=false
To change multiple options, configuration can be passed in the following way in the installation command:
a1helm install ... --set autoscaling.cooldownPeriod=150 --set autoscaling.maxReplicaCount=10