Defines configuration for automatically scaling container instances based on queue length. The autoscaler monitors a queue and adjusts the number of running replicas to maintain the desired queue length.
Properties
| Name | Type | Required | Description |
|---|---|---|---|
| desired_queue_length | int | ✅ | The target number of items in the queue that the autoscaler attempts to maintain by scaling the containers up or down |
| max_replicas | int | ✅ | The maximum number of instances the container can scale up to |
| min_replicas | int | ✅ | The minimum number of instances the container can scale down to, ensuring baseline availability |
| max_downscale_per_minute | int | ❌ | The maximum number of instances that can be removed per minute to prevent rapid downscaling |
| max_upscale_per_minute | int | ❌ | The maximum number of instances that can be added per minute to prevent rapid upscaling |
| polling_period | int | ❌ | The period (in seconds) in which the autoscaler checks the queue length and applies the scaling formula |