KEDA does not provide support for high-availability due to upstream limitations.
Here is an overview of all KEDA deployments and the supported replicas:
Cluster capacity requirements
The KEDA runtime require the following resources in a production-ready setup:
|Operator||Limit: 1, Request: 100m||Limit: 1000Mi, Request: 100Mi|
|Metrics Server||Limit: 1, Request: 100m||Limit: 1000Mi, Request: 100Mi|
These are used by default when deploying through YAML.
💡 For more info on CPU and Memory resource units and their meaning, see this link.
Some scalers issue HTTP requests to external servers (i.e. cloud services). Each applicable scaler uses its own dedicated HTTP client with its own connection pool, and by default each client is set to time out any HTTP request after 3 seconds.
You can override this default by setting the
KEDA_HTTP_DEFAULT_TIMEOUT environment variable to your desired timeout in milliseconds. For example, on Linux/Mac/Windows WSL2 operating systems, you’d use this command to set to 1 second:
And on Windows Powershell, you’d use this command:
All applicable scalers will use this timeout. Setting a per-scaler timeout is currently unsupported.