job.autoscaler.backlog-processing.lag-threshold |
5 min |
Duration |
Lag threshold which will prevent unnecessary scalings while removing the pending messages responsible for the lag. |
job.autoscaler.catch-up.duration |
30 min |
Duration |
The target duration for fully processing any backlog after a scaling operation. Set to 0 to disable backlog based scaling. |
job.autoscaler.enabled |
false |
Boolean |
Enable job autoscaler module. |
job.autoscaler.flink.rest-client.timeout |
10 s |
Duration |
The timeout for waiting the flink rest client to return. |
job.autoscaler.history.max.age |
86400000 ms |
Duration |
Maximum age for past scaling decisions to retain. |
job.autoscaler.history.max.count |
3 |
Integer |
Maximum number of past scaling decisions to retain per vertex. |
job.autoscaler.metrics.busy-time.aggregator |
MAX |
Enum |
Metric aggregator to use for busyTime metrics. This affects how true processing/output rate will be computed. Using max allows us to handle jobs with data skew more robustly, while avg may provide better stability when we know that the load distribution is even.
Possible values: |
job.autoscaler.metrics.window |
15 min |
Duration |
Scaling metrics aggregation window size. |
job.autoscaler.observed-true-processing-rate.lag-threshold |
30 s |
Duration |
Lag threshold for enabling observed true processing rate measurements. |
job.autoscaler.observed-true-processing-rate.min-observations |
2 |
Integer |
Minimum nr of observations used when estimating / switching to observed true processing rate. |
job.autoscaler.observed-true-processing-rate.switch-threshold |
0.15 |
Double |
Percentage threshold for switching to observed from busy time based true processing rate if the measurement is off by at least the configured fraction. For example 0.15 means we switch to observed if the busy time based computation is at least 15% higher during catchup. |
job.autoscaler.restart.time |
5 min |
Duration |
Expected restart time to be used until the operator can determine it reliably from history. |
job.autoscaler.scale-down.max-factor |
0.6 |
Double |
Max scale down factor. 1 means no limit on scale down, 0.6 means job can only be scaled down with 60% of the original parallelism. |
job.autoscaler.scale-up.grace-period |
1 h |
Duration |
Duration in which no scale down of a vertex is allowed after it has been scaled up. |
job.autoscaler.scale-up.max-factor |
100000.0 |
Double |
Max scale up factor. 2.0 means job can only be scaled up with 200% of the current parallelism. |
job.autoscaler.scaling.effectiveness.detection.enabled |
false |
Boolean |
Whether to enable detection of ineffective scaling operations and allowing the autoscaler to block further scale ups. |
job.autoscaler.scaling.effectiveness.threshold |
0.1 |
Double |
Processing rate increase threshold for detecting ineffective scaling threshold. 0.1 means if we do not accomplish at least 10% of the desired capacity increase with scaling, the action is marked ineffective. |
job.autoscaler.scaling.enabled |
true |
Boolean |
Enable vertex scaling execution by the autoscaler. If disabled, the autoscaler will only collect metrics and evaluate the suggested parallelism for each vertex but will not upgrade the jobs. |
job.autoscaler.scaling.event.interval |
30 min |
Duration |
Time interval to resend the identical event |
job.autoscaler.stabilization.interval |
5 min |
Duration |
Stabilization period in which no new scaling will be executed |
job.autoscaler.target.utilization |
0.7 |
Double |
Target vertex utilization |
job.autoscaler.target.utilization.boundary |
0.3 |
Double |
Target vertex utilization boundary. Scaling won't be performed if the current processing rate is within [target_rate / (target_utilization - boundary), (target_rate / (target_utilization + boundary)] |
job.autoscaler.vertex.exclude.ids |
|
List<String> |
A (semicolon-separated) list of vertex ids in hexstring for which to disable scaling. Caution: For non-sink vertices this will still scale their downstream operators until https://issues.apache.org/jira/browse/FLINK-31215 is implemented. |
job.autoscaler.vertex.max-parallelism |
200 |
Integer |
The maximum parallelism the autoscaler can use. Note that this limit will be ignored if it is higher than the max parallelism configured in the Flink config or directly on each operator. |
job.autoscaler.vertex.min-parallelism |
1 |
Integer |
The minimum parallelism the autoscaler can use. |