Presto Configuration Properties¶

This section describes configuration properties that may be used to tune Presto or alter its behavior when required.

The following is not a complete list of all configuration properties available in Presto, and does not include any connector-specific catalog configuration properties.

For information on catalog configuration properties, see the connector documentation.

For information on session properties, see Presto Session Properties.

General Properties¶

`join-distribution-type`¶

Type: string
Allowed values: AUTOMATIC, PARTITIONED, BROADCAST
Default value: AUTOMATIC

The type of distributed join to use. When set to PARTITIONED, presto will use hash distributed joins. When set to BROADCAST, it will broadcast the right table to all nodes in the cluster that have data from the left table. Partitioned joins require redistributing both tables using a hash of the join key. This can be slower (sometimes substantially) than broadcast joins, but allows much larger joins. In particular broadcast joins will be faster if the right table is much smaller than the left. However, broadcast joins require that the tables on the right side of the join after filtering fit in memory on each node, whereas distributed joins only need to fit in distributed memory across all nodes. When set to AUTOMATIC, Presto will make a cost based decision as to which distribution type is optimal. It will also consider switching the left and right inputs to the join. In AUTOMATIC mode, Presto will default to hash distributed joins if no cost could be computed, such as if the tables do not have statistics.

The corresponding session property is join_distribution_type.

`redistribute-writes`¶

Type: boolean
Default value: false

This property enables redistribution of data before writing. This can eliminate the performance impact of data skew when writing by hashing it across nodes in the cluster. It can be disabled when it is known that the output data set is not skewed in order to avoid the overhead of hashing and redistributing all the data across the network.

When both scale-writers and redistribute-writes are set to true, scale-writers takes precedence.

The corresponding session property is redistribute_writes.

`scale-writers`¶

Type: boolean
Default value: true

This property enables dynamic scaling of writer tasks based on throughput. When enabled, Presto automatically adjusts the number of writer tasks to use the minimum necessary for optimal performance. This can improve resource utilization by scaling out writers only when needed based on data throughput.

When both scale-writers and redistribute-writes are set to true, scale-writers takes precedence.

The corresponding session property is scale_writers.

`check-access-control-on-utilized-columns-only`¶

Type: boolean
Default value: true

Apply access control rules on only those columns that are required to produce the query output.

Note: Setting this property to true with the following kinds of queries:

queries that have USING in a join condition
queries that have duplicate named common table expressions (CTE)

causes the query to be evaluated as if the property is set to false and checks the access control for all columns.

To avoid these problems:

replace all USING join conditions in a query with ON join conditions
set unique names for all CTEs in a query

The corresponding session property is check_access_control_on_utilized_columns_only.

`always-analyze-create-table-query-enabled`¶

Type: boolean
Default value: false

When enabled, CREATE TABLE AS SELECT IF NOT EXISTS statements that target an existing table will still analyze the inner SELECT query.

The corresponding session property is always_analyze_create_table_query_enabled.

`eager-plan-validation-enabled`¶

Type: boolean
Default value: false

This property enables the eager building and validation of a logical plan. When enabled, the logical plan will begin to be built and validated before queueing and allocation of cluster resources so that any errors or incompatibilities in the query plan will fail quickly and inform the user.

`single-node-execution-enabled`¶

Type: boolean
Default value: false

This property ensures that queries scheduled in this cluster use only a single node for execution, which may improve performance for small queries which can be executed within a single node.

The corresponding session property is single_node_execution_enabled.

`exclude-invalid-worker-session-properties`¶

Type: boolean
Default value: false

When exclude-invalid-worker-session-properties is true, worker session properties that are incompatible with the cluster type are excluded. For example, when native-execution-enabled is true, java-worker only session properties are excluded and the native-worker only session properties are included.

`per-query-retry-limit`¶

Type: integer
Minimum value: 0
Default value: 0

The number of times that a query is automatically retried in the case of a transient query or communications failure. The default value 0 means that retries are disabled.

`http-server.max-request-header-size`¶

Type: data size
Default value: 8 kB

The maximum size of the request header from the HTTP server.

Note: The default value can cause errors when large session properties or other large session information is involved. See Request Header Fields Too Large.

`offset-clause-enabled`¶

Type: boolean
Default value: false

To enable the OFFSET clause in SQL query expressions, set this property to true.

The corresponding session property is offset_clause_enabled.

`max-serializable-object-size`¶

Type: long
Default value: 1000

Maximum object size in bytes that can be considered serializable in a function call by the coordinator.

The corresponding session property is max_serializable_object_size.

`max-prefixes-count`¶

Type: integer
Minimum value: 1
Default value: 100

Maximum number of prefixes (catalog/schema/table scopes used to narrow metadata lookups) that Presto generates when querying information_schema. If the number of computed prefixes exceeds this limit, Presto falls back to a single broader prefix (catalog only). If it’s below the limit, the generated prefixes are used.

The corresponding session property is max_prefixes_count.

`cluster-tag`¶

Type: string
Default value: (none)

An optional identifier for the cluster. When set, this tag is included in the response from the /v1/cluster REST API endpoint, allowing clients to identify which cluster provided the response.

`try-function-catchable-errors`¶

Type: string
Default value: "" (empty string)

A comma-separated list of error code names that the TRY() function should catch and return NULL for, in addition to the default catchable errors (such as DIVISION_BY_ZERO, INVALID_CAST_ARGUMENT, INVALID_FUNCTION_ARGUMENT, and NUMERIC_VALUE_OUT_OF_RANGE).

This allows administrators to configure which additional errors TRY() should suppress at the server level. Error codes are matched by their name (such as GENERIC_INTERNAL_ERROR, INVALID_ARGUMENTS).

The corresponding session property is try_function_catchable_errors.

Memory Management Properties¶

`query.max-memory-per-node`¶

Type: data size
Default value: JVM max memory * 0.1

This is the max amount of user memory a query can use on a worker. User memory is allocated during execution for things that are directly attributable to or controllable by a user query. For example, memory used by the hash tables built during execution, memory used during sorting, etc. When the user memory allocation of a query on any worker hits this limit it will be killed.

`query.max-total-memory-per-node`¶

Type: data size
Default value: query.max-memory-per-node * 2

This is the max amount of user and system memory a query can use on a worker. System memory is allocated during execution for things that are not directly attributable to or controllable by a user query. For example, memory allocated by the readers, writers, network buffers, etc. When the sum of the user and system memory allocated by a query on any worker hits this limit it will be killed. The value of query.max-total-memory-per-node must be greater than query.max-memory-per-node.

`query.max-memory`¶

Type: data size
Default value: 20GB

This is the max amount of user memory a query can use across the entire cluster. User memory is allocated during execution for things that are directly attributable to or controllable by a user query. For example, memory used by the hash tables built during execution, memory used during sorting, etc. When the user memory allocation of a query across all workers hits this limit it will be killed.

`query.max-total-memory`¶

Type: data size
Default value: query.max-memory * 2

This is the max amount of user and system memory a query can use across the entire cluster. System memory is allocated during execution for things that are not directly attributable to or controllable by a user query. For example, memory allocated by the readers, writers, network buffers, etc. When the sum of the user and system memory allocated by a query across all workers hits this limit it will be killed. The value of query.max-total-memory must be greater than query.max-memory.

`memory.heap-headroom-per-node`¶

Type: data size
Default value: JVM max memory * 0.3

This is the amount of memory set aside as headroom/buffer in the JVM heap for allocations that are not tracked by Presto.

`query.low-memory-killer.policy`¶

Type: string
Default value: none

The policy used for selecting the query to kill when the cluster is out of memory (OOM). This property can have one of the following values: none, total-reservation, or total-reservation-on-blocked-nodes. none disables the cluster OOM killer. The value of total-reservation configures a policy that kills the query with the largest memory reservation across the cluster. The value of total-reservation-on-blocked-nodes configures a policy that kills the query using the most memory on the workers that are out of memory (blocked).

Spilling Properties¶

`experimental.spill-enabled`¶

Type: boolean
Default value: false

Try spilling memory to disk to avoid exceeding memory limits for the query.

Spilling works by offloading memory to disk. This process can allow a query with a large memory footprint to pass at the cost of slower execution times. See Supported Operations for a list of operations that support spilling.

Be aware that this is an experimental feature and should be used with care.

The corresponding session property is spill_enabled.

`experimental.join-spill-enabled`¶

Type: boolean
Default value: true

When spill_enabled is true, this determines whether Presto will try spilling memory to disk for joins to avoid exceeding memory limits for the query.

The corresponding session property is join_spill_enabled.

`experimental.aggregation-spill-enabled`¶

Type: boolean
Default value: true

When spill_enabled is true, this determines whether Presto will try spilling memory to disk for aggregations to avoid exceeding memory limits for the query.

The corresponding session property is aggregation_spill_enabled.

`experimental.distinct-aggregation-spill-enabled`¶

Type: boolean
Default value: true

When aggregation_spill_enabled is true, this determines whether Presto will try spilling memory to disk for distinct aggregations to avoid exceeding memory limits for the query.

The corresponding session property is distinct_aggregation_spill_enabled.

`experimental.order-by-aggregation-spill-enabled`¶

Type: boolean
Default value: true

When aggregation_spill_enabled is true, this determines whether Presto will try spilling memory to disk for order by aggregations to avoid exceeding memory limits for the query.

The corresponding session property is order_by_aggregation_spill_enabled.

`experimental.window-spill-enabled`¶

Type: boolean
Default value: true

When spill_enabled is true, this determines whether Presto will try spilling memory to disk for window functions to avoid exceeding memory limits for the query.

The corresponding session property is window_spill_enabled.

`experimental.order-by-spill-enabled`¶

Type: boolean
Default value: true

When spill_enabled is true, this determines whether Presto will try spilling memory to disk for order by to avoid exceeding memory limits for the query.

The corresponding session property is order_by_spill_enabled.

`experimental.spiller.task-spilling-strategy`¶

Type: string
Allowed values: ORDER_BY_CREATE_TIME, ORDER_BY_REVOCABLE_BYTES, PER_TASK_MEMORY_THRESHOLD
Default value: ORDER_BY_CREATE_TIME

Determines the strategy to use to choose when to revoke memory and from which tasks.

ORDER_BY_CREATE_TIME and ORDER_BY_REVOCABLE_BYTES will trigger spilling when the memory pool is filled beyond the experimental.memory-revoking-threshold until the memory pool usage is below experimental.memory-revoking-target. ORDER_BY_CREATE_TIME will trigger revocation from older tasks first, while ORDER_BY_REVOCABLE_BYTES will trigger revocation from tasks that are using more revocable memory first.

PER_TASK_MEMORY_THRESHOLD will trigger spilling whenever the revocable memory used by a task exceeds experimental.spiller.max-revocable-task-memory.

Warning

The PER_TASK_MEMORY_THRESHOLD strategy does not trigger spilling when the memory pool is full, which can prevent the out of memory query killer from kicking in. This is particularly risky if Presto is running without a reserved memory pool.

`experimental.memory-revoking-threshold`¶

Type: double
Minimum value: 0
Maximum value: 1
Default value: 0.9

Trigger memory revocation when the memory pool is filled above this percentage.

`experimental.memory-revoking-target`¶

Type: double
Minimum value: 0
Maximum value: 1
Default value: 0.5

When revoking memory, try to revoke enough that the memory pool is filled below the target percentage at the end.

`experimental.query-limit-spill-enabled`¶

Type: boolean
Default value: false

When spill is enabled and experimental.spiller.task-spilling-strategy is ORDER_BY_CREATE_TIME or ORDER_BY_REVOCABLE_BYTES, then also spill revocable memory from a query whenever its combined revocable, user, and system memory exceeds query_max_total_memory_per_node. This allows queries to have more consistent performance regardless of the load on the cluster at the cost of less efficient use of available memory.

`experimental.spiller.max-revocable-task-memory`¶

Type: data size
Default value: 500MB

If experimental.spiller.task-spilling-strategy is set to PER_TASK_MEMORY_THRESHOLD, this property defines the threshold at which to trigger spilling for a task. This property is ignored for any other spilling strategy.

`experimental.max-revocable-memory-per-node`¶

Type: data size
Default value: 16GB

This property defines the amount of revocable memory a query can use on each node

`experimental.spiller-spill-path`¶

Type: string
No default value. Must be set when spilling is enabled

Directory where spilled content will be written. It can be a comma separated list to spill simultaneously to multiple directories, which helps to utilize multiple drives installed in the system.

It is not recommended to spill to system drives. Most importantly, do not spill to the drive on which the JVM logs are written, as disk overutilization might cause JVM to pause for lengthy periods, causing queries to fail.

`experimental.spiller-max-used-space-threshold`¶

Type: double
Default value: 0.9

If disk space usage ratio of a given spill path is above this threshold, this spill path will not be eligible for spilling.

`experimental.spiller-threads`¶

Type: integer
Default value: 4

Number of spiller threads. Increase this value if the default is not able to saturate the underlying spilling device (for example, when using RAID).

`experimental.max-spill-per-node`¶

Type: data size
Default value: 100 GB

Max spill space to be used by all queries on a single node.

`experimental.query-max-spill-per-node`¶

Type: data size
Default value: 100 GB

Max spill space to be used by a single query on a single node.

`experimental.aggregation-operator-unspill-memory-limit`¶

Type: data size
Default value: 4 MB

Limit for memory used for unspilling a single aggregation operator instance.

The corresponding session property is aggregation_operator_unspill_memory_limit.

`experimental.spill-compression-codec`¶

Type: string
Allowed value: SNAPPY, NONE, GZIP, LZ4, LZO,, ZLIB ZSTD
Default value: NONE

The data compression codec to be used for pages spilled to disk.

`experimental.spill-encryption-enabled`¶

Type: boolean
Default value: false

Enables using a randomly generated secret key (per spill file) to encrypt and decrypt data spilled to disk

`experimental.spiller.single-stream-spiller-choice`¶

Type: String
Default value: LOCAL_FILE

The Single Stream Spiller to be used when spilling is enabled. There are two options LOCAL_FILE (default) and TEMP_STORAGE.

`experimental.spiller.spiller-temp-storage`¶

Type: String
Default value: local

Temp storage used by spiller when experimental.spiller.single-stream-spiller-choice is set to TEMP_STORAGE

`experimental.temp-storage-buffer-size`¶

Type: Data Size
Default value: 4KB

Size of buffer when experimental.spiller.single-stream-spiller-choice is set to TEMP_STORAGE

Exchange Properties¶

Exchanges transfer data between Presto nodes for different stages of a query. Adjusting these properties may help to resolve inter-node communication issues or improve network utilization.

`exchange.client-threads`¶

Type: integer
Minimum value: 1
Default value: 25

Number of threads used by exchange clients to fetch data from other Presto nodes. A higher value can improve performance for large clusters or clusters with very high concurrency, but excessively high values may cause a drop in performance due to context switches and additional memory usage.

`exchange.concurrent-request-multiplier`¶

Type: integer
Minimum value: 1
Default value: 3

Multiplier determining the number of concurrent requests relative to available buffer memory. The maximum number of requests is determined using a heuristic of the number of clients that can fit into available buffer space based on average buffer usage per request times this multiplier. For example, with an exchange.max-buffer-size of 32 MB and 20 MB already used and average size per request being 2MB, the maximum number of clients is multiplier * ((32MB - 20MB) / 2MB) = multiplier * 6. Tuning this value adjusts the heuristic, which may increase concurrency and improve network utilization.

`exchange.max-buffer-size`¶

Type: data size
Default value: 32MB

Size of buffer in the exchange client that holds data fetched from other nodes before it is processed. A larger buffer can increase network throughput for larger clusters and thus decrease query processing time, but will reduce the amount of memory available for other usages.

`exchange.max-response-size`¶

Type: data size
Minimum value: 1MB
Default value: 16MB

Maximum size of a response returned from an exchange request. The response will be placed in the exchange client buffer which is shared across all concurrent requests for the exchange.

Increasing the value may improve network throughput if there is high latency. Decreasing the value may improve query performance for large clusters as it reduces skew due to the exchange client buffer holding responses for more tasks (rather than hold more data from fewer tasks).

`sink.max-buffer-size`¶

Type: data size
Default value: 32MB

Output buffer size for task data that is waiting to be pulled by upstream tasks. If the task output is hash partitioned, then the buffer will be shared across all of the partitioned consumers. Increasing this value may improve network throughput for data transferred between stages if the network has high latency or if there are many nodes in the cluster.

`use-connector-provided-serialization-codecs`¶

Type: boolean
Default value: false

Enables the use of custom connector-provided serialization codecs for handles. This feature allows connectors to use their own serialization format for handle objects (such as table handles, column handles, and splits) instead of standard JSON serialization.

When enabled, connectors that provide a ConnectorCodecProvider with appropriate codecs will have their handles serialized using custom binary formats, which are then Base64-encoded for transport. Connectors without codec support automatically fall back to standard JSON serialization. Internal Presto handles (prefixed with $) always use JSON serialization regardless of this setting.

Task Properties¶

`task.concurrency`¶

Type: integer
Restrictions: must be a power of two
Default value: 16

Default local concurrency for parallel operators such as joins and aggregations. This value should be adjusted up or down based on the query concurrency and worker resource utilization. Lower values are better for clusters that run many queries concurrently because the cluster will already be utilized by all the running queries, so adding more concurrency will result in slow downs due to context switching and other overhead. Higher values are better for clusters that only run one or a few queries at a time.

The corresponding session property is task_concurrency.

`task.http-response-threads`¶

Type: integer
Minimum value: 1
Default value: 100

Maximum number of threads that may be created to handle HTTP responses. Threads are created on demand and are cleaned up when idle, thus there is no overhead to a large value if the number of requests to be handled is small. More threads may be helpful on clusters with a high number of concurrent queries, or on clusters with hundreds or thousands of workers.

`task.http-timeout-threads`¶

Type: integer
Minimum value: 1
Default value: 3

Number of threads used to handle timeouts when generating HTTP responses. This value should be increased if all the threads are frequently in use. This can be monitored via the com.facebook.presto.server:name=AsyncHttpExecutionMBean:TimeoutExecutor JMX object. If ActiveCount is always the same as PoolSize, increase the number of threads.

`task.info-update-interval`¶

Type: duration
Minimum value: 1ms
Maximum value: 10s
Default value: 3s

Controls staleness of task information, which is used in scheduling. Larger values can reduce coordinator CPU load, but may result in suboptimal split scheduling.

`task.max-partial-aggregation-memory`¶

Type: data size
Default value: 16MB

Maximum size of partial aggregation results for distributed aggregations. Increasing this value can result in less network transfer and lower CPU utilization by allowing more groups to be kept locally before being flushed, at the cost of additional memory usage.

`task.max-worker-threads`¶

Type: integer
Default value: Node CPUs * 2

Sets the number of threads used by workers to process splits. Increasing this number can improve throughput if worker CPU utilization is low and all the threads are in use, but will cause increased heap space usage. Setting the value too high may cause a drop in performance due to a context switching. The number of active threads is available via the RunningSplits property of the com.facebook.presto.execution.executor:name=TaskExecutor.RunningSplits JMX object.

The number of threads can be configured using either an absolute value (for example, 10) or a value relative to the number of available CPU cores (for example, 1.5C). When using a relative value, the number of threads is calculated based on the available CPU cores multiplied by the specified factor (for example, 1.5) and rounded to the nearest integer.

`task.min-drivers`¶

Type: integer
Default value: task.max-worker-threads * 2

The target number of running leaf splits on a worker. This is a minimum value because each leaf task is guaranteed at least 3 running splits. Non-leaf tasks are also guaranteed to run in order to prevent deadlocks. A lower value may improve responsiveness for new tasks, but can result in underutilized resources. A higher value can increase resource utilization, but uses additional memory.

`task.writer-count`¶

Type: integer
Restrictions: must be a power of two
Default value: 1

The number of concurrent writer threads per worker per query. Increasing this value may increase write speed, especially when a query is not I/O bound and can take advantage of additional CPU for parallel writes (some connectors can be bottlenecked on CPU when writing due to compression or other factors). Setting this too high may cause the cluster to become overloaded due to excessive resource utilization.

The corresponding session property is task_writer_count.

`task.interrupt-runaway-splits-timeout`¶

Type: duration
Default value: 10m

Timeout for interrupting split threads blocked without yielding control. Only threads blocked in specific locations are interrupted. Currently this is just threads blocked in the Joni regular expression library.

Node Scheduler Properties¶

`node-scheduler.max-splits-per-node`¶

Type: integer
Default value: 100

The target value for the number of splits that can be running for each worker node, assuming all splits have the standard split weight.

Using a higher value is recommended if queries are submitted in large batches (e.g., running a large group of reports periodically) or for connectors that produce many splits that complete quickly but do not support assigning split weight values to express that to the split scheduler. Increasing this value may improve query latency by ensuring that the workers have enough splits to keep them fully utilized.

When connectors do support weight based split scheduling, the number of splits assigned will depend on the weight of the individual splits. If splits are small, more of them are allowed to be assigned to each worker to compensate.

Setting this too high will waste memory and may result in lower performance due to splits not being balanced across workers. Ideally, it should be set such that there is always at least one split waiting to be processed, but not higher.

`node-scheduler.max-splits-per-task`¶

Type: integer
Default value: 10

The target value for the number of splits that can be running for each task, assuming all splits have the standard split weight.

Using a higher value is recommended if tasks parallelism is higher than 10. Increasing this value may improve query latency by ensuring that the workers have enough splits to keep them fully utilized.

The corresponding session property is schedule_splits_based_on_task_load.

`node-scheduler.max-pending-splits-per-task`¶

Type: integer
Default value: 10

The number of outstanding splits with the standard split weight that can be queued for each worker node for a single stage of a query, even when the node is already at the limit for total number of splits. Allowing a minimum number of splits per stage is required to prevent starvation and deadlocks.

This value must be smaller than node-scheduler.max-splits-per-node, will usually be increased for the same reasons, and has similar drawbacks if set too high.

`node-scheduler.min-candidates`¶

Type: integer
Minimum value: 1
Default value: 10

The minimum number of candidate nodes that will be evaluated by the node scheduler when choosing the target node for a split. Setting this value too low may prevent splits from being properly balanced across all worker nodes. Setting it too high may increase query latency and increase CPU usage on the coordinator.

`node-scheduler.network-topology`¶

Type: string
Allowed values: legacy, flat
Default value: legacy

Sets the network topology to use when scheduling splits. legacy will ignore the topology when scheduling splits. flat will try to schedule splits on the host where the data is located by reserving 50% of the work queue for local splits. It is recommended to use flat for clusters where distributed storage runs on the same nodes as Presto workers.

Optimizer Properties¶

`optimizer.dictionary-aggregation`¶

Type: boolean
Default value: false

Enables optimization for aggregations on dictionaries.

The corresponding session property is dictionary_aggregation.

`optimizer.optimize-hash-generation`¶

Type: boolean
Default value: true

Compute hash codes for distribution, joins, and aggregations early during execution, allowing result to be shared between operations later in the query. This can reduce CPU usage by avoiding computing the same hash multiple times, but at the cost of additional network transfer for the hashes. In most cases it will decrease overall query processing time.

It is often helpful to disable this property when using EXPLAIN in order to make the query plan easier to read.

The corresponding session property is optimize_hash_generation.

`optimizer.optimize-metadata-queries`¶

Type: boolean
Default value: false

Enable optimization of some aggregations by using values that are stored as metadata. This allows Presto to execute some simple queries in constant time. Currently, this optimization applies to max, min and approx_distinct of partition keys and other aggregation insensitive to the cardinality of the input (including DISTINCT aggregates). Using this may speed up some queries significantly.

The main drawback is that it can produce incorrect results if the connector returns partition keys for partitions that have no rows. In particular, the Hive connector can return empty partitions if they were created by other systems (Presto cannot create them).

`optimizer.optimize-single-distinct`¶

Type: boolean
Default value: true

The single distinct optimization will try to replace multiple DISTINCT clauses with a single GROUP BY clause, which can be substantially faster to execute.

`optimizer.pre-aggregate-before-grouping-sets`¶

Type: boolean
Default value: false

When enabled, inserts a partial aggregation below the GroupId node in grouping sets queries to reduce the number of rows that GroupId multiplies across grouping sets. Only applies to decomposable aggregation functions.

The corresponding session property is pre_aggregate_before_grouping_sets.

`optimizer.push-aggregation-through-join`¶

Type: boolean
Default value: true

When an aggregation is above an outer join and all columns from the outer side of the join are in the grouping clause, the aggregation is pushed below the outer join. This optimization is particularly useful for correlated scalar subqueries, which get rewritten to an aggregation over an outer join. For example:

SELECT * FROM item i
    WHERE i.i_current_price > (
        SELECT AVG(j.i_current_price) FROM item j
            WHERE i.i_category = j.i_category);

Enabling this optimization can substantially speed up queries by reducing the amount of data that needs to be processed by the join. However, it may slow down some queries that have very selective joins.

The corresponding session property is push_aggregation_through_join.

`optimizer.push-partial-aggregation-through-join`¶

Type: boolean
Default value: false

When a partial aggregation is above an inner join and all aggregation inputs come from only one side of the join, the partial aggregation is pushed below the join to that side. This reduces the amount of data flowing into the join operator, which can improve performance by allowing the aggregation to pre-reduce data before the join is performed.

The corresponding session property is push_partial_aggregation_through_join.

`optimizer.push-semi-join-through-union`¶

Type: boolean
Default value: false

When enabled, pushes semi-joins through UNION sources so that each union branch performs the semi-join independently. This can improve performance for IN and EXISTS queries where the semi-join probe side is a UNION.

The corresponding session property is push_semi_join_through_union.

`optimizer.push-projection-through-cross-join`¶

Type: boolean
Default value: false

When enabled, pushes projection expressions through cross join nodes so that each expression is evaluated only on the side of the cross join that provides its input variables. This reduces the number of columns flowing through the cross join and avoids recomputing expressions on the multiplied output rows.

Only deterministic expressions are pushed. Expressions that reference variables from both sides of the cross join, or constant expressions, remain above the join.

The corresponding session property is push_projection_through_cross_join.

`optimizer.push-table-write-through-union`¶

Type: boolean
Default value: true

Parallelize writes when using UNION ALL in queries that write data. This improves the speed of writing output tables in UNION ALL queries because these writes do not require additional synchronization when collecting results. Enabling this optimization can improve UNION ALL speed when write speed is not yet saturated. However, it may slow down queries in an already heavily loaded system.

The corresponding session property is push_table_write_through_union.

`optimizer.join-reordering-strategy`¶

Type: string
Allowed values: AUTOMATIC, ELIMINATE_CROSS_JOINS, NONE
Default value: AUTOMATIC

The join reordering strategy to use. NONE maintains the order the tables are listed in the query. ELIMINATE_CROSS_JOINS reorders joins to eliminate cross joins where possible and otherwise maintains the original query order. When reordering joins it also strives to maintain the original table order as much as possible. AUTOMATIC enumerates possible orders and uses statistics-based cost estimation to determine the least cost order. If stats are not available or if for any reason a cost could not be computed, the ELIMINATE_CROSS_JOINS strategy is used.

The corresponding session property is join_reordering_strategy.

`optimizer.max-reordered-joins`¶

Type: integer
Default value: 9

When optimizer.join-reordering-strategy is set to cost-based, this property determines the maximum number of joins that can be reordered at once.

Warning

The number of possible join orders scales factorially with the number of relations, so increasing this value can cause serious performance issues.

`optimizer.use-defaults-for-correlated-aggregation-pushdown-through-outer-joins`¶

Type: boolean
Default value: true

Aggregations can sometimes be pushed below outer joins (see optimizer.push-aggregation-through-join). In general, aggregate functions have custom null-handling behavior. In order to correctly process the null padded rows that may be produced by the outer join, the optimizer introduces a subsequent cross join with corresponding aggregations over a single null value and then coalesces the aggregations from the join output with these null aggregated values.

For certain aggregate functions (those that ignore nulls, COUNT, etc) the cross join may be avoided and the default/known aggregate value over NULL may be coalesced directly with the aggregate outputs of the join. This optimization eliminates the cross join, may convert the outer join into an inner join and thereby produces more optimal plans.

`optimizer.rewrite-expression-with-constant-variable`¶

Type: boolean
Default value: true

Extract expressions which have constant value from filter and assignment expressions, and replace the expressions with constant value.

`optimizer.simplify-aggregations-over-constant`¶

Type: boolean
Default value: false

When enabled, replaces supported aggregation functions over constant arguments with constant expressions. This can improve performance for queries that apply aggregations such as MIN, MAX, ARBITRARY, or APPROX_DISTINCT to constant inputs.

The corresponding session property is simplify_aggregations_over_constant.

`optimizer.simplify-coalesce-over-join-keys`¶

Type: boolean
Default value: false

When enabled, simplifies redundant COALESCE expressions over equi-join keys based on join type. This can produce simpler plans and enable additional join optimizations, including bucketed join optimizations.

The corresponding session property is simplify_coalesce_over_join_keys.

`optimizer.history-based-optimizer-plan-canonicalization-strategies`¶

Type: string
Default value: IGNORE_SAFE_CONSTANTS

Plan canonicalization strategies used to canonicalize a query plan for history based optimization.

`optimizer.track-history-stats-from-failed-queries`¶

Type: boolean
Default value: true

Track history based plan statistics from complete plan fragments in failed queries.

`optimizer.log-plans-used-in-history-based-optimizer`¶

Type: boolean
Default value: false

Log the stats equivalent plan and canonicalized plans used in history based optimization.

`optimizer.exploit-constraints`¶

Type: boolean
Default value: true

Enable analysis and propagation of logical properties like distinct keys or cardinality among the nodes of a query plan. The optimizer may then use these properties to perform various optimizations.

`optimizer.confidence-based-broadcast`¶

Type: boolean
Default value: false

Enable broadcasting based on the confidence of the statistics that are being used, by broadcasting the side of a joinNode which has the highest (HIGH or FACT) confidence statistics. If both sides have the same confidence statistics, then the original behavior will be followed.

The corresponding session property is confidence_based_broadcast.

`optimizer.treat-low-confidence-zero-estimation-as-unknown`¶

Type: boolean
Default value: false

Enable treating LOW confidence, zero estimations as UNKNOWN during joins.

The corresponding session property is treat-low-confidence-zero-estimation-as-unknown.

`optimizer.retry-query-with-history-based-optimization`¶

Type: boolean
Default value: false

Enable retry for failed queries who can potentially be helped by HBO.

The corresponding session property is retry-query-with-history-based-optimization.

`optimizer.inner-join-pushdown-enabled`¶

Type: boolean
Default value: false

Enable push down inner join predicates to database. Only allows equality joins to be pushed down. Use optimizer.inequality-join-pushdown-enabled along with this configuration to push down inequality join predicates.

The corresponding session property is optimizer_inner_join_pushdown_enabled.

`optimizer.inequality-join-pushdown-enabled`¶

Type: boolean
Default value: false

Enable push down inner join inequality predicates to database. For this configuration to be enabled, optimizer.inner-join-pushdown-enabled should be set to true. The corresponding session property is optimizer_inequality_join_pushdown_enabled.

`optimizer.rewrite-bucketed-semi-join-to-join`¶

Type: boolean
Default value: false

When both sides of a semi-join are backed by tables bucketed on the semi-join key, rewrite the SemiJoinNode to a colocated LEFT JOIN with a DISTINCT on the right side. This avoids data shuffle since both sides are already co-partitioned by the join key.

The corresponding session property is rewrite_bucketed_semi_join_to_join.

`optimizer.use-histograms`¶

Type: boolean
Default Value: false

Enables the optimizer to use histograms when available to perform cost estimate calculations during query optimization. When set to false, this parameter prevents histograms from being collected by ANALYZE, and also prevents the existing histograms from being used during query optimization. This behavior can be controlled on a per-query basis using the optimizer_use_histograms session property.

`optimizer.skip-pushdown-through-exchange-for-remote-projection`¶

Type: boolean
Default value: false

When enabled, skips pushing remote projections through exchange nodes. This can preserve exchanges around projections that call external functions, where fixed parallelism or exchange placement is needed to control execution.

The corresponding session property is skip_pushdown_through_exchange_for_remote_projection.

`optimizer.table-scan-shuffle-parallelism-threshold`¶

Type: double
Default value: 0.1

Parallelism threshold for adding a shuffle above table scan. When the table’s parallelism factor is below this threshold (0.0-1.0) and optimizer.table-scan-shuffle-strategy is COST_BASED, a round-robin shuffle exchange is added above the table scan to redistribute data.

The corresponding session property is table_scan_shuffle_parallelism_threshold.

`optimizer.table-scan-shuffle-strategy`¶

Type: string
Allowed values: DISABLED, ALWAYS_ENABLED, COST_BASED
Default value: DISABLED

Strategy for adding shuffle above table scan to redistribute data. When set to DISABLED, no shuffle is added. When set to ALWAYS_ENABLED, a round-robin shuffle exchange is always added above table scans. When set to COST_BASED, a shuffle is added only when the table’s parallelism factor is below the optimizer.table-scan-shuffle-parallelism-threshold.

The corresponding session property is table_scan_shuffle_strategy.

`optimizer.remote-function-names-for-fixed-parallelism`¶

Type: string
Default value: "" (empty string, disabled)

A regular expression pattern to match fully qualified remote function names, such as catalog.schema.function_name, that should use fixed parallelism. When a remote function matches this pattern, the optimizer inserts round-robin shuffle exchanges before and after the projection containing the remote function call. This ensures that the remote function executes with a fixed degree of parallelism, which can be useful for controlling resource usage when calling external services.

This property only applies to external/remote functions (functions where isExternalExecution() returns true, such as functions using THRIFT, GRPC, or REST implementation types).

Example patterns:

myschema.myfunction - matches an exact function name
catalog.schema.remote_.* - matches all functions starting with remote_ in the specified catalog and schema
.*remote.* - matches any function containing remote in its fully qualified name

The corresponding session property is remote_function_names_for_fixed_parallelism.

`optimizer.remote-function-fixed-parallelism-task-count`¶

Type: integer
Default value: null (uses the default hash partition count)

The number of tasks to use for remote functions matching the optimizer.remote-function-names-for-fixed-parallelism pattern. When set, this value determines the degree of parallelism for the round-robin shuffle exchanges inserted around matching remote function projections. If not set, the default hash partition count will be used.

This property is only effective when optimizer.remote-function-names-for-fixed-parallelism is set to a non-empty pattern.

The corresponding session property is remote_function_fixed_parallelism_task_count.

`optimizer.local-exchange-parent-preference-strategy`¶

Type: string
Allowed values: ALWAYS, NEVER, AUTOMATIC
Default value: ALWAYS

Strategy to consider parent preferences when adding local exchange partitioning for aggregations. When set to ALWAYS, the optimizer always uses parent preferences for local exchange partitioning. When set to NEVER, it never uses parent preferences and instead uses the aggregation’s own grouping keys. When set to AUTOMATIC, the optimizer makes a cost-based decision, using parent preferences only when the estimated partition cardinality is greater than or equal to the task concurrency.

The corresponding session property is local_exchange_parent_preference_strategy.

Planner Properties¶

`planner.query-analyzer-timeout`¶

Type: duration
Default value: 3m

Maximum running time for the query analyzer in case the processing takes too long or is stuck in an infinite loop. When timeout expires the planner thread is interrupted and throws exception.

Regular Expression Function Properties¶

The following properties allow tuning the Regular Expression Functions.

`regex-library`¶

Type: string
Allowed values: JONI, RE2J
Default value: JONI

Which library to use for regular expression functions. JONI is generally faster for common usage, but can require exponential time for certain expression patterns. RE2J uses a different algorithm which guarantees linear time, but is often slower.

`re2j.dfa-states-limit`¶

Type: integer
Minimum value: 2
Default value: 2147483647

The maximum number of states to use when RE2J builds the fast but potentially memory intensive deterministic finite automaton (DFA) for regular expression matching. If the limit is reached, RE2J will fall back to the algorithm that uses the slower, but less memory intensive non-deterministic finite automaton (NFA). Decreasing this value decreases the maximum memory footprint of a regular expression search at the cost of speed.

`re2j.dfa-retries`¶

Type: integer
Minimum value: 0
Default value: 5

The number of times that RE2J will retry the DFA algorithm when it reaches a states limit before using the slower, but less memory intensive NFA algorithm for all future inputs for that search. If hitting the limit for a given input row is likely to be an outlier, you want to be able to process subsequent rows using the faster DFA algorithm. If you are likely to hit the limit on matches for subsequent rows as well, you want to use the correct algorithm from the beginning so as not to waste time and resources. The more rows you are processing, the larger this value should be.

Logging Properties¶

`log.max-history`¶

Type: integer
Default value: 30

The log.max-history property controls the number of archive log periods that the application retains. In Presto, one log period corresponds to one day. For instance, if log.max-history is set to 30, the system will keep logs for the past 30 days.

`log.max-size`¶

Type: data size
Default value: 100MB

The maximum file size for the general application log file.

`http-server.log.enabled`¶

Type: boolean
Default value: true

Flag to enable or disable logging for the HTTP server.

`http-server.log.compression.enabled`¶

Type: boolean
Default value: true

Flag to enable or disable compression of the log files of the HTTP server.

`http-server.log.path`¶

Type: string
Default value: var/log/http-request.log

The path to the log file used by the HTTP server. The path is relative to the data directory, configured by the launcher script as detailed in Running Presto.

`http-server.log.max-history`¶

Type: integer
Default value: 15

The http-server.log.max-history property controls the number of archive log periods that the HTTP server retains. In Presto, one log period corresponds to one day. For instance, if http-server.log.max-history is set to 15, the system will keep logs for the past 15 days.

`http-server.log.max-size`¶

Type: data size
Default value: 100MB

The maximum file size for the log file of the HTTP server.

Query Manager Properties¶

`query.client.timeout`¶

Type: Duration
Default value: 5m

This property can be used to configure how long a query runs without contact from the client application, such as the CLI, before it’s abandoned.

The corresponding session property is query_client_timeout.

`query.max-queued-time`¶

Type: Duration
Default value: 100d

Use to configure how long a query can be queued before it is terminated.

The corresponding session property is query_max_queued_time.

`query-manager.query-pacing.max-queries-per-second`¶

Type: integer
Minimum value: 1
Default value: 2147483647 (unlimited)

Maximum number of queries that can be admitted per second globally across all resource groups. This property enables query admission pacing to prevent worker overload when many queries start simultaneously. Pacing only activates when the number of running queries exceeds the threshold configured by query-manager.query-pacing.min-running-queries.

Set to a lower value such as 10 to limit query admission rate during periods of high cluster load. The default value effectively disables pacing.

`query-manager.query-pacing.min-running-queries`¶

Type: integer
Minimum value: 0
Default value: 30

Minimum number of running queries required before query admission pacing is applied. When the total number of running queries is below this threshold, queries are admitted immediately without rate limiting, regardless of the query-manager.query-pacing.max-queries-per-second setting.

This allows the cluster to quickly ramp up when idle while still providing protection against overload when the cluster is busy. Set to 0 to always apply pacing when max-queries-per-second is configured.

`max-total-running-task-count-to-not-execute-new-query`¶

Type: integer
Minimum value: 1
Default value: 2147483647 (unlimited)

Maximum total running task count across all queries on the coordinator. When this threshold is exceeded, new queries are held in the queue rather than being scheduled for execution. This helps prevent coordinator overload by limiting the number of concurrent tasks being managed.

Unlike max-total-running-task-count-to-kill-query which kills queries when the limit is exceeded, this property proactively prevents new queries from starting while allowing existing queries to complete normally.

This property works in conjunction with query admission pacing (query-manager.query-pacing.max-queries-per-second) to provide comprehensive coordinator load management. When both are configured:

Pacing controls the rate at which queries are admitted
This property provides a hard cap on total concurrent tasks

Without query-pacing, the cluster can admit multiple queries at once, which can lead to significantly more concurrent tasks than expected over this limit.

Set to a lower value (e.g., 50000) to limit coordinator task management overhead. The default value effectively disables this feature.

Note

For backwards compatibility, this property can also be configured using the legacy name experimental.max-total-running-task-count-to-not-execute-new-query.

Query Retry Properties¶

`retry.enabled`¶

Type: boolean
Default value: true

Enable cross-cluster retry functionality. When enabled, queries that fail with specific error codes can be automatically retried on a backup cluster if a retry URL is provided.

`retry.allowed-domains`¶

Type: string
Default value: (empty, signifying current second-level domain allowed only)

Comma-separated list of allowed domains for retry URLs. Supports wildcards like *.example.com. For example: cluster1.example.com,*.backup.example.net. When empty (default), only retry URLs from the same domain as the current server are allowed.

`retry.require-https`¶

Type: boolean
Default value: false

Require HTTPS for retry URLs. When enabled, only HTTPS URLs will be accepted for cross-cluster retry operations.

`retry.cross-cluster-error-codes`¶

Type: string
Default value: REMOTE_TASK_ERROR

Comma-separated list of error codes that allow cross-cluster retry. When a query fails with one of these error codes, it can be automatically retried on a backup cluster if a retry URL is provided. Available error codes include standard Presto error codes such as REMOTE_TASK_ERROR, CLUSTER_OUT_OF_MEMORY, etc.

View and Materialized View Properties¶

`default-view-security-mode`¶

Type: string
Allowed values: DEFINER, INVOKER
Default value: DEFINER

Sets the default security mode for views and materialized views when the SECURITY clause is not explicitly specified in CREATE VIEW or CREATE MATERIALIZED VIEW statements.

DEFINER: Views execute with the permissions of the user who created them
INVOKER: Views execute with the permissions of the user querying them

The corresponding session property is default_view_security_mode.

`experimental.legacy-materialized-views`¶

Type: boolean
Default value: true

Use legacy materialized views implementation. Set to false to enable materialized views with security modes (DEFINER and INVOKER), automatic query rewriting, and freshness tracking.

The corresponding session property is legacy_materialized_views.

Warning

Materialized views are experimental. The SPI and behavior may change in future releases.

`experimental.allow-legacy-materialized-views-toggle`¶

Type: boolean
Default value: false

Allow the legacy_materialized_views session property to be changed at runtime. By default, the session property value is locked to the server configuration value and cannot be changed per-session.

Set this to true to allow users to toggle between legacy and new materialized views implementations using the session property. This is intended for testing and migration purposes only.

Warning

This should only be enabled in non-production environments.

`materialized-view-query-rewrite-cost-based-selection-enabled`¶

Type: boolean
Default value: false

Enable cost-based selection when multiple materialized views are available for query rewriting. When enabled, the optimizer evaluates all compatible materialized view rewrites and selects the plan with the lowest estimated cost, instead of using the first compatible view.

The corresponding session property is materialized_view_query_rewrite_cost_based_selection_enabled.

`materialized-view-stale-read-behavior`¶

Type: string
Default value: USE_VIEW_QUERY

Controls behavior when a materialized view is stale and no per-view staleness config is set. Valid values are FAIL (throw an error) or USE_VIEW_QUERY (query base tables instead).

The corresponding session property is materialized_view_stale_read_behavior.

Grouped Execution Properties¶

`grouped-execution-enabled`¶

Type: boolean
Default value: true

Enables grouped execution for queries on bucketed tables. Grouped execution schedules operations in lifespans (one per bucket), processing a subset of data at a time to reduce memory usage.

The corresponding session property is grouped_execution.

`partition-aware-grouped-execution-enabled`¶

Type: boolean
Default value: false

When enabled alongside grouped-execution-enabled, schedules each (bucket, partition-values) pair as a separate lifespan instead of processing all partitions per bucket in a single lifespan. This reduces per-lifespan hash table size for joins and aggregations on bucketed + partitioned tables.

The feature activates when partition columns appear as equi-join keys or GROUP BY keys, and at least 2 distinct partition values exist. When not applicable, queries fall back to standard grouped execution automatically.

The corresponding session property is partition_aware_grouped_execution.

Cluster Overload Properties¶

These properties control cluster-overload throttling, which monitors worker CPU and memory utilization and throttles query admission when the cluster is under heavy load. Cluster-overload throttling is independent of per-resource-group concurrency, memory, and CPU limits, which continue to apply regardless of these settings.

`cluster-overload.enable-throttling`¶

Type: boolean
Default value: false

Enables cluster-overload throttling. When enabled, the coordinator periodically checks worker health and prevents new queries from starting when the cluster is overloaded. When disabled, no overload checks are performed and all queries are admitted immediately (subject to other admission gates such as resource group limits).

`cluster-overload.overload-policy-type`¶

Type: string
Allowed values: overload_worker_cnt_based_throttling, overload_worker_pct_based_throttling
Default value: overload_worker_cnt_based_throttling

The policy used to determine whether the cluster is overloaded. overload_worker_cnt_based_throttling declares the cluster overloaded when the number of overloaded workers exceeds cluster-overload.allowed-overload-workers-cnt. overload_worker_pct_based_throttling declares it overloaded when the fraction of overloaded workers exceeds cluster-overload.allowed-overload-workers-pct.

`cluster-overload.allowed-overload-workers-cnt`¶

Type: integer
Default value: 0

Maximum number of workers that may be in an overloaded state before the cluster is considered overloaded. Only used when cluster-overload.overload-policy-type is set to overload_worker_cnt_based_throttling.

`cluster-overload.allowed-overload-workers-pct`¶

Type: double
Default value: 0.01

Maximum fraction of workers (between 0 and 1) that may be in an overloaded state before the cluster is considered overloaded. Only used when cluster-overload.overload-policy-type is set to overload_worker_pct_based_throttling.

`cluster.overload-check-cache-ttl-secs`¶

Type: integer
Default value: 5

How frequently (in seconds) the coordinator re-evaluates the cluster overload state. Between checks, the most recently computed state is cached and reused.

`cluster-overload.bypass-resource-groups`¶

Type: string
Default value: (empty)

Comma-separated list of fully-qualified resource group ids (dot-separated segments, such as global.admin, global.etl.priority) that bypass cluster-overload throttling. Listed groups are still subject to all other admission gates — per-group concurrency, memory, and CPU limits continue to apply.

A resource group bypasses throttling when it lies on the same root-to-leaf path as any configured entry: equal to a listed group, an ancestor of one, or a descendant of one. Sibling groups that do not appear in the list remain throttled. Path-based matching is required because the admission check runs at every ancestor in the resource group hierarchy; without it a non-listed ancestor would veto a listed leaf group.

The bypass set is snapshotted at coordinator startup; changes require a coordinator restart. A throttlingBypassCount JMX counter on ClusterResourceChecker tracks how many times the bypass was applied.

Malformed entries (such as global..admin) cause a startup failure.

Resource Manager Properties¶

`resource-manager-enabled`¶

Type: boolean
Default value: false

Set to true when a resource manager exists in the cluster.

`resource-manager.http-server-enabled`¶

Type: boolean
Default value: true

Controls whether the resource manager’s REST server is turned on. This will enable nodes to communicate with the resource manager using HTTP/S.

`internal-communication.resource-manager-communication-protocol`¶

Type: string
Allowed values: THRIFT, HTTP
Default value: THRIFT

Controls whether the node will communicate with the resource manager using Thrift, or HTTP/S. HTTPS are supported using the same internal communication HTTPS configs.

To enable SSL/TLS, see Secure Internal Communication.

Presto Configuration Properties¶

General Properties¶

join-distribution-type¶

redistribute-writes¶

scale-writers¶

check-access-control-on-utilized-columns-only¶

always-analyze-create-table-query-enabled¶

eager-plan-validation-enabled¶

single-node-execution-enabled¶

exclude-invalid-worker-session-properties¶

per-query-retry-limit¶

http-server.max-request-header-size¶

offset-clause-enabled¶

max-serializable-object-size¶

max-prefixes-count¶

cluster-tag¶

try-function-catchable-errors¶

Memory Management Properties¶

query.max-memory-per-node¶

query.max-total-memory-per-node¶

query.max-memory¶

query.max-total-memory¶

memory.heap-headroom-per-node¶

query.low-memory-killer.policy¶

Spilling Properties¶

experimental.spill-enabled¶

experimental.join-spill-enabled¶

experimental.aggregation-spill-enabled¶

experimental.distinct-aggregation-spill-enabled¶

experimental.order-by-aggregation-spill-enabled¶

experimental.window-spill-enabled¶

experimental.order-by-spill-enabled¶

experimental.spiller.task-spilling-strategy¶

experimental.memory-revoking-threshold¶

experimental.memory-revoking-target¶

experimental.query-limit-spill-enabled¶

experimental.spiller.max-revocable-task-memory¶

experimental.max-revocable-memory-per-node¶

experimental.spiller-spill-path¶

experimental.spiller-max-used-space-threshold¶

experimental.spiller-threads¶

experimental.max-spill-per-node¶

experimental.query-max-spill-per-node¶

experimental.aggregation-operator-unspill-memory-limit¶

experimental.spill-compression-codec¶

experimental.spill-encryption-enabled¶

experimental.spiller.single-stream-spiller-choice¶

experimental.spiller.spiller-temp-storage¶

experimental.temp-storage-buffer-size¶

Exchange Properties¶

exchange.client-threads¶

exchange.concurrent-request-multiplier¶

exchange.max-buffer-size¶

exchange.max-response-size¶

sink.max-buffer-size¶

use-connector-provided-serialization-codecs¶

Task Properties¶

task.concurrency¶

task.http-response-threads¶

task.http-timeout-threads¶

task.info-update-interval¶

task.max-partial-aggregation-memory¶

task.max-worker-threads¶

task.min-drivers¶

task.writer-count¶

task.interrupt-runaway-splits-timeout¶

Node Scheduler Properties¶

node-scheduler.max-splits-per-node¶

node-scheduler.max-splits-per-task¶

node-scheduler.max-pending-splits-per-task¶

node-scheduler.min-candidates¶

node-scheduler.network-topology¶

Optimizer Properties¶

optimizer.dictionary-aggregation¶

optimizer.optimize-hash-generation¶

optimizer.optimize-metadata-queries¶

optimizer.optimize-single-distinct¶

optimizer.pre-aggregate-before-grouping-sets¶

optimizer.push-aggregation-through-join¶

optimizer.push-partial-aggregation-through-join¶

`join-distribution-type`¶

`redistribute-writes`¶

`scale-writers`¶

`check-access-control-on-utilized-columns-only`¶

`always-analyze-create-table-query-enabled`¶

`eager-plan-validation-enabled`¶

`single-node-execution-enabled`¶

`exclude-invalid-worker-session-properties`¶

`per-query-retry-limit`¶

`http-server.max-request-header-size`¶

`offset-clause-enabled`¶

`max-serializable-object-size`¶

`max-prefixes-count`¶

`cluster-tag`¶

`try-function-catchable-errors`¶

`query.max-memory-per-node`¶

`query.max-total-memory-per-node`¶

`query.max-memory`¶

`query.max-total-memory`¶

`memory.heap-headroom-per-node`¶

`query.low-memory-killer.policy`¶

`experimental.spill-enabled`¶

`experimental.join-spill-enabled`¶

`experimental.aggregation-spill-enabled`¶

`experimental.distinct-aggregation-spill-enabled`¶

`experimental.order-by-aggregation-spill-enabled`¶

`experimental.window-spill-enabled`¶

`experimental.order-by-spill-enabled`¶

`experimental.spiller.task-spilling-strategy`¶

`experimental.memory-revoking-threshold`¶

`experimental.memory-revoking-target`¶

`experimental.query-limit-spill-enabled`¶

`experimental.spiller.max-revocable-task-memory`¶

`experimental.max-revocable-memory-per-node`¶

`experimental.spiller-spill-path`¶

`experimental.spiller-max-used-space-threshold`¶

`experimental.spiller-threads`¶

`experimental.max-spill-per-node`¶

`experimental.query-max-spill-per-node`¶

`experimental.aggregation-operator-unspill-memory-limit`¶

`experimental.spill-compression-codec`¶

`experimental.spill-encryption-enabled`¶

`experimental.spiller.single-stream-spiller-choice`¶

`experimental.spiller.spiller-temp-storage`¶

`experimental.temp-storage-buffer-size`¶

`exchange.client-threads`¶

`exchange.concurrent-request-multiplier`¶

`exchange.max-buffer-size`¶

`exchange.max-response-size`¶

`sink.max-buffer-size`¶

`use-connector-provided-serialization-codecs`¶

`task.concurrency`¶

`task.http-response-threads`¶

`task.http-timeout-threads`¶

`task.info-update-interval`¶

`task.max-partial-aggregation-memory`¶

`task.max-worker-threads`¶

`task.min-drivers`¶

`task.writer-count`¶

`task.interrupt-runaway-splits-timeout`¶

`node-scheduler.max-splits-per-node`¶

`node-scheduler.max-splits-per-task`¶

`node-scheduler.max-pending-splits-per-task`¶

`node-scheduler.min-candidates`¶

`node-scheduler.network-topology`¶

`optimizer.dictionary-aggregation`¶

`optimizer.optimize-hash-generation`¶

`optimizer.optimize-metadata-queries`¶

`optimizer.optimize-single-distinct`¶

`optimizer.pre-aggregate-before-grouping-sets`¶

`optimizer.push-aggregation-through-join`¶

`optimizer.push-partial-aggregation-through-join`¶

`optimizer.push-semi-join-through-union`¶

`optimizer.push-projection-through-cross-join`¶

`optimizer.push-table-write-through-union`¶

`optimizer.join-reordering-strategy`¶

`optimizer.max-reordered-joins`¶

`optimizer.use-defaults-for-correlated-aggregation-pushdown-through-outer-joins`¶

`optimizer.rewrite-expression-with-constant-variable`¶

`optimizer.simplify-aggregations-over-constant`¶