MAX_NUM_RUNTIME_FILTERS Query Option (CDH 5.7 or higher only)
The MAX_NUM_RUNTIME_FILTERS query option sets an upper limit on the number of runtime filters that can be produced for each query.
Type: integer
Default: 10
Added in: CDH 5.7.0 / Impala 2.5.0
Usage notes:
Each runtime filter imposes some memory overhead on the query. Depending on the setting of the RUNTIME_BLOOM_FILTER_SIZE query option, each filter might consume between 1 and 16 megabytes per plan fragment. There are typically 5 or fewer filters per plan fragment.
Impala evaluates the effectiveness of each filter, and keeps the ones that eliminate the largest number of partitions or rows. Therefore, this setting can protect against potential problems due to excessive memory overhead for filter production, while still allowing a high level of optimization for suitable queries.
Because the runtime filtering feature applies mainly to resource-intensive and long-running queries, only adjust this query option when tuning long-running queries involving some combination of large partitioned tables and joins involving large tables.
Kudu considerations:
This query option affects only Bloom filters, not the min/max filters that are applied to Kudu tables. Therefore, it does not affect the performance of queries against Kudu tables.
Runtime Filtering for Impala Queries (CDH 5.7 or higher only), RUNTIME_BLOOM_FILTER_SIZE Query Option (CDH 5.7 or higher only), RUNTIME_FILTER_MODE Query Option (CDH 5.7 or higher only)
<< MAX_IO_BUFFERS Query Option | ©2016 Cloudera, Inc. All rights reserved | MAX_ROW_SIZE Query Option >> |
Terms and Conditions Privacy Policy |