The PartitionedTableScan operator

0.0.201

This topic describes how to use the PartitionedTableScan (PTS) operator, the limits of using this operator, and a performance comparison between this operator and the Append operator.

Prerequisites

The feature is supported on PolarDB for PostgreSQL clusters that run the following engine:

PostgreSQL 14 (revision version 14.9.15.0 or later)

Note

You can run the following statement to view the revision version of a PolarDB for PostgreSQL cluster:

select version();

Background information

During the scan of a partitioned table, the optimizer generates the optimal query plan for each partition, and then uses the Append operator to combine these plans in parallel as the optimal query plan for the entire partitioned table. If the number of partitions is small, this process is fast. However, PolarDB for PostgreSQL does not limit the number of partitions in a partitioned table. As the number of partitions grows, the time consumed by the optimizer and the memory consumed by SQL statement executions significantly increase. The performance of querying the partitioned table is notably lower than the performance of querying a common table of the same size.

To meet the requirements for query performance, PolarDB for PostgreSQL provides the PTS operator as a more efficient solution. Compared with the Append operator, the PTS operator significantly reduces the time required by the optimizer to generate query plans and uses less memory to execute SQL statements to avoid OOM errors.

Limits

The PTS operator can be used only in SELECT statements.
The PTS operator cannot be used in partition-wise joins. If you enable partition-wise joins, the execution plan does not contain PTS operators.

Parameters

Parameter	description

Parameter

description

polar_num_parts_for_pts

The number of partitions in a partitioned table that is used as the threshold for enabling the PTS operator. Default value: 64. Valid values:

A positive integer: The PTS operator is used when the number of partitions exceeds this value.
0: The PTS operator is not used regardless of the number of partitions.
-1: The PTS operator is used regardless of the number of partitions.

Usage

Enable the PTS operator by using parameters

SET polar_num_parts_for_pts TO 64;

Use hints

Use the hint PTScan(tablealias). See the following example:

EXPLAIN (COSTS OFF, ANALYZE) /*+ PTScan(part_range) */ SELECT * FROM part_range;
                                   QUERY PLAN
--------------------------------------------------------------------------------
 PartitionedTableScan on part_range (actual time=86.404..86.405 rows=0 loops=1)
   Scan 1000 Partitions: part_range_p0, part_range_p1, part_range_p2,...
   ->  Seq Scan on part_range
 Planning Time: 36.613 ms
 Execution Time: 89.246 ms
(5 rows)

Parallel query

The PTS operator supports parallel queries. Inter-partition parallelism and hybrid parallelism are supported and enabled by default. No further configurations are required.

Inter-partition parallelism: Each worker queries only one partition.
Hybrid parallelism: Both inter-partition parallel queries and intra-partition parallel queries can be performed.

Examples

Create two partitioned tables and create 1,000 partitions.

CREATE TABLE part_range (a INT, b VARCHAR, c NUMERIC, d INT8) PARTITION BY RANGE (a);
SELECT 'CREATE TABLE part_range_p' || i || ' PARTITION OF part_range FOR VALUES FROM (' || 10 * i || ') TO (' || 10 * (i + 1) || ');'
FROM generate_series(0,999) i;\gexec

CREATE TABLE part_range2 (a INT, b VARCHAR, c NUMERIC, d INT8) PARTITION BY RANGE (a);
SELECT 'CREATE TABLE part_range2_p' || i || ' PARTITION OF part_range2 FOR VALUES FROM (' || 10 * i || ') TO (' || 10 * (i + 1) || ');'
FROM generate_series(0,999) i;\gexec

The following query plan for a full table scan on the partitioned table is generated.

SET polar_num_parts_for_pts TO 0;
EXPLAIN (COSTS OFF, ANALYZE) SELECT * FROM part_range;
                                         QUERY PLAN
---------------------------------------------------------------------------------------------
 Append (actual time=8.376..8.751 rows=0 loops=1)
   ->  Seq Scan on part_range_p0 part_range_1 (actual time=0.035..0.036 rows=0 loops=1)
   ->  Seq Scan on part_range_p1 part_range_2 (actual time=0.009..0.009 rows=0 loops=1)
   ->  Seq Scan on part_range_p2 part_range_3 (actual time=0.010..0.011 rows=0 loops=1)
  ...
  ...
  ...
   ->  Seq Scan on part_range_p997 part_range_998 (actual time=0.009..0.009 rows=0 loops=1)
   ->  Seq Scan on part_range_p998 part_range_999 (actual time=0.010..0.010 rows=0 loops=1)
   ->  Seq Scan on part_range_p999 part_range_1000 (actual time=0.009..0.009 rows=0 loops=1)
 Planning Time: 785.169 ms
 Execution Time: 163.534 ms
(1003 rows)

When two partitioned tables are joined for queries, the query performance is even lower and even more memory is consumed by SQL statement executions.

=> SET polar_num_parts_for_pts TO 0;
=> EXPLAIN (COSTS OFF, ANALYZE)
      SELECT COUNT(*) FROM part_range a
      JOIN part_range2 b ON a.a = b.a
      WHERE b.c = '0001';
                                                         QUERY PLAN
----------------------------------------------------------------------------------------------------------------------------
 Finalize Aggregate (actual time=3191.718..3212.437 rows=1 loops=1)
   ->  Gather (actual time=2735.417..3212.288 rows=3 loops=1)
         Workers Planned: 2
         Workers Launched: 2
         ->  Partial Aggregate (actual time=2667.247..2667.789 rows=1 loops=3)
               ->  Parallel Hash Join (actual time=1.957..2.497 rows=0 loops=3)
                     Hash Cond: (a.a = b.a)
                     ->  Parallel Append (never executed)
                           ->  Parallel Seq Scan on part_range_p0 a_1 (never executed)
                           ->  Parallel Seq Scan on part_range_p1 a_2 (never executed)
                           ->  Parallel Seq Scan on part_range_p2 a_3 (never executed)
                           ...
                           ...
                           ...
                           ->  Parallel Seq Scan on part_range_p997 a_998 (never executed)
                           ->  Parallel Seq Scan on part_range_p998 a_999 (never executed)
                           ->  Parallel Seq Scan on part_range_p999 a_1000 (never executed)
                     ->  Parallel Hash (actual time=0.337..0.643 rows=0 loops=3)
                           Buckets: 4096  Batches: 1  Memory Usage: 0kB
                           ->  Parallel Append (actual time=0.935..1.379 rows=0 loops=1)
                                 ->  Parallel Seq Scan on part_range2_p0 b_1 (actual time=0.001..0.001 rows=0 loops=1)
                                       Filter: (c = '1'::numeric)
                                 ->  Parallel Seq Scan on part_range2_p1 b_2 (actual time=0.001..0.001 rows=0 loops=1)
                                       Filter: (c = '1'::numeric)
                                 ->  Parallel Seq Scan on part_range2_p2 b_3 (actual time=0.001..0.001 rows=0 loops=1)
                                       Filter: (c = '1'::numeric)
                                 ...
                                 ...
                                 ...
                                 ->  Parallel Seq Scan on part_range2_p997 b_998 (actual time=0.001..0.001 rows=0 loops=1)
                                       Filter: (c = '1'::numeric)
                                 ->  Parallel Seq Scan on part_range2_p998 b_999 (actual time=0.000..0.001 rows=0 loops=1)
                                       Filter: (c = '1'::numeric)
                                 ->  Parallel Seq Scan on part_range2_p999 b_1000 (actual time=0.002..0.002 rows=0 loops=1)
                                       Filter: (c = '1'::numeric)
 Planning Time: 1900.615 ms
 Execution Time: 3694.320 ms
(3013 rows)

The example shows that the performance of a full table query of a partitioned table is lower than that of a common table. This is because a full table query does not include any condition that can limit the query in a specific partition. Partition pruning can be used in a full table query to query only a few partitions. However, some online analytical processing (OLAP) queries still need to scan the entire partitioned table. In this case, the PTS operator is more efficient than the Append operator.

SET polar_num_parts_for_pts TO 10;
EXPLAIN (COSTS OFF, ANALYZE) SELECT * FROM part_range;
                                   QUERY PLAN
--------------------------------------------------------------------------------
 PartitionedTableScan on part_range (actual time=86.404..86.405 rows=0 loops=1)
   Scan 1000 Partitions: part_range_p0, part_range_p1, part_range_p2,...
   ->  Seq Scan on part_range
 Planning Time: 36.613 ms
 Execution Time: 89.246 ms
(5 rows)

SET polar_num_parts_for_pts TO 10;
EXPLAIN (COSTS OFF, ANALYZE)
      SELECT COUNT(*) FROM part_range a
      JOIN part_range2 b ON a.a = b.a
      WHERE b.c = '0001';
                                             QUERY PLAN
----------------------------------------------------------------------------------------------------
 Aggregate (actual time=61.384..61.388 rows=1 loops=1)
   ->  Merge Join (actual time=61.378..61.381 rows=0 loops=1)
         Merge Cond: (a.a = b.a)
         ->  Sort (actual time=61.377..61.378 rows=0 loops=1)
               Sort Key: a.a
               Sort Method: quicksort  Memory: 25kB
               ->  PartitionedTableScan on part_range a (actual time=61.342..61.343 rows=0 loops=1)
                     Scan 1000 Partitions: part_range_p0, part_range_p1, part_range_p2, ...
                     ->  Seq Scan on part_range a
         ->  Sort (never executed)
               Sort Key: b.a
               ->  PartitionedTableScan on part_range2 b (never executed)
                     ->  Seq Scan on part_range2 b
                           Filter: (c = '1'::numeric)
 Planning Time: 96.675 ms
 Execution Time: 64.913 ms
(16 rows)

The results show that the time required to generate the query plan by using the PTS operator is much less than that by using the Append operator.

Performance comparison

Note

The following test data is generated in a testing environment, and is provided only for reference. The tests are conducted in consistent environment configurations to compare the performance differences between Append and PTS. The number of partitions is the only variable.

The time required to generate the query plan of a single SQL statement

Number of partitions	Append	PTS

Number of partitions	Append	PTS
16	0.266 ms	0.067 ms
32	1.820 ms	0.258 ms
64	3.654 ms	0.402 ms
128	7.010 ms	0.664 ms
256	14.095 ms	1.247 ms
512	27.697 ms	2.328 ms
1024	73.176 ms	4.165 ms

Memory usage by a single SQL statement

Number of partitions	Append	PTS

Number of partitions	Append	PTS
16	1170 KB	1044 KB
32	1240 KB	1044 KB
64	2120 KB	1624 KB
128	2244 KB	1524 KB
256	2888 KB	2072 KB
512	4720 KB	3012 KB
1024	8236 KB	5280 KB

PGBench QPS

Number of partitions	Append	PTS

Number of partitions	Append	PTS
16	25318	93950
32	10906	61879
64	5281	30839
128	2195	16684
256	920	8372
512	92	3708
1024	21	1190

Feedback

Previous: Improve the performance of query planningNext: Indexes

On this page （1, T）

Prerequisites

Background information

Limits

Parameters

Usage

Examples

Performance comparison

Chat now with Alibaba Cloud Customer Service to assist you in finding the right products and services to meet your needs.

Prerequisites

Background information

Limits

Parameters

Usage

Enable the PTS operator by using parameters

Use hints

Parallel query

Examples

Performance comparison

The time required to generate the query plan of a single SQL statement

Memory usage by a single SQL statement

PGBench QPS

Sales Support

Technical Support

Connect & Report Abuse

About Alibaba Cloud

Our Global Network

Quick Start

Global Offices

Olympic Games Paris 2024 New

Stade Roland Garros – Glitz from the Past New

Place de la Concorde – “Breaking” the Barriers New

Vaires-sur-Marne Nautical Stadium – Sports with Sustainability New

International Broadcast Center – Images, Sounds, and Data that Captivate Billions New

Customer Success Stories New

Trust Center

Security & Compliance Center

Cloud Compliance Resources

Security Compliance FAQs

Product & Feature Update New

Cloud Forward

Press Room

Alibaba Cloud e-Magazine New

Alibaba Cloud in Analyst Research

Notice

Go Global Service New

Go Global Alliance with Alibaba Cloud

Asia Accelerator Hot

Information Compliance

China Gateway - MLPS 2.0 Compliance New

China Gateway - Networking

China Gateway - Global Application Acceleration New

China Gateway - Security

China Gateway - Data Security New

ICP Support Hot

China Gateway - Omnichannel Data Mid-End New

China Gateway - Organizational Data Mid-End New

China Gateway - Business Mid-End New

China Gateway - AI Service for Conversational Chatbots New

China Gateway - Online Education

China Gateway - Domain Registration

Work at Alibaba Cloud

Experienced Professionals

Students and Graduates

Free Trial

Pricing

Promo Center

Price Reduction

Pay Less and Deploy More

FinOps

Elastic Compute Service (ECS)

Simple Application Server (SAS)

Elastic GPU Service

Elastic Desktop Service (EDS)

Object Storage Service (OSS)

Cloud Enterprise Network (CEN)

Web Application Firewall (WAF)

Domain Names

Container Compute Service (ACS)

Secure Access Service Edge (SASE)

Intelligent Media Services(IMS)

Edge Security Acceleration (ESA)(Original DCDN)

Intelligent Media Management

DingTalk Enterprise

YiDA

Alibaba Cloud Model Studio

Apsara Prime - For Easy Cloud Product Selection

Alibaba Cloud ECS - Cater All Your Cloud Hosting Needs

1TB CDN—Get Free 1 TB Outbound Traffic Plan Now

Security—Under Attack? Get Free Security Support

Short Message Service - Free Testing is Available

Elastic Compute Service (ECS) Hot

CloudBox

Compute Nest

Dedicated Host Hot

ECS Bare Metal Instance

Elastic GPU Service Featured

Simple Application Server (SAS) Hot

Auto Scaling

Cloud Phone Beta

Elastic Desktop Service (EDS) Featured

Batch Compute

Elastic High Performance Computing (E-HPC)

Super Computing Cluster (SCC)

Function Compute (FC)