You can execute the CREATE PARTITION TABLE statement to create a partitioned table. This topic describes how to use the CREATE PARTITION TABLE statement.
Description
A parent table is partitioned into different child tables based on partition key values. Data in the child tables is publicly available. Unless otherwise stated, parent tables mentioned in this topic refer to parent partitioned tables, and child tables mentioned in this topic refer to child partitioned tables.
Before you can use a partitioned table, you must create child tables in advance. You can execute the CREATE PARTITION TABLE
statement to create a partitioned table. You can also enable dynamic partitioning for a table to automatically create child tables.
Child tables of a partitioned table are stored in different files. To query data in a partitioned table, you must specify a partition. This way, the system does not need to scan all the partitions in the table and can quickly locate the desired file. This improves the efficiency. In most cases, a fact table is divided into different partitions by date. A child table of a partitioned table is equivalent to a non-partitioned table in metadata storage. Therefore, a large number of partitions cause the volume of metadata to increase and result in a large number of small files and fragments.
If your data source is a database, we recommend that you do not use partitioned tables. If you use partitioned tables in this scenario, excessive partitions may waste I/O resources. To resolve this issue and implement index-based query acceleration, you can specify the commonly used partition fields to constitute the segment key.
Limits
Hologres allows you to import data to a child table rather than a parent table.
NoteRealtime Compute for Apache Flink allows you to import data to a parent table in Hologres in real time. For more information, see Write data to a partitioned result table in Hologres in real time.
Each partitioning rule can be used to create only one partitioned table.
The
PARTITION BY
clause supports onlylist partitioning
. The partition key must be a single column.If a partitioned table has a primary key, the partition key must be a subset of the primary key.
Usage notes
If the number of data records in a single day is less than 100 million, we recommend that you do not use the date as a partitioning condition and do not create a partitioned table by date. Otherwise, the size of each partition is small and queries are not significantly accelerated. We recommend that you specify a coarser granularity.
If you need to frequently replace the data of a partition by performing TRUNCATE or DROP operations, we recommend that you use partitioned tables. If you use partitioned tables in this scenario, you can perform TRUNCATE or DROP operations in a more efficient manner, without the need to scan a large amount of data.
Create a partitioned table
Syntax
You can use the following statements to create a partitioned table:
-- Create a parent table. CREATE TABLE [if not exists] [<schema_name>.]<table_name> ([ { <column_name> <column_type> [ <column_constraints>, [...]] | <table_constraints> [, ...] } ]) PARTITION BY LIST(<column_name>); -- Create child tables. CREATE TABLE [if not exists] [<schema_name>.]<table_name> PARTITION OF <parent_table> FOR VALUES IN (<string_literal>);
Parameter description
The following table describes the parameters in the preceding syntax.
Parameter
Description
if not exists
Specifies that if a table with the same name already exists, the system does not return an error message but notifies you that the table already exists.
schema_name
The name of the schema in which the table resides. If you create a parent table and child tables in the same schema, you do not need to specify a schema name. If you create a parent table and child tables across schemas, you must specify a schema name.
table_name
The name of the parent table or the child table that you want to create.
column_name
The name of the column that you want to create in the new table.
column_type
The data type of the column.
column_constraints
The name of the column constraint.
table_constraints
The name of the table constraint.
parent_table
The name of the parent table of the child table.
string_literal
The partition key column.
Columns of the TEXT, VARCHAR, and INT data types can be used as partition key columns. In Hologres V1.3.22 and later, columns of the DATE data type can be used as partition key columns.
Examples
The following examples show how to create partitioned tables:
Example 1: Create a parent table that does not have a primary key and its child tables in the public schema.
Syntax supported in Hologres V2.1 and later:
BEGIN; CREATE TABLE public.hologres_parent ( a text, b int, c timestamp, d text ) PARTITION BY LIST (a) WITH (orientation = 'column'); CREATE TABLE public.hologres_child1 PARTITION OF public.hologres_parent FOR VALUES IN ('v1'); CREATE TABLE public.hologres_child2 PARTITION OF public.hologres_parent FOR VALUES IN ('v2'); CREATE TABLE public.hologres_child3 PARTITION OF public.hologres_parent FOR VALUES IN ('v3'); COMMIT;
Syntax supported in all Hologres versions:
BEGIN; CREATE TABLE public.hologres_parent( a text, b int, c timestamp, d text ) PARTITION BY LIST(a); CALL set_table_property('public.hologres_parent', 'orientation', 'column'); CREATE TABLE public.hologres_child1 PARTITION OF public.hologres_parent FOR VALUES IN('v1'); CREATE TABLE public.hologres_child2 PARTITION OF public.hologres_parent FOR VALUES IN('v2'); CREATE TABLE public.hologres_child3 PARTITION OF public.hologres_parent FOR VALUES IN('v3'); COMMIT;
Example 2: Create a parent table that has a primary key and its child tables in the public schema.
Syntax supported in Hologres V2.1 and later:
BEGIN; CREATE TABLE public.hologres_parent_2 ( a text, b int, c timestamp, d text, ds text, PRIMARY KEY (ds, b) ) PARTITION BY LIST (ds) WITH (orientation = 'column'); CREATE TABLE public.holo_child_1 PARTITION OF public.hologres_parent_2 FOR VALUES IN ('20201215'); CREATE TABLE public.holo_child_2 PARTITION OF public.hologres_parent_2 FOR VALUES IN ('20201216'); CREATE TABLE public.holo_child_3 PARTITION OF public.hologres_parent_2 FOR VALUES IN ('20201217'); COMMIT;
Syntax supported in all Hologres versions:
BEGIN; CREATE TABLE public.hologres_parent_2( a text , b int, c timestamp, d text, ds text, primary key(ds,b) ) PARTITION BY LIST(ds); CALL set_table_property('public.hologres_parent_2', 'orientation', 'column'); CREATE TABLE public.holo_child_1 PARTITION OF public.hologres_parent_2 FOR VALUES IN('20201215'); CREATE TABLE public.holo_child_2 PARTITION OF public.hologres_parent_2 FOR VALUES IN('20201216'); CREATE TABLE public.holo_child_3 PARTITION OF public.hologres_parent_2 FOR VALUES IN('20201217'); COMMIT;
Query all child tables
You can query all child tables of a parent table by using one of the following methods:
Query all child tables of a parent table in a visualized manner by using HoloWeb.
Query all child tables of a parent table by executing the following SQL statement. In the SQL statement, change parent_table_name based on your business requirements.
SELECT nmsp_parent.nspname AS parent_schema, parent.relname AS parent, nmsp_child.nspname AS child_schema, child.relname AS child FROM pg_inherits JOIN pg_class parent ON pg_inherits.inhparent = parent.oid JOIN pg_class child ON pg_inherits.inhrelid = child.oid JOIN pg_namespace nmsp_parent ON nmsp_parent.oid = parent.relnamespace JOIN pg_namespace nmsp_child ON nmsp_child.oid = child.relnamespace WHERE parent.relname='parent_table_name';
Rules on property settings for a parent table and its child tables
The following table describes the rules on property settings that apply when a child table is attached to a parent table. The following rules apply:
Must be consistent with the parent table: A property of the child table must be consistent with that of the parent table. Otherwise, an error is reported when the child table is attached to the parent table, and another child table must be created.
Not need to be consistent with the parent table: A property of the child table can be different from that of the parent table. If the property of the child table is not explicitly specified, the child table inherits the corresponding property setting of the parent table. If the property of the child table is explicitly specified, the property setting of the child table is retained.
Must include the indexed columns of the parent table: The indexed columns of the child table must include those of the parent table. Columns that are not specified as indexed columns for the parent table can be explicitly specified for the child table.
Category | Table property | Description | Whether a child table created by executing the CREATE TABLE PARTITION OF statement inherits the property setting from its parent table | Rule that applies when a child table is attached to a parent table |
Table property | orientation | The storage format of the table. | Yes | Must be consistent with the parent table. |
table_group | The table group to which the table belongs. This property also specifies the shard count for the table group. | Yes | Must be consistent with the parent table. | |
time_to_live_in_seconds | The time-to-live (TTL) of the data in the table. | Yes | Not need to be consistent with the parent table.
| |
Index | primary key | The primary key of the table. | Yes | Must be consistent with the parent table. |
distribution_key | The distribution key of the table. | Yes | Must be consistent with the parent table. | |
clustering_key | The clustering key of the table. | Yes | Must be consistent with the parent table. | |
event_time_column | The event time column of the table. | Yes | Must be consistent with the parent table. | |
bitmap_columns | The dictionary encoding columns of the table. | Yes | Not need to be consistent with the parent table. | |
dictionary_encoding_columns | The field indexes of the table. | Yes | Not need to be consistent with the parent table. | |
binlog_level | Specifies whether to enable binary logging. | Yes | Must be consistent with the parent table. | |
proxima_vectors | The indexes used to perform vector searches on the table. | Yes | Must be consistent with the parent table. | |
Column constraint | nullable | The NOT NULL constraint. | Yes | Must be consistent with the parent table. |
default value | The default value. | Yes | Must be consistent with the parent table. |
Dynamic partitioning
The dynamic partitioning feature of Hologres automatically creates and manages child tables based on the dynamic partitioning rules that you configure when you create a partitioned table. You do not need to configure all partitions when you create the partitioned table. For more information, see Dynamic partitioning.