Creates a knowledge base.
Try it now
Test
RAM authorization
|
Action |
Access level |
Resource type |
Condition key |
Dependent action |
|
gpdb:CreateDocumentCollection |
create |
*Collection
|
None | None |
Request parameters
|
Parameter |
Type |
Required |
Description |
Example |
| DBInstanceId |
string |
Yes |
The ID of the AnalyticDB for PostgreSQL instance. Note
You can call the DescribeDBInstances operation to view details of all AnalyticDB for PostgreSQL instances in a region, including their instance IDs. |
gp-xxxxxxxxx |
| ManagerAccount |
string |
Yes |
The name of the management account that has the rds_superuser permission. Note
You can create an account in the console by choosing Account Management. You can also call the CreateAccount operation to create an account. |
testaccount |
| ManagerAccountPassword |
string |
Yes |
The password of the management account. |
testpassword |
| Namespace |
string |
No |
The namespace. Default value: public. Note
You can call the CreateNamespace operation to create a namespace. You can call the ListNamespaces operation to view the list of namespaces. |
mynamespace |
| Collection |
string |
Yes |
The name of the document collection to create. Note
The name must comply with PostgreSQL object naming rules. |
document |
| RegionId |
string |
Yes |
The ID of the region where the instance resides. |
cn-hangzhou |
| EmbeddingModel |
string |
No |
The embedding algorithm. Default value: text-embedding-v3. Note
Supported algorithms:
|
text-embedding-v1 |
| Dimension |
integer |
No |
The vector dimension. Default value is the dimension supported by the embedding algorithm. |
1024 |
| FullTextRetrievalFields |
string |
No |
The fields used for full-text search. Separate multiple fields with commas (,). Each field must be a key defined in the Metadata parameter. |
title,page |
| Metadata |
string |
No |
The metadata of vector data, formatted as a JSON string in MAP format. Keys represent field names. Values represent data types. Note
Supported data types:
Warning The fields id, vector, doc_name, content, loader_metadata, source, and to_tsvector are reserved. Do not use them. |
{"title":"text","page":"int"} |
| Parser |
string |
No |
The tokenizer used for full-text search. Default value: zh_cn. |
zh_cn |
| Metrics |
string |
No |
The method used to build the vector index. Valid values:
|
cosine |
| HnswM |
integer |
No |
The maximum number of neighbors in the HNSW algorithm. The API sets this value automatically based on the vector dimension. Manual configuration is usually unnecessary. Note
Valid values:
Note
We recommend setting this value based on the vector dimension:
|
64 |
| HnswEfConstruction |
string |
No |
The candidate set size used when building the HNSW index. Valid values: 4 to 1000. Default value: 64. Note
This parameter applies only to AnalyticDB for PostgreSQL 7.0 instances. Its value must be ≥ 2 × HNSW_M. |
128 |
| PqEnable |
integer |
No |
Whether to enable product quantization (PQ) to accelerate indexing. We recommend enabling PQ if your data volume exceeds 500,000 items.
|
1 |
| ExternalStorage |
integer |
No |
Whether to use memory-mapped files (mmap) to build the HNSW index. Default value: 0. Set this to 1 if you do not need to delete data and require high upload performance. Valid values:
Important Only AnalyticDB for PostgreSQL 6.0 supports the ExternalStorage parameter. AnalyticDB for PostgreSQL 7.0 does not support it. |
0 |
| MetadataIndices |
string |
No |
The scalar index fields. Separate multiple fields with commas (,). Each field must be a key defined in the Metadata parameter. |
title |
| EnableGraph |
boolean |
No |
Whether to enable knowledge graph construction. Default value: false. Note
Before using this parameter, upgrade your instance to a version that supports the graph engine. During public preview, submit a ticket to request an upgrade. |
true |
| LLMModel |
string |
No |
The large language model (LLM) name.
Note
This parameter takes effect only when EnableGraph is set to true. |
knowledge-extract-standard |
| Language |
string |
No |
The language used for knowledge graph construction.
Note
This parameter takes effect only when EnableGraph is set to true. |
Simplified Chinese |
| EntityTypes |
array |
No |
The list of entity types. Note
This parameter is required when EnableGraph is set to true. |
|
|
string |
No |
The entity type. |
地点 |
|
| RelationshipTypes |
array |
No |
The list of relationship edge types. Note
This parameter is required when EnableGraph is set to true. |
|
|
string |
No |
The relationship edge type. |
发生 |
|
| SupportSparse |
boolean |
No |
Whether to support sparse vectors. Default value: false. |
true |
| SparseVectorIndexConfig |
object |
No |
The sparse vector index configuration. If provided, a sparse vector index is created. |
|
| HnswM |
integer |
No |
The maximum number of neighbors in the HNSW algorithm. The API sets this value automatically based on the vector dimension. Manual configuration is usually unnecessary. Note
Valid values:
Note
We recommend setting this value based on the vector dimension:
|
64 |
| HnswEfConstruction |
integer |
No |
The candidate set size used when building the HNSW index. Valid values: 4 to 1000. Default value: 64. Note
This parameter applies only to AnalyticDB for PostgreSQL 7.0 instances. Its value must be ≥ 2 × HNSW_M. |
128 |
| SparseRetrievalFields |
string |
No |
The metadata fields used to build sparse vectors. Separate multiple fields with commas (,). Each field must be a key defined in the Metadata parameter. |
title,abstract |
Response elements
|
Element |
Type |
Description |
Example |
|
object |
|||
| RequestId |
string |
The ID of the request. |
ABB39CC3-4488-4857-905D-2E4A051D0521 |
| Message |
string |
The response message. |
Successful |
| Status |
string |
The status of the API execution.
|
successs |
Examples
Success response
JSON format
{
"RequestId": "ABB39CC3-4488-4857-905D-2E4A051D0521",
"Message": "Successful",
"Status": "successs"
}
Error codes
See Error Codes for a complete list.
Release notes
See Release Notes for a complete list.