This topic describes how to configure vector indexes when you create a table. Use these configurations to meet your business requirements for performance, cost, and real-time capabilities.
Parameter configurations
In step 4 of table creation, configure the Index Schema. You can configure vector fields in detail at this step.

Vector dimension
Specifies the number of features in a vector. Set this value to match the output dimension of your vector model exactly.
Recommendations:
Keep consistent: If the configured dimension does not match the actual vector data, index building fails.
Performance impact: Higher dimensions capture more information but increase memory usage and computational overhead. Doubling the dimension roughly doubles memory usage.
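To reason about the memory impact of a dimension choice, a back-of-the-envelope estimate helps. The sketch below assumes 4-byte float32 storage and ignores index-structure overhead; the function name is illustrative, not part of any product API:

```python
def index_memory_bytes(num_vectors: int, dim: int, bytes_per_value: int = 4) -> int:
    """Rough raw-vector memory estimate, assuming float32 (4-byte) values.

    Index structures (graphs, quantization tables) add overhead on top of this,
    so treat the result as a lower bound.
    """
    return num_vectors * dim * bytes_per_value

# 1 million 768-dimensional float32 vectors: about 2.86 GiB of raw vector data.
gib = index_memory_bytes(1_000_000, 768) / 1024**3
```

Note that doubling `dim` doubles the estimate, consistent with the guidance above.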
Distance type
Specifies how to compute similarity between vectors. Choose the distance type that best fits your data characteristics and business scenario. The choice directly affects retrieval quality.
Selection guide:
Cosine distance (Cosine): The score ranges from [-1, 1]. A higher score means higher similarity. A score of 1 means identical vectors, and a score of -1 means opposite vectors.
Inner product distance (InnerProduct): A higher score means higher similarity.
Squared Euclidean distance (SquaredEuclidean): A lower score means higher similarity. A score of 0 means identical vectors.
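The score semantics above can be verified with a few lines of plain Python. These are just the mathematical formulas, not how the service computes scores internally:

```python
import math

def cosine_score(a, b):
    """Cosine similarity: dot product divided by the product of the norms."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def inner_product(a, b):
    """Inner product: higher means more similar."""
    return sum(x * y for x, y in zip(a, b))

def squared_euclidean(a, b):
    """Squared Euclidean distance: lower means more similar."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

v = [1.0, 2.0, 3.0]
cosine_score(v, v)                  # ~1.0: identical vectors
cosine_score(v, [-x for x in v])    # ~-1.0: opposite vectors
squared_euclidean(v, v)             # 0.0: identical vectors
```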
Vector index algorithm
Specifies the underlying algorithm used to build vector indexes. Each algorithm balances build speed, memory usage, query performance, and recall rate differently.
Selection guide:
FLAT (formerly Linear)
Description: Exact brute-force search with 100% recall.
Distance types: InnerProduct, SquaredEuclidean, Cosine
Data scale: Very small (up to tens of thousands of vectors)
Recall: 100% (exact)
Latency: Very slow
RAM usage: Very low
Scenarios: Benchmarking. Exact ranking for very small datasets.

HNSW
Description: The performance benchmark. For strict requirements on accuracy and latency.
Distance types: InnerProduct, SquaredEuclidean, Cosine
Data scale: Medium (up to tens of millions of vectors)
Recall: Very high
Latency: Low
RAM usage: Very high
Scenarios: High-performance in-memory online search.

HNSW_RaBitQ
Description: Optimized for massive datasets under strict memory constraints. For lower accuracy requirements.
Distance types: SquaredEuclidean
Data scale: Medium to large (hundreds of millions to one billion vectors)
Recall: High
Latency: Very low
RAM usage: Very low
Scenarios: Lightweight search optimized for binary quantization.

CagraHNSW
Description: GPU-accelerated graph indexing. Works best with multiple GPUs for large-scale workloads.
Distance types: InnerProduct, SquaredEuclidean
Data scale: Medium to large (up to hundreds of millions of vectors)
Recall: Very high
Latency: Very low (GPU)
RAM usage: Very high
Scenarios: GPU acceleration. Extremely high throughput.

HNSW_SQ (formerly QGraph)
Description: High query speed and performance. For lower accuracy requirements.
Distance types: InnerProduct, SquaredEuclidean, Cosine
Data scale: Large (up to billions of vectors)
Recall: High
Latency: Low
RAM usage: High

IVF_SQ8
Description: Balanced trade-off. For moderate requirements on both accuracy and latency.
Distance types: InnerProduct, SquaredEuclidean, Cosine
Data scale: Large (up to 500 million vectors)
Recall: Medium to high
Latency: Medium
RAM usage: Low
Scenarios: Cold-hot tiered storage for budget-constrained, large-scale workloads. Reduces memory usage by compressing vectors. A classic balance of cost and scale.

DiskANN
Description: Uses local disks. For workloads that tolerate higher latency and need minimal memory.
Distance types: InnerProduct, SquaredEuclidean, Cosine
Data scale: Massive (billions of vectors or more)
Recall: High
Latency: Medium to high
RAM usage: Very low
Scenarios: Disk-resident, ultra-large-scale search.
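As a first pass, the selection guide can be condensed into a small decision sketch. The function and thresholds below are illustrative simplifications of the guidance above, not an official API; always validate against your own workload:

```python
def suggest_algorithm(num_vectors: int, gpu: bool = False,
                      low_memory: bool = False, on_disk: bool = False) -> str:
    """Condense the selection guide into a first-pass recommendation.

    Thresholds are illustrative approximations of the table above.
    """
    if num_vectors < 10_000:
        return "FLAT"          # exact brute-force search, 100% recall
    if on_disk or num_vectors >= 1_000_000_000:
        return "DiskANN"       # disk-resident, minimal memory, higher latency
    if gpu:
        return "CagraHNSW"     # GPU-accelerated graph index
    if low_memory:
        return "HNSW_RaBitQ"   # binary quantization, very low RAM
    if num_vectors >= 100_000_000:
        return "IVF_SQ8"       # compressed vectors, cost/scale balance
    return "HNSW"              # high recall, low latency, in memory
```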
Real-time indexing
Enables immediate indexing and querying of incremental data written via API. Data becomes visible within seconds.
How it works: The system first builds temporary in-memory indexes for real-time writes. When enough data accumulates, it merges those indexes with the full disk-based index.
Recommendations:
Enable (true): Use for online services where data must be immediately searchable. This uses extra memory and CPU resources.
Disable (false): Use for batch imports or offline analytics where updates are infrequent.
Advanced configurations
Threshold for linear building
If the number of documents in a shard is less than this threshold, the system uses Linear (brute-force scan) for search, even if you selected another vector index algorithm.
Recommendations:
Default: 5000. This is an empirical value. At this scale, brute-force search often performs as well as or better than building a complex index.
Adjust only if needed: Usually, leave this unchanged. If your query concurrency is very high and your data volume is near this threshold, lower the value to force use of high-performance indexes such as HNSW. Note that this may increase index-building overhead.
Ignore invalid vector data
Controls how the system handles abnormal vectors—such as mismatched dimensions or empty values—during full or incremental index building.
Recommendations:
true: Skips rows with invalid vectors; index building continues and a warning appears in the logs. Recommended for development and testing, where it helps you debug quickly without letting a few dirty records break the entire job.
false: Stops index building and returns an error on any invalid vector. Recommended for production environments, where it ensures data quality and prevents silent data loss. Use with upstream data cleaning workflows.
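Whichever option you choose, catching bad rows before upload is cheaper than failing a build. A minimal client-side check might look like the following; the helper is illustrative and not part of any SDK:

```python
import math

def split_valid_vectors(rows, dim, skip_invalid=True):
    """Separate well-formed vectors from rows with a wrong dimension,
    empty values, or non-numeric/NaN entries.

    With skip_invalid=False, raise on the first bad row instead,
    mirroring the strict 'false' behavior described above.
    """
    valid, bad_rows = [], []
    for i, vec in enumerate(rows):
        ok = (
            vec is not None
            and len(vec) == dim
            and all(isinstance(x, (int, float)) and not math.isnan(x) for x in vec)
        )
        if ok:
            valid.append(vec)
        elif skip_invalid:
            bad_rows.append(i)   # skipped, like the service's warning log
        else:
            raise ValueError(f"row {i}: invalid vector {vec!r}")
    return valid, bad_rows
```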
Real-time indexing parameters
Tunes how real-time data streams are processed after real-time indexing is enabled.
Example: {"proxima.oswg.streamer.segment_size": 2048}
Explanation: The proxima.oswg.streamer.segment_size parameter controls how many records accumulate in memory before being flushed to a small in-memory segment.
Tuning recommendations:
High write QPS: Increase this value (for example, to 4096) to reduce the number of segments in memory and lower index management overhead. This slightly increases the delay between write and query availability.
Low write QPS: Keep the default value 2048, or decrease it slightly, to make newly written data available for queries faster.
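To pick a value, it helps to estimate how long one segment takes to fill at your write rate. The arithmetic below assumes a segment is flushed only when it reaches segment_size; real systems may also flush on a timer, so treat it as a rough upper bound:

```python
def segment_fill_seconds(segment_size: int, write_qps: float) -> float:
    """Approximate time for one in-memory segment to fill at a steady write rate."""
    return segment_size / write_qps

# Default 2048 at 500 writes/s: the buffer fills in about 4.1 seconds;
# doubling segment_size to 4096 doubles that window.
segment_fill_seconds(2048, 500)
```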
Real-time retrieval parameters
Dynamically adjusts search behavior per index algorithm to balance recall and latency. Keys and values depend on the selected vector index algorithm.
General note: These parameters usually control search scope. For example, with HNSW, the ef parameter sets how many neighbor nodes to traverse during search. A larger ef improves recall but increases latency.
Example (HNSW): {"searcher_name":"HNSW", "ef":200}
The ef value typically ranges from k (the number of top-K results requested) to 4096. Start testing at 100 and adjust based on your recall and latency requirements.
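When sweeping ef, you need a way to score each setting. A common metric is recall@K measured against exact (brute-force) results; a minimal helper for that comparison (illustrative, not part of the service API):

```python
def recall_at_k(approx_ids, exact_ids):
    """Fraction of the exact top-K results that the approximate search also returned."""
    return len(set(approx_ids) & set(exact_ids)) / len(exact_ids)

# Typical tuning loop: for ef in (100, 200, 400, ...), run a sample query set
# with that ef, compare against brute-force results, and stop at the smallest
# ef that meets your recall target within your latency budget.
recall_at_k([1, 2, 3, 5, 9], [1, 2, 3, 4, 5])  # 0.8
```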
Vector separator
Specifies the delimiter between vector dimensions in string-formatted vector data.
Example: In 1.05,0.15,0.14, the delimiter is a comma (,). This is the default. You rarely need to change it.
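Client-side parsing mirrors this setting; for example, in Python:

```python
def parse_vector(text: str, sep: str = ",") -> list:
    """Split a string-formatted vector on the configured separator."""
    return [float(part) for part in text.split(sep)]

parse_vector("1.05,0.15,0.14")  # [1.05, 0.15, 0.14]
```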