ApsaraDB RDS: Object storage paths

Last Updated: Oct 17, 2024

This topic describes the path formats for vector data that is stored in Alibaba Cloud Object Storage Service (OSS) buckets, self-managed Multi-Cloud Object Storage (MinIO) buckets, and Hadoop Distributed File System (HDFS) files. You can use these paths to query, search, and analyze the data efficiently.

Paths of vector data in OSS buckets

The path of vector data in an OSS bucket is in the following format:

oss://<access_id>:<secret_key>@[<Endpoint>]/<bucket>/path_to/file

Note

The following list describes the parameters:

  • access_id and secret_key specify the AccessKey ID and AccessKey secret of the AccessKey pair that is used to access your OSS bucket. For more information, see Obtain an AccessKey pair.

  • Endpoint specifies the endpoint that is used to access OSS. To ensure data accessibility, make sure that your ApsaraDB RDS instance and the OSS bucket reside in the same region, and use the internal endpoint. For more information, see OSS endpoints.

  • /<bucket>/path_to/file specifies the path of the object in OSS. If you use a file in the Shapefile format, the dataset must include at least the following three files: .shp, .shx, and .dbf. All of these files must be uploaded to the same folder in the OSS bucket.
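
As a minimal sketch of how the format above can be assembled, the following Python helpers build an OSS path from its parts and list the three object keys that a Shapefile dataset needs. The function names, the sample bucket and object names, and the sample credentials are hypothetical and used for illustration only. The sketch always passes an endpoint and does not cover the case in which the endpoint is omitted.

```python
import posixpath


def build_oss_path(access_id, secret_key, endpoint, bucket, object_key):
    """Assemble an OSS path from its parts (hypothetical helper)."""
    if not bucket or "/" in bucket:
        raise ValueError("bucket must be a non-empty name without slashes")
    # Drop a leading slash so the path does not contain '//' after the bucket.
    key = object_key.lstrip("/")
    return f"oss://{access_id}:{secret_key}@{endpoint}/{bucket}/{key}"


def shapefile_parts(shp_key):
    """Return the .shp, .shx, and .dbf object keys that must share a folder."""
    base, ext = posixpath.splitext(shp_key)
    if ext.lower() != ".shp":
        raise ValueError("expected an object key that ends with .shp")
    return [base + suffix for suffix in (".shp", ".shx", ".dbf")]


# Sample values only: replace them with your own AccessKey pair, internal
# endpoint, bucket, and object key.
path = build_oss_path(
    "my_access_id",
    "my_secret_key",
    "oss-cn-hangzhou-internal.aliyuncs.com",
    "vectors",
    "data/roads.shp",
)
print(path)
print(shapefile_parts("data/roads.shp"))
```

The list returned by shapefile_parts can be compared against the objects that are actually present in the bucket before the path is used.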

Paths of vector data in MinIO buckets

Note

Access to files in MinIO buckets is supported only in hybrid cloud environments.

The path of vector data in a MinIO bucket is in the following format:

mio://<id>:<key>@<host>[:<port>]/<bucket>/path_to/file

If you do not specify the port parameter, the HTTPS port is used. For high-performance data import, make sure that your ApsaraDB RDS instance and the self-managed MinIO bucket reside in the same region and can communicate over an internal network.
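
To check that a MinIO path matches the format above before you use it, the path can be split into its parts. The following Python sketch uses the standard urllib.parse module. The function name, host name, credentials, and port 9000 are hypothetical sample values, and the sketch assumes that the ID and key contain no characters such as "/" or "@" that would need escaping.

```python
from urllib.parse import urlsplit


def parse_minio_path(path):
    """Split a mio:// path into its components (hypothetical helper)."""
    parts = urlsplit(path)
    if parts.scheme != "mio":
        raise ValueError("expected a path that starts with mio://")
    # The first path segment is the bucket; the rest is the object key.
    bucket, _, object_key = parts.path.lstrip("/").partition("/")
    if not (parts.username and parts.password and parts.hostname and bucket and object_key):
        raise ValueError("id, key, host, bucket, and file path are all required")
    return {
        "id": parts.username,
        "key": parts.password,
        "host": parts.hostname,
        "port": parts.port,  # None when the optional port is omitted
        "bucket": bucket,
        "object_key": object_key,
    }


with_port = parse_minio_path("mio://my_id:my_key@minio.example.internal:9000/vectors/data/roads.shp")
without_port = parse_minio_path("mio://my_id:my_key@minio.example.internal/vectors/data/roads.shp")
print(with_port["port"], without_port["port"])
```

A port value of None in the result indicates that the path relies on the default port behavior described above.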

Paths of vector data in HDFS files

Note

Access to HDFS files is supported only in hybrid cloud environments.

The path of vector data in an HDFS file is in the following format:

HDFS://<user_name>@<host>[:<port>]/path_to/file

  • If you do not specify the user_name parameter, the default value hadoop is used.

  • The host and port parameters specify the Remote Procedure Call (RPC) address and port that are defined in the HDFS configuration file.

  • For high-performance data import, make sure that your ApsaraDB RDS instance and the HDFS cluster reside in the same VPC and communicate over an internal network.
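
The HDFS format above can be sketched as a small Python helper. The function name, the host name, and port 8020 are hypothetical sample values. Instead of omitting user_name, the helper writes the documented default value hadoop into the path explicitly, which sidesteps any assumption about how a path without a user name is written.

```python
def build_hdfs_path(host, file_path, user_name="hadoop", port=None):
    """Assemble an HDFS path from its parts (hypothetical helper)."""
    if not host:
        raise ValueError("host is required")
    authority = f"{user_name}@{host}"
    # The port is optional in the format, so it is appended only when given.
    if port is not None:
        authority += f":{int(port)}"
    # Drop a leading slash so the path does not contain '//' after the host.
    return f"HDFS://{authority}/{file_path.lstrip('/')}"


# Sample values only: use the RPC host and port from your HDFS configuration.
default_user = build_hdfs_path("namenode.example.internal", "/data/roads.shp", port=8020)
custom_user = build_hdfs_path("namenode.example.internal", "data/roads.shp", user_name="gis")
print(default_user)
print(custom_user)
```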