All Products
Search
Document Center

Alibaba Cloud Model Studio:Retrieve

Last Updated:Dec 03, 2024

Queries information from a specified knowledge base.

Operation description

  • Before you call this operation, make sure that your knowledge base is created and is not deleted. That is, the primary key ID of the knowledge base IndexId is valid.
  • The response time may be long because this operation involves complex retrieval and matching. We recommend that you set appropriate timeout and retry policy for requests.
  • This interface is idempotent.

Debugging

You can run this interface directly in OpenAPI Explorer, saving you the trouble of calculating signatures. After running successfully, OpenAPI Explorer can automatically generate SDK code samples.

Authorization information

The following table shows the authorization information corresponding to the API. The authorization information can be used in the Action policy element to grant a RAM user or RAM role the permissions to call this API operation. Description:

  • Operation: the value that you can use in the Action element to specify the operation on a resource.
  • Access level: the access level of each operation. The levels are read, write, and list.
  • Resource type: the type of the resource on which you can authorize the RAM user or the RAM role to perform the operation. Take note of the following items:
    • The required resource types are displayed in bold characters.
    • If the permissions cannot be granted at the resource level, All Resources is used in the Resource type column of the operation.
  • Condition Key: the condition key that is defined by the cloud service.
  • Associated operation: other operations that the RAM user or the RAM role must have permissions to perform to complete the operation. To complete the operation, the RAM user or the RAM role must have the permissions to perform the associated operations.
OperationAccess levelResource typeCondition keyAssociated operation
sfm:Retrievenone
*All Resources
*
    none
none

Request syntax

POST /{WorkspaceId}/index/retrieve HTTP/1.1

Request parameters

ParameterTypeRequiredDescriptionExample
QuerystringNo

The input query prompt. The length and characters of the query are not limited.

DenseSimilarityTopKintegerNo

Vector retrieval top K. After generating vectors based on input text, the top K chunks in the knowledge base that are most similar to the vector representation of the input text are retrieved. Valid values: 0 to 100. The sum of the DenseSimilarityTopK and SparseSimilarityTopK parameters must be less than or equal to 200.

Default value: 100.

100
EnableRerankingbooleanNo

Specifies whether to enable reranking. For more information, see Create a knowledge base. Valid values:

  • true
  • false

Default value: true.

true
EnableRewritebooleanNo

Specifies whether to enable multi-round conversation rewriting. For more information, see Create a knowledge base. Valid values:

  • true
  • false

Default value: false.

false
Rerankarray<object>No

Ranking configurations.

objectNo

The ranking configurations object.

ModelNamestringNo

The name of the rank model. For more information, see Create a knowledge base. Valid values:

  • gte-rerank-hybrid: Recommended official model.
  • gte-rerank
gte-rerank-hybrid
RerankMinScorefloatNo

Similarity Threshold The lowest similarity score of chunks that can be returned. This parameter is used to filter text chunks returned by the rank model. For more information, see Create a knowledge base. Valid values: [0.01-1.00]. The priority of this parameter is greater than the similarity threshold configured for the knowledge base.

By default, this parameter is left empty. In this case, the similarity threshold of the knowledge base is used.

0.20
RerankTopNintegerNo

The top N return data after reranking. Valid values: 1 to 20. Default value: 5.

5
Rewritearray<object>No

Conversation rewriting configurations.

objectNo

Conversation rewriting configurations object.

ModelNamestringNo

Conversation rewriting model name. The query rewriting model automatically adjusts the original prompt based on the context to improve retrieval performance. Valid value:

  • conv-rewrite-qwen-1.8b

By default, this parameter is left empty, which means conv-rewrite-qwen-1.8b is used.

conv-rewrite-qwen-1.8b
SparseSimilarityTopKintegerNo

The top K of keyword retrieval. Chunks that exactly match the keywords of the input text are retrieved from the knowledge base. This filters out irrelevant chunks and boosts accuracy. Valid values: 0 to 100. The sum of the DenseSimilarityTopK and SparseSimilarityTopK parameters must be less than or equal to 200.

Default value: 100.

100
WorkspaceIdstringYes

The ID of the workspace to which the knowledge base belongs. To view the workspace ID, you can click the Workspace Details icon in the upper-left corner on the homepage of the console.

ws_3Nt27MYcoK191ISp
IndexIdstringYes

The primary key ID of the knowledge base, which is the Data.Id parameter returned by the CreateIndex operation.

5pwe0m2g6t
SaveRetrieverHistorybooleanNo

Specifies whether to save the retrieve test history. Valid values:

  • true
  • false

Default value: false.

false
SearchFiltersarray<object>No

Specifies complex filter conditions. For more information about the syntax of SearchFilters, see the SearchFilter syntax section of this topic.

objectNo

The filter condition object.

stringNo

The filter condition.

Alibaba Cloud Model Studio

SearchFilters syntax

  • The filter conditions support sub groups. By default, the default semantics between sub groups is AND and cannot be modified. 

    • Fields within a sub group support the _operator = AND/OR logical operators and the AND semantics by default. Case insensitive. 
    • Fields within a sub group support _conditions =[] for nested sub-conditions. By default, the logical relationship between sub-conditions is inherited from the _operator logical operator of the parent group. You can also use _conditions_operator= AND/OR to customize the logical relationship between sub-conditions.
  • Within a sub group, retrievable fields support singleQuery, multiQuery, and rangeQuery. 

    • singleQuery: numeric values and strings.
    • multiQuery: an array of numeric values and strings.
    • rangeQuery: The eq, neq, and like attributes are supported. You cannot configure multiple (case insensitive) intervals for a field. The gt, gte, lt, and lte attributes are supported. We recommend that you set these values to numeric (case insensitive).
Sample:

{
  "search_filters": [
    {
      "singleQuery": "stringValue",  // Single-value query
      "multiQuery": [                // Multi-value query.
        "stringValue1",
        "stringValue2"
      ],
      "logicQuery": {                // Logic query.
        "like": "prefix"
      },
      "rangeQuery": {                // Range query.
        "gte": intValue,
        "lte": intValue
      },
      "_conditions": [                // Nested sub-conditions.
        {
          "singleQuery": intValue
        }
      ]
    },
    {
      "_operator": "OR",                // Multi-condition query.
      "singleQuery": "stringValue"
    }
  ]
}

Response parameters

ParameterTypeDescriptionExample
object
Codestring

HTTP status code

Index.InvalidParameter
Dataobject

The returned data.

Nodesarray<object>

The list of queried chunks.

nodesobject

The chunk object.

Metadataany

The metadata map of the chunk.

Scoredouble

The similarity score of the chunk. Valid values:[0-1].

0.3
Textstring

The text of the chunk.

Messagestring

The error message.

Required parameter(%s) missing or invalid, please check the request parameters.
RequestIdstring

The request ID.

17204B98-7734-4F9A-8464-2446A84821CA
Statusstring

The HTTP status code returned.

200
Successboolean

Indications whether the API call is successful. Valid values:

  • true
  • false
true

Examples

Sample success responses

JSONformat

{
  "Code": "Index.InvalidParameter",
  "Data": {
    "Nodes": [
      {
        "Metadata": "",
        "Score": 0.3,
        "Text": ""
      }
    ]
  },
  "Message": "Required parameter(%s) missing or invalid, please check the request parameters.",
  "RequestId": "17204B98-7734-4F9A-8464-2446A84821CA",
  "Status": "200",
  "Success": true
}

Error codes

For a list of error codes, visit the Service error codes.

Change history

Change timeSummary of changesOperation
2024-07-23The request parameters of the API has changedView Change Details