You can use match query (MatchQuery) to query data in a table based on approximate matches. Tablestore tokenizes the values in TEXT columns and the keywords that you use to perform match queries based on the analyzer type that you specify. This way, Tablestore can perform match queries based on the tokens. We recommend that you use match phase query for columns for which fuzzy tokenization is used to ensure high performance in fuzzy queries.
Prerequisites
An OTSClient instance is initialized. For more information, see Initialize an OTSClient instance.
A data table is created and data is written to the data table. For more information, see Create a data table and Write data.
A search index is created for the data table. For more information, see Create search indexes.
Parameters
Parameter | Description |
TableName | The name of the data table. |
IndexName | The name of the search index. |
Query | The type of the query. Set the query type to MatchQuery. |
FieldName | The name of the field that you want to match. You can perform match queries on TEXT columns. |
Text | The keyword that is used to match the column values when you perform a match query. If the column used for the match query is of the TEXT type, the keyword is tokenized into multiple keywords based on the analyzer you specified when you create the search index. By default, single-word tokenization is performed if you do not specify the analyzer when you create the search index. For example, if the column that you want to match is a TEXT column, you set the analyzer type to single-word tokenization, and you use "this is" as a search keyword, you can obtain query results such as "..., this is tablestore", "is this tablestore", "tablestore is cool", "this", and "is". |
Operator | The logical operator. By default, the OR logical operator is used, which indicates that a row meets the query conditions if the value of the column contains the specified number of tokens. If you set the Operator parameter to AND, a row meets the query conditions only if the value of the column contains all tokens. |
MinimumShouldMatch | The minimum number of matched tokens contained in a column value. A row is returned only if the value of the column specified by the FieldName parameter in the row contains at least the minimum number of matched tokens. Note The MinimumShouldMatch parameter must be used with the OR logical operator. |
Offset | The position from which the current query starts. |
Limit | The maximum number of rows that you want the current query to return. To query only the number of rows that meet the query conditions without specific data, set the Limit parameter to 0. |
GetTotalCount | Specifies whether to return the total number of rows that meet the query conditions. Default value: false. If you set this parameter to true, the query performance is compromised. |
ColumnsToGet | Specifies whether to return all columns of each row that meets the query conditions. You can specify the ReturnAll and Columns parameters. By default, the ReturnAll parameter is set to false, which indicates that not all columns are returned. If the ReturnAll parameter is set to false, you can use the Columns parameter to specify the columns that you want to return. If you do not specify the Columns parameter, only the primary key columns are returned. If you set ReturnAll to true, all columns are returned. |
Sample code
The following sample code provides an example on how to query the rows whose value of Col_Keyword matches 'hangzhou':
/**
* Query the rows whose value of Col_Keyword matches "hangzhou" in a table. Tablestore returns the total number of rows that meet the query conditions and part of the matched rows.
*/
func MatchQuery(client *tablestore.TableStoreClient, tableName string, indexName string) {
searchRequest := &tablestore.SearchRequest{}
searchRequest.SetTableName(tableName)
searchRequest.SetIndexName(indexName)
query := &search.MatchQuery{} // Set the query type to MatchQuery.
query.FieldName = "Col_Keyword" // Specify the name of the column that you want to query.
query.Text = "hangzhou" // Specify the keyword that is used to match the value of the column.
searchQuery := search.NewSearchQuery()
searchQuery.SetQuery(query)
searchQuery.SetGetTotalCount(true)
searchQuery.SetOffset(0) // Set the Offset parameter to 0.
searchQuery.SetLimit(20) // Set the Limit parameter to 20, which indicates a maximum of 20 rows can be returned.
searchRequest.SetSearchQuery(searchQuery)
searchResponse, err := client.Search(searchRequest)
if err != nil { // An error occurs.
fmt.Printf("%#v", err)
return
}
fmt.Println("IsAllSuccess: ", searchResponse.IsAllSuccess) // Check whether all rows that meet the query conditions are returned.
fmt.Println("TotalCount: ", searchResponse.TotalCount) // Display the total number of rows that meet the query conditions.
fmt.Println("RowCount: ", len(searchResponse.Rows)) // Display the total number of rows that are returned.
for _, row := range searchResponse.Rows {
jsonBody, err := json.Marshal(row)
if err != nil {
panic(err)
}
fmt.Println("Row: ", string(jsonBody)) // By default, if you do not specify the columnsToGet parameter, only the primary key columns are returned.
}
// Return all columns in the rows that meet the query conditions.
searchRequest.SetColumnsToGet(&tablestore.ColumnsToGet{
ReturnAll:true,
})
searchResponse, err = client.Search(searchRequest)
if err != nil {
fmt.Printf("%#v", err)
return
}
fmt.Println("IsAllSuccess: ", searchResponse.IsAllSuccess) // Check whether all rows that meet the query conditions are returned.
fmt.Println("TotalCount: ", searchResponse.TotalCount) // Display the total number of rows that meet the query conditions.
fmt.Println("RowCount: ", len(searchResponse.Rows))
for _, row := range searchResponse.Rows {
jsonBody, err := json.Marshal(row)
if err != nil {
panic(err)
}
fmt.Println("Row: ", string(jsonBody))
}
}
FAQ
References
When you use a search index to query data, you can use the following query methods: term query, terms query, match all query, match query, match phrase query, prefix query, range query, wildcard query, geo query, Boolean query, KNN vector query, nested query, and exists query. You can use the query methods provided by the search index to query data from multiple dimensions based on your business requirements.
You can sort or paginate rows that meet the query conditions by using the sorting and paging features. For more information, see Sorting and paging.
You can use the collapse (distinct) feature to collapse the result set based on a specific column. This way, data of the specified type appears only once in the query results. For more information, see Collapse (distinct).
If you want to analyze data in a data table, you can use the aggregation feature of the Search operation or execute SQL statements. For example, you can obtain the minimum and maximum values, sum, and total number of rows. For more information, see Aggregation and SQL query.
If you want to obtain all rows that meet the query conditions without the need to sort the rows, you can call the ParallelScan and ComputeSplits operations to use the parallel scan feature. For more information, see Parallel scan.