All Products
Search
Document Center

Tablestore:Collapse (distinct)

Last Updated:Aug 23, 2024

You can use the collapse (distinct) feature to collapse the result set based on a specified column when the results of a query contain large amounts of data of a specific type. Data of the specific type is displayed only once in the query results to ensure the diversity of the result types.

Prerequisites

Usage notes

  • If you use the collapse (distinct) feature, you can perform pagination only by specifying the offset and limit parameters.

  • If you aggregate and collapse a result set at the same time, the result set is aggregated before it is collapsed.

  • If you collapse the results of a query, the total number of results that are returned is determined by the sum of the values of the offset and limit parameters. A maximum of 50,000 results can be returned.

  • The total number of rows in the response indicates the number of rows that meet the query conditions before you use the collapse (distinct) feature. After the result set is collapsed, the total number of distinct values cannot be queried.

Parameters

Parameter

Description

query

The query type. You can set this parameter to any query type.

collapse

Collapses the result set based on the field that is specified by the fieldName parameter.

fieldName: the name of the field based on which the result set is collapsed. Only the fields whose values are of the INTEGER, FLOATING-POINT, or KEYWORD type are supported. The fields whose values are of the ARRAY type are not supported.

offset

The position from which the current query starts.

limit

The maximum number of rows that you want the current query to return.

To query only the number of rows that meet the query conditions without querying specific data of the rows, set the limit parameter to 0.

getTotalCount

Specifies whether to return the total number of rows that meet the query conditions. The default value of this parameter is false, which specifies that the total number of rows that meet the query conditions is not returned.

If you set this parameter to true, the query performance is compromised.

tableName

The name of the data table.

indexName

The name of the search index.

columnsToGet

Specifies whether to return all columns of each row that meets the query conditions.

  • If you set returnType to TableStore.ColumnReturnType.RETURN_SPECIFIED, you need to configure returnNames to specify the columns that you want to return.

  • If you set the returnType parameter to TableStore.ColumnReturnType.RETURN_ALL, all columns are returned.

  • If you set the returnType parameter to TableStore.ColumnReturnType.RETURN_ALL_FROM_INDEX, all columns in the search index are returned. .

  • If you set the returnType parameter to TableStore.ColumnReturnType.RETURN_NONE, only the primary key columns are returned.

Examples

The following sample code provides an example on how to use match all query to query the total number of rows in a table and collapse the result set based on the value of the col_keyword column:

let searchQuery = {
    offset: 0,
    limit: 100,
    query: {
        queryType: TableStore.QueryType.MATCH_ALL_QUERY,
    },
    collapse: {
        fieldName: "col_keyword",
    },
    getTotalCount: false,
};
let params = {
    tableName: tableName,
    indexName: indexName,
    searchQuery: searchQuery,
    columnToGet: { // Specify the columns that you want to return. You can set the parameter to RETURN_SPECIFIED to return the specified columns, RETURN_ALL to return all columns, RETURN_ALL_FROM_INDEX to return all columns in the search index, or RETURN_NONE to return only the primary key columns. 
        returnType: TableStore.ColumnReturnType.RETURN_ALL_FROM_INDEX
    },
    timeoutMs: 30000,
}
client.search(params, function (err, data) {
    if (err) {
        console.log('search error:', err.toString());
    } else {
        console.log('search success:', data);
    }
});

FAQ

References

  • The following query types are supported by search indexes: term query, terms query, match all query, match query, match phrase query, prefix query, range query, wildcard query, Boolean query, geo query, nested query, vector query, and exists query. You can select a query type to query data based on your business requirements.

    If you want to sort or paginate the rows that meet the query conditions, you can use the sorting and paging feature. For more information, see Sorting and paging.

    If you want to collapse the result set based on a specific column, you can use the collapse (distinct) feature. This way, data of the specified type appears only once in the query results. For more information, see Collapse (distinct).

  • If you want to analyze data in a data table, such as obtaining the extreme values, sum, and total number of rows, you can perform aggregation operations or execute SQL statements. For more information, see Aggregation and SQL query.

  • If you want to quickly obtain all rows that meet the query conditions without the need to sort the rows, you can call the ParallelScan and ComputeSplits operations to use the parallel scan feature. For more information, see Parallel scan.