GPUdb C++ API
Version 7.2.2.4
A set of parameters for GPUdb::aggregateGroupBy.
#include <gpudb/protocol/aggregate_group_by.h>
Public Member Functions
AggregateGroupByRequest ()
Constructs an AggregateGroupByRequest object with default parameters.
AggregateGroupByRequest (const std::string &tableName_, const std::vector< std::string > &columnNames_, const int64_t offset_, const int64_t limit_, const std::map< std::string, std::string > &options_)
Constructs an AggregateGroupByRequest object with the specified parameters.
AggregateGroupByRequest (const std::string &tableName_, const std::vector< std::string > &columnNames_, const int64_t offset_, const int64_t limit_, const std::string &encoding_, const std::map< std::string, std::string > &options_)
Constructs an AggregateGroupByRequest object with the specified parameters.
Public Attributes
std::string | tableName
Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules.
std::vector< std::string > | columnNames
List of one or more column names, expressions, and aggregate expressions.
int64_t | offset
A non-negative integer indicating the number of initial results to skip (this can be useful for paging through the results).
int64_t | limit
A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned.
std::string | encoding
Specifies the encoding for returned records.
std::map< std::string, std::string > | options
Optional parameters.
Calculates unique combinations (groups) of values for the given columns in a given table or view and computes aggregates on each unique combination. This is somewhat analogous to an SQL-style SELECT...GROUP BY.
For aggregation details and examples, see Aggregation. For limitations, see Aggregation Limitations.
Any column(s) can be grouped on, and all column types except unrestricted-length strings may be used for computing applicable aggregates. Columns marked as store-only cannot be used in grouping or aggregation.
The results can be paged via the offset and limit parameters. For example, to get the 10 groups with the largest counts, the inputs would be: limit=10, options={"sort_order":"descending", "sort_by":"value"}.
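In C++, the inputs from this example map onto the limit and options members of the request. The following is a minimal sketch of those two values only; topGroupsOptions and kTopGroupsLimit are illustrative names and are not part of the GPUdb API.

```cpp
#include <cstdint>
#include <map>
#include <string>

// Illustrative helper: builds the options quoted in the example above.
// The keys and values are taken verbatim from the text.
std::map<std::string, std::string> topGroupsOptions()
{
    std::map<std::string, std::string> options;
    options["sort_order"] = "descending";
    options["sort_by"] = "value";
    return options;
}

// The limit that accompanies those options in the example above.
const int64_t kTopGroupsLimit = 10;
```

Both values would then be passed as the limit_ and options_ arguments of one of the constructors documented on this page.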
The options parameter can be used to customize the behavior of this call, e.g. by filtering or sorting the results.
To group by columns 'x' and 'y' and compute the number of objects within each group, use: column_names=['x','y','count(*)'].
To also compute the sum of 'z' over each group, use: column_names=['x','y','count(*)','sum(z)'].
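The two column lists above could be assembled in C++ roughly as follows. This is a sketch only; groupByXY is a hypothetical helper name, and the resulting vector is what would be supplied as columnNames.

```cpp
#include <string>
#include <vector>

// Hypothetical helper mirroring the two column_names lists quoted above:
// group by 'x' and 'y', count the objects in each group, and optionally
// also sum 'z' over each group.
std::vector<std::string> groupByXY(bool includeSumZ)
{
    std::vector<std::string> columnNames;
    columnNames.push_back("x");
    columnNames.push_back("y");
    columnNames.push_back("count(*)");
    if (includeSumZ)
        columnNames.push_back("sum(z)");
    return columnNames;
}
```

For instance, groupByXY(true) yields the four-entry list from the second example.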
Available aggregation functions are: count(*), sum, min, max, avg, mean, stddev, stddev_pop, stddev_samp, var, var_pop, var_samp, arg_min, arg_max and count_distinct.
Available grouping functions are Rollup, Cube, and Grouping Sets.
This service also provides support for Pivot operations.
Filtering on aggregates is supported via expressions using aggregation functions supplied to having.
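As a rough illustration, and assuming that having is supplied as a key in the options map (as the sentence above suggests), a filter could be attached like this. The helper name withHaving is hypothetical, and the expression used in the usage note is illustrative only.

```cpp
#include <map>
#include <string>

// Hypothetical helper: returns a copy of the given options with a
// "having" entry set to the supplied aggregate filter expression.
// Assumption: "having" is passed as an options key.
std::map<std::string, std::string> withHaving(
    std::map<std::string, std::string> options,
    const std::string &expression)
{
    options["having"] = expression;
    return options;
}
```

A call such as withHaving(options, "sum(z) > 100") would be expected to keep only groups whose sum of 'z' exceeds 100, provided the server accepts that expression syntax.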
The response is returned as a dynamic schema. For details, see the dynamic schemas documentation.
If a result_table name is specified in the options, the results are stored in a new table with that name and no results are returned in the response. Both the table name and resulting column names must adhere to standard naming conventions; column/aggregation expressions will need to be aliased. If the source table's shard key is used as the grouping column(s) and all result records are selected (offset is 0 and limit is -9999), the result table will be sharded; in all other cases it will be replicated. Sorting will function properly only if the result table is replicated or if there is only one processing node, and should not be relied upon in other cases. The result_table option is not available when any of the values of columnNames is an unrestricted-length string.
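The sharded-versus-replicated rule can be written down as a small predicate. This is a sketch of the rule as stated above, not server code; resultTableIsSharded and kEndOfSet are illustrative names, and comparing the shard key to the grouping columns by exact equality (same columns, same order) is an assumption.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// Value documented on this page for END_OF_SET.
const int64_t kEndOfSet = -9999;

// Returns true when, per the paragraph above, the result table would be
// sharded; false means it would be replicated.
// Assumption: "shard key is used as the grouping column(s)" is modelled
// as exact equality of the two column lists.
bool resultTableIsSharded(const std::vector<std::string> &shardKey,
                          const std::vector<std::string> &groupingColumns,
                          int64_t offset,
                          int64_t limit)
{
    const bool groupedOnShardKey =
        !shardKey.empty() && shardKey == groupingColumns;
    const bool allRecordsSelected = (offset == 0 && limit == kEndOfSet);
    return groupedOnShardKey && allRecordsSelected;
}
```

Because sorting is only dependable on a replicated result table (or a single processing node), a check like this can help decide whether sorted output from a result table should be trusted.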
Definition at line 86 of file aggregate_group_by.h.
gpudb::AggregateGroupByRequest::AggregateGroupByRequest () | inline |
Constructs an AggregateGroupByRequest object with default parameters.
Definition at line 92 of file aggregate_group_by.h.
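gpudb::AggregateGroupByRequest::AggregateGroupByRequest (const std::string &tableName_, const std::vector< std::string > &columnNames_, const int64_t offset_, const int64_t limit_, const std::map< std::string, std::string > &options_) | inline |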
Constructs an AggregateGroupByRequest object with the specified parameters.
[in] | tableName_ | Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules. |
[in] | columnNames_ | List of one or more column names, expressions, and aggregate expressions. |
[in] | offset_ | A non-negative integer indicating the number of initial results to skip (this can be useful for paging through the results). The default value is 0. The minimum allowed value is 0. The maximum allowed value is MAX_INT. |
[in] | limit_ | A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned. The number of records returned will never exceed the server's own limit, defined by the max_get_records_size parameter in the server configuration. Use hasMoreRecords to see if more records exist in the result to be fetched, and offset_ & limit_ to request subsequent pages of results. The default value is -9999. |
[in] | options_ | Optional parameters. |
Definition at line 458 of file aggregate_group_by.h.
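gpudb::AggregateGroupByRequest::AggregateGroupByRequest (const std::string &tableName_, const std::vector< std::string > &columnNames_, const int64_t offset_, const int64_t limit_, const std::string &encoding_, const std::map< std::string, std::string > &options_) | inline |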
Constructs an AggregateGroupByRequest object with the specified parameters.
[in] | tableName_ | Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules. |
[in] | columnNames_ | List of one or more column names, expressions, and aggregate expressions. |
[in] | offset_ | A non-negative integer indicating the number of initial results to skip (this can be useful for paging through the results). The default value is 0. The minimum allowed value is 0. The maximum allowed value is MAX_INT. |
[in] | limit_ | A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned. The number of records returned will never exceed the server's own limit, defined by the max_get_records_size parameter in the server configuration. Use hasMoreRecords to see if more records exist in the result to be fetched, and offset_ & limit_ to request subsequent pages of results. The default value is -9999. |
[in] | encoding_ | Specifies the encoding for returned records. Supported values:
|
[in] | options_ | Optional parameters. |
Definition at line 839 of file aggregate_group_by.h.
std::vector<std::string> gpudb::AggregateGroupByRequest::columnNames |
List of one or more column names, expressions, and aggregate expressions.
Definition at line 861 of file aggregate_group_by.h.
std::string gpudb::AggregateGroupByRequest::encoding |
Specifies the encoding for returned records.
Supported values:
The default value is aggregate_group_by_binary.
Definition at line 900 of file aggregate_group_by.h.
int64_t gpudb::AggregateGroupByRequest::limit |
A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned.
The number of records returned will never exceed the server's own limit, defined by the max_get_records_size parameter in the server configuration. Use hasMoreRecords to see if more records exist in the result to be fetched, and offset & limit to request subsequent pages of results. The default value is -9999.
Definition at line 884 of file aggregate_group_by.h.
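The paging behaviour described for limit can be sketched as a loop that keeps requesting pages until hasMoreRecords is false. The code below is a self-contained illustration and does not call the GPUdb client: PageInfo, its recordCount field, and fetchAllGroups are hypothetical names, and fetchPage stands in for whatever code issues the request with the given offset and limit.

```cpp
#include <cstdint>
#include <functional>

// Hypothetical summary of one page of results. Only hasMoreRecords is
// named in the text above; recordCount is an illustrative field.
struct PageInfo
{
    int64_t recordCount;   // number of records returned in this page
    bool hasMoreRecords;   // whether further records remain to be fetched
};

// Requests successive pages until no more records are reported and
// returns the total number of records seen. The offset advances by the
// number of records actually returned, since the server may return
// fewer than the requested limit.
int64_t fetchAllGroups(
    const std::function<PageInfo(int64_t offset, int64_t limit)> &fetchPage,
    int64_t pageSize)
{
    int64_t offset = 0;
    int64_t total = 0;
    for (;;)
    {
        const PageInfo page = fetchPage(offset, pageSize);
        total += page.recordCount;
        offset += page.recordCount;
        // Stop when the server reports completion, or when an empty page
        // would otherwise cause an endless loop.
        if (!page.hasMoreRecords || page.recordCount == 0)
            break;
    }
    return total;
}
```

In real use, fetchPage would presumably build an AggregateGroupByRequest with the supplied offset and limit, and report the values found in the response.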
int64_t gpudb::AggregateGroupByRequest::offset |
A non-negative integer indicating the number of initial results to skip (this can be useful for paging through the results).
The default value is 0. The minimum allowed value is 0. The maximum allowed value is MAX_INT.
Definition at line 869 of file aggregate_group_by.h.
std::map<std::string, std::string> gpudb::AggregateGroupByRequest::options |
Optional parameters.
The default value is an empty map.
Definition at line 1127 of file aggregate_group_by.h.
std::string gpudb::AggregateGroupByRequest::tableName |
Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules.
Definition at line 855 of file aggregate_group_by.h.