Class AggregateGroupByRequest
All Implemented Interfaces: org.apache.avro.generic.GenericContainer, org.apache.avro.generic.IndexedRecord
A set of parameters for GPUdb.aggregateGroupBy.
Calculates unique combinations (groups) of values for the given columns in a given table or view and computes aggregates on each unique combination. This is somewhat analogous to an SQL-style SELECT…GROUP BY.
For aggregation details and examples, see Aggregation. For limitations, see Aggregation Limitations.
Any column(s) can be grouped on, and all column types except unrestricted-length strings may be used for computing applicable aggregates.
The results can be paged via the offset and limit parameters. For example, to get the 10 groups with the largest counts, the inputs would be: limit=10, options={“sort_order”:“descending”, “sort_by”:“value”}.
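The paging behavior described above can be illustrated without a server connection. The following is a minimal sketch, not the library's implementation: the class GroupByPaging, the in-memory list of groups, and the serverMax argument (a stand-in for the server's max_get_records_size cap) are all hypothetical, and hasMoreRecords is modeled on the response field of the same name mentioned in the limit description.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical, library-free model of offset/limit paging over group-by results. */
final class GroupByPaging {
    /** Sentinel documented for limit: return as many results as the server allows. */
    static final long END_OF_SET = -9999L;

    /**
     * Returns the slice of groups selected by offset and limit.
     * serverMax is an assumed stand-in for the server's max_get_records_size cap.
     */
    static <T> List<T> page(List<T> groups, long offset, long limit, int serverMax) {
        if (offset < 0) {
            throw new IllegalArgumentException("offset must be >= 0");
        }
        if (limit < 0 && limit != END_OF_SET) {
            throw new IllegalArgumentException("limit must be >= 0 or END_OF_SET");
        }
        long effectiveLimit = (limit == END_OF_SET) ? serverMax : Math.min(limit, serverMax);
        int from = (int) Math.min(offset, groups.size());
        int to = (int) Math.min(from + effectiveLimit, groups.size());
        return new ArrayList<>(groups.subList(from, to));
    }

    /** True when more groups remain after the page that was just returned. */
    static boolean hasMoreRecords(List<?> groups, long offset, int returned) {
        return offset + returned < groups.size();
    }

    public static void main(String[] args) {
        List<String> groups = Arrays.asList("a", "b", "c", "d", "e");
        long offset = 0;
        while (true) {
            // Request pages of two groups until nothing remains.
            List<String> current = page(groups, offset, 2, 100);
            System.out.println("offset=" + offset + " -> " + current);
            if (!hasMoreRecords(groups, offset, current.size())) {
                break;
            }
            offset += current.size();
        }
    }
}
```

In a real client, the loop would advance offset by the number of records returned and stop once the response reports that no more records exist.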
options can be used to customize the behavior of this call, e.g., by filtering or sorting the results.
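As a rough illustration, the top-10 example can be expressed as a Java options map. This sketch uses only java.util; the class name TopGroupsOptions and the alias cnt are hypothetical. The keys sort_order, sort_by, and order_by are the ones named in this page. Note that the option reference under Constructor Details marks sort_order and sort_by as deprecated in favor of order_by, so the second helper shows that form.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper that builds options maps for sorted group-by requests. */
final class TopGroupsOptions {
    /** Options from the example: sort by aggregate value, descending. */
    static Map<String, String> sortByValueDescending() {
        Map<String, String> options = new LinkedHashMap<>();
        options.put("sort_order", "descending");
        options.put("sort_by", "value");
        return options;
    }

    /** Alternative using order_by, e.g. "cnt desc" for an aggregate aliased as cnt. */
    static Map<String, String> orderBy(String orderByClause) {
        Map<String, String> options = new LinkedHashMap<>();
        options.put("order_by", orderByClause);
        return options;
    }

    public static void main(String[] args) {
        long limit = 10; // return at most 10 groups
        System.out.println("limit=" + limit + ", options=" + sortByValueDescending());
        System.out.println("limit=" + limit + ", options=" + orderBy("cnt desc"));
    }
}
```

Either map would be passed as the options argument of the AggregateGroupByRequest constructor, together with limit set to 10.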
To group by columns ‘x’ and ‘y’ and compute the number of objects within each group, use: column_names=[‘x’,‘y’,‘count(*)’].
To also compute the sum of ‘z’ over each group, use: column_names=[‘x’,‘y’,‘count(*)’,‘sum(z)’].
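The two column_names examples above can be assembled in Java as follows. This is a minimal sketch using only java.util; the class GroupByColumns and its helper are hypothetical, and placing grouping columns before aggregates simply mirrors the ordering used in the examples.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper that assembles a columnNames list for a group-by request. */
final class GroupByColumns {
    /** Concatenates grouping columns and aggregate expressions into one list. */
    static List<String> columnNames(List<String> groupingColumns, List<String> aggregates) {
        List<String> columnNames = new ArrayList<>(groupingColumns);
        columnNames.addAll(aggregates);
        return columnNames;
    }

    public static void main(String[] args) {
        List<String> groupBy = Arrays.asList("x", "y");

        // Group by x and y, counting the objects in each group.
        System.out.println(columnNames(groupBy, Arrays.asList("count(*)")));

        // Same grouping, additionally summing z over each group.
        System.out.println(columnNames(groupBy, Arrays.asList("count(*)", "sum(z)")));
    }
}
```

The resulting list is what would be supplied as the columnNames argument of the request.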
Available aggregation functions are: count(*), sum, min, max, avg, mean, stddev, stddev_pop, stddev_samp, var, var_pop, var_samp, arg_min, arg_max and count_distinct.
Available grouping functions are Rollup, Cube, and Grouping Sets.
This service also provides support for Pivot operations.
Filtering on aggregates is supported via expressions using aggregation functions supplied to HAVING.
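A pre-aggregation filter and a HAVING filter can be combined in one options map. The sketch below is an assumption-laden illustration: the lowercase keys expression and having are assumed to be the string values behind the EXPRESSION and HAVING option constants (by analogy with the lowercase sort_order and sort_by keys shown earlier), and the class name and filter expressions are hypothetical.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper that builds filter options for a group-by request. */
final class HavingFilterOptions {
    /**
     * preFilter is applied to the table before grouping; having is applied to
     * the aggregated results. Null or empty filters are omitted from the map.
     * The key names are assumptions, not confirmed by this page.
     */
    static Map<String, String> filters(String preFilter, String having) {
        Map<String, String> options = new LinkedHashMap<>();
        if (preFilter != null && !preFilter.isEmpty()) {
            options.put("expression", preFilter);
        }
        if (having != null && !having.isEmpty()) {
            options.put("having", having);
        }
        return options;
    }

    public static void main(String[] args) {
        // Keep only rows with z > 0, then keep only groups whose sum of z exceeds 100.
        System.out.println(filters("z > 0", "sum(z) > 100"));
    }
}
```

The distinction mirrors SQL: the expression filter plays the role of WHERE, while the having filter is evaluated after the aggregates are computed.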
The response is returned as a dynamic schema. For details see: dynamic schemas documentation.
If a RESULT_TABLE name is specified in the options, the results are stored in a new table with that name and no results are returned in the response. Both the table name and resulting column names must adhere to standard naming conventions; column/aggregation expressions will need to be aliased. If the source table’s shard key is used as the grouping column(s) and all result records are selected (offset is 0 and limit is -9999), the result table will be sharded; in all other cases it will be replicated. Sorting will function properly only if the result table is replicated or if there is only one processing node, and should not be relied upon in other cases. A result table is not available when any of the values of columnNames is an unrestricted-length string.
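The aliasing requirement and the sharding rule for result tables can be sketched as follows. This is an illustration only: the class ResultTableSketch, the aliases cnt and sum_z, and the table name example_schema.group_summary are hypothetical, the lowercase key result_table is an assumed string value for the RESULT_TABLE option constant, and the predicate merely restates the rule in the paragraph above rather than querying a server.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch of result-table inputs and the documented sharding rule. */
final class ResultTableSketch {
    /** Sentinel documented for limit: select all result records. */
    static final long END_OF_SET = -9999L;

    /** Appends an alias to a column or aggregate expression. */
    static String alias(String expression, String aliasName) {
        return expression + " as " + aliasName;
    }

    /**
     * Restates the documented rule: the result table is sharded only when the
     * grouping columns are the source table's shard key and all records are
     * selected (offset 0, limit END_OF_SET); otherwise it is replicated.
     */
    static boolean resultTableIsSharded(boolean groupsOnShardKey, long offset, long limit) {
        return groupsOnShardKey && offset == 0 && limit == END_OF_SET;
    }

    public static void main(String[] args) {
        // Aggregate expressions are aliased so the result columns have valid names.
        List<String> columnNames = Arrays.asList(
                "x", "y", alias("count(*)", "cnt"), alias("sum(z)", "sum_z"));

        Map<String, String> options = new LinkedHashMap<>();
        options.put("result_table", "example_schema.group_summary"); // assumed key, hypothetical name

        System.out.println(columnNames + " " + options);
        System.out.println("sharded: " + resultTableIsSharded(true, 0, END_OF_SET));
        System.out.println("sharded: " + resultTableIsSharded(true, 0, 10));
    }
}
```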
Nested Class Summary
Nested Classes
Modifier and Type | Description
static final class | A set of string constants for the AggregateGroupByRequest parameter encoding.
static final class | A set of string constants for the AggregateGroupByRequest parameter options.
Constructor Summary
Constructors
Constructor | Description
AggregateGroupByRequest() | Constructs an AggregateGroupByRequest object with default parameters.
AggregateGroupByRequest(String tableName, List<String> columnNames, long offset, long limit, String encoding, Map<String, String> options) | Constructs an AggregateGroupByRequest object with the specified parameters.
AggregateGroupByRequest(String tableName, List<String> columnNames, long offset, long limit, Map<String, String> options) | Constructs an AggregateGroupByRequest object with the specified parameters.
Method Summary
Modifier and Type | Method | Description
boolean | equals(Object obj) |
Object | get(int index) | This method supports the Avro framework and is not intended to be called directly by the user.
static org.apache.avro.Schema | getClassSchema() | This method supports the Avro framework and is not intended to be called directly by the user.
List<String> | getColumnNames() | List of one or more column names, expressions, and aggregate expressions.
String | getEncoding() | Specifies the encoding for returned records.
long | getLimit() | A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned.
long | getOffset() | A positive integer indicating the number of initial results to skip (this can be useful for paging through the results).
Map<String, String> | getOptions() | Optional parameters.
org.apache.avro.Schema | getSchema() | This method supports the Avro framework and is not intended to be called directly by the user.
String | getTableName() | Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules.
int | hashCode() |
void | put(int index, Object value) | This method supports the Avro framework and is not intended to be called directly by the user.
AggregateGroupByRequest | setColumnNames(List<String> columnNames) | List of one or more column names, expressions, and aggregate expressions.
AggregateGroupByRequest | setEncoding(String encoding) | Specifies the encoding for returned records.
AggregateGroupByRequest | setLimit(long limit) | A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned.
AggregateGroupByRequest | setOffset(long offset) | A positive integer indicating the number of initial results to skip (this can be useful for paging through the results).
AggregateGroupByRequest | setOptions(Map<String, String> options) | Optional parameters.
AggregateGroupByRequest | setTableName(String tableName) | Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules.
String | toString() |
Constructor Details
AggregateGroupByRequest
public AggregateGroupByRequest()
Constructs an AggregateGroupByRequest object with default parameters.
AggregateGroupByRequest
public AggregateGroupByRequest(String tableName, List<String> columnNames, long offset, long limit, Map<String, String> options)
Constructs an AggregateGroupByRequest object with the specified parameters.
Parameters:
tableName - Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules.
columnNames - List of one or more column names, expressions, and aggregate expressions.
offset - A positive integer indicating the number of initial results to skip (this can be useful for paging through the results). The default value is 0. The minimum allowed value is 0. The maximum allowed value is MAX_INT.
limit - A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned. The number of records returned will never exceed the server’s own limit, defined by the max_get_records_size parameter in the server configuration. Use hasMoreRecords to see if more records exist in the result to be fetched, and offset and limit to request subsequent pages of results. The default value is -9999.
options - Optional parameters.
    CREATE_TEMP_TABLE: If TRUE, a unique temporary table name will be generated in the sys_temp schema and used in place of RESULT_TABLE. If RESULT_TABLE_PERSIST is FALSE (or unspecified), then this is always allowed even if the caller does not have permission to create tables. The generated name is returned in QUALIFIED_RESULT_TABLE_NAME. Supported values: TRUE, FALSE. The default value is FALSE.
    COLLECTION_NAME: [DEPRECATED: please specify the containing schema as part of RESULT_TABLE and use GPUdb.createSchema to create the schema if non-existent] Name of a schema which is to contain the table specified in RESULT_TABLE. If the schema provided is non-existent, it will be automatically created.
    EXPRESSION: Filter expression to apply to the table prior to computing the aggregate group by.
    PIPELINED_EXPRESSION_EVALUATION: Evaluate the group-by during last JoinedSet filter plan step. Supported values: TRUE, FALSE. The default value is FALSE.
    HAVING: Filter expression to apply to the aggregated results.
    SORT_ORDER: [DEPRECATED: use order_by instead] String indicating how the returned values should be sorted: ascending or descending. Supported values:
        ASCENDING: Indicates that the returned values should be sorted in ascending order.
        DESCENDING: Indicates that the returned values should be sorted in descending order.
        The default value is ASCENDING.
    SORT_BY: [DEPRECATED: use order_by instead] String determining how the results are sorted. Supported values:
        KEY: Indicates that the returned values should be sorted by key, which corresponds to the grouping columns. If you have multiple grouping columns (and are sorting by key), it will first sort the first grouping column, then the second grouping column, etc.
        VALUE: Indicates that the returned values should be sorted by value, which corresponds to the aggregates. If you have multiple aggregates (and are sorting by value), it will first sort by the first aggregate, then the second aggregate, etc.
        The default value is VALUE.
    ORDER_BY: Comma-separated list of the columns to be sorted by as well as the sort direction, e.g., ‘timestamp asc, x desc’. The default value is ''.
    STRATEGY_DEFINITION: The tier strategy for the table and its columns.
    COMPRESSION_CODEC: The default compression codec for the result table’s columns.
    RESULT_TABLE: The name of a table used to store the results, in [schema_name.]table_name format, using standard name resolution rules and meeting table naming criteria. Column names (group-by and aggregate fields) need to be given aliases, e.g., [“FChar256 as fchar256”, “sum(FDouble) as sfd”]. If present, no results are returned in the response. This option is not available if one of the grouping attributes is an unrestricted string (i.e., not charN) type.
    RESULT_TABLE_PERSIST: If TRUE, then the result table specified in RESULT_TABLE will be persisted and will not expire unless a TTL is specified. If FALSE, then the result table will be an in-memory table and will expire unless a TTL is specified otherwise. Supported values: TRUE, FALSE. The default value is FALSE.
    RESULT_TABLE_FORCE_REPLICATED: Force the result table to be replicated (ignores any sharding). Must be used in combination with the RESULT_TABLE option. Supported values: TRUE, FALSE. The default value is FALSE.
    RESULT_TABLE_GENERATE_PK: If TRUE, then set a primary key for the result table. Must be used in combination with the RESULT_TABLE option. Supported values: TRUE, FALSE. The default value is FALSE.
    RESULT_TABLE_GENERATE_SOFT_PK: If TRUE, then set a soft primary key for the result table. Must be used in combination with the RESULT_TABLE option. Supported values: TRUE, FALSE. The default value is FALSE.
    TTL: Sets the TTL of the table specified in RESULT_TABLE.
    CHUNK_SIZE: Indicates the number of records per chunk to be used for the result table. Must be used in combination with the RESULT_TABLE option.
    CHUNK_COLUMN_MAX_MEMORY: Indicates the target maximum data size for each column in a chunk to be used for the result table. Must be used in combination with the RESULT_TABLE option.
    CHUNK_MAX_MEMORY: Indicates the target maximum data size for all columns in a chunk to be used for the result table. Must be used in combination with the RESULT_TABLE option.
    CREATE_INDEXES: Comma-separated list of columns on which to create indexes on the result table. Must be used in combination with the RESULT_TABLE option.
    PARTITION_TYPE: Partitioning scheme to use for the result table. Supported values:
        RANGE: Use range partitioning.
        INTERVAL: Use interval partitioning.
        LIST: Use list partitioning.
        HASH: Use hash partitioning.
        SERIES: Use series partitioning.
    PARTITION_KEYS: Comma-separated list of partition keys, which are the columns or column expressions by which records will be assigned to partitions defined by PARTITION_DEFINITIONS.
    PARTITION_DEFINITIONS: Comma-separated list of partition definitions, whose format depends on the choice of PARTITION_TYPE. See range partitioning, interval partitioning, list partitioning, hash partitioning, or series partitioning for example formats.
    IS_AUTOMATIC_PARTITION: If TRUE, a new partition will be created for values which don’t fall into an existing partition. Currently only supported for list partitions. Supported values: TRUE, FALSE. The default value is FALSE.
    VIEW_ID: ID of view of which the result table will be a member. The default value is ''.
    PIVOT: Pivot column.
    PIVOT_VALUES: Comma-separated list of the values in the PIVOT column. The list provided will become the column header prefixes in the output.
    GROUPING_SETS: Customize the grouping attribute sets to compute the aggregates. These sets can include ROLLUP or CUBE operators. The attribute sets should be enclosed in parentheses and can include composite attributes. All attributes specified in the grouping sets must be present in the group-by attributes.
    ROLLUP: This option is used to specify the multilevel aggregates.
    CUBE: This option is used to specify the multidimensional aggregates.
    SHARD_KEY: Comma-separated list of the columns to be sharded on, e.g., ‘column1, column2’. The columns specified must be present in columnNames. If any alias is given for any column name, the alias must be used, rather than the original column name. The default value is ''.
    The default value is an empty Map.
AggregateGroupByRequest
public AggregateGroupByRequest(String tableName, List<String> columnNames, long offset, long limit, String encoding, Map<String, String> options)
Constructs an AggregateGroupByRequest object with the specified parameters.
Parameters:
tableName - Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules.
columnNames - List of one or more column names, expressions, and aggregate expressions.
offset - A positive integer indicating the number of initial results to skip (this can be useful for paging through the results). The default value is 0. The minimum allowed value is 0. The maximum allowed value is MAX_INT.
limit - A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned. The number of records returned will never exceed the server’s own limit, defined by the max_get_records_size parameter in the server configuration. Use hasMoreRecords to see if more records exist in the result to be fetched, and offset and limit to request subsequent pages of results. The default value is -9999.
encoding - Specifies the encoding for returned records. Supported values:
    BINARY: Indicates that the returned records should be binary encoded.
    JSON: Indicates that the returned records should be JSON-encoded.
    The default value is BINARY.
options - Optional parameters. The supported options and their defaults are identical to those listed for the options parameter of the preceding constructor. The default value is an empty Map.
Method Details
getClassSchema
public static org.apache.avro.Schema getClassSchema()
This method supports the Avro framework and is not intended to be called directly by the user.
Returns: The schema for the class.
getTableName
public String getTableName()
Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules.
Returns: The current value of tableName.
setTableName
public AggregateGroupByRequest setTableName(String tableName)
Name of an existing table or view on which the operation will be performed, in [schema_name.]table_name format, using standard name resolution rules.
Parameters:
tableName - The new value for tableName.
Returns: this to mimic the builder pattern.
setColumnNames
public AggregateGroupByRequest setColumnNames(List<String> columnNames)
List of one or more column names, expressions, and aggregate expressions.
Parameters:
columnNames - The new value for columnNames.
Returns: this to mimic the builder pattern.
getOffset
public long getOffset()
A positive integer indicating the number of initial results to skip (this can be useful for paging through the results). The default value is 0. The minimum allowed value is 0. The maximum allowed value is MAX_INT.
Returns: The current value of offset.
setOffset
public AggregateGroupByRequest setOffset(long offset)
A positive integer indicating the number of initial results to skip (this can be useful for paging through the results). The default value is 0. The minimum allowed value is 0. The maximum allowed value is MAX_INT.
Parameters:
offset - The new value for offset.
Returns: this to mimic the builder pattern.
getLimit
public long getLimit()
A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned. The number of records returned will never exceed the server’s own limit, defined by the max_get_records_size parameter in the server configuration. Use hasMoreRecords to see if more records exist in the result to be fetched, and offset and limit to request subsequent pages of results. The default value is -9999.
Returns: The current value of limit.
setLimit
public AggregateGroupByRequest setLimit(long limit)
A positive integer indicating the maximum number of results to be returned, or END_OF_SET (-9999) to indicate that the maximum number of results allowed by the server should be returned. The number of records returned will never exceed the server’s own limit, defined by the max_get_records_size parameter in the server configuration. Use hasMoreRecords to see if more records exist in the result to be fetched, and offset and limit to request subsequent pages of results. The default value is -9999.
Parameters:
limit - The new value for limit.
Returns: this to mimic the builder pattern.
setEncoding
public AggregateGroupByRequest setEncoding(String encoding)
Specifies the encoding for returned records. Supported values:
    BINARY: Indicates that the returned records should be binary encoded.
    JSON: Indicates that the returned records should be JSON-encoded.
The default value is BINARY.
Parameters:
encoding - The new value for encoding.
Returns: this to mimic the builder pattern.
getOptions
public Map<String, String> getOptions()
Optional parameters. The supported options and their defaults are the same as those listed for the options parameter under Constructor Details. The default value is an empty Map.
Returns: The current value of options.
setOptions
public AggregateGroupByRequest setOptions(Map<String, String> options)
Optional parameters. The supported options and their defaults are the same as those listed for the options parameter under Constructor Details. The default value is an empty Map.
Parameters:
options - The new value for options.
Returns: this to mimic the builder pattern.
getSchema
public org.apache.avro.Schema getSchema()
This method supports the Avro framework and is not intended to be called directly by the user.
Specified by: getSchema in interface org.apache.avro.generic.GenericContainer
Returns: The schema object describing this class.
get
public Object get(int index)
This method supports the Avro framework and is not intended to be called directly by the user.
Specified by: get in interface org.apache.avro.generic.IndexedRecord
Parameters:
index - the position of the field to get
Returns: value of the field with the given index.
Throws:
put
public void put(int index, Object value)
This method supports the Avro framework and is not intended to be called directly by the user.
Specified by: put in interface org.apache.avro.generic.IndexedRecord
Parameters:
index - the position of the field to set
value - the value to set
Throws: