public class AggregateUniqueRequest extends Object implements org.apache.avro.generic.IndexedRecord
GPUdb.aggregateUniqueRaw(AggregateUniqueRequest)
.
Returns all the unique values from a particular column
(specified by columnName
) of a particular table or view
(specified by tableName
). If columnName
is a numeric column,
the values will be in binaryEncodedResponse
. Otherwise if
columnName
is a string column, the values will be in
jsonEncodedResponse
. The results can be paged via offset
and limit
parameters.
Columns marked as store-only are unable to be used with this function.
To get the first 10 unique values sorted in descending order options
would be::
{"limit":"10","sort_order":"descending"}.
The response is returned as a dynamic schema. For details see: dynamic schemas documentation.
If a result_table
name is specified in the
options
, the results are stored in a new table with that name--no
results are returned in the response. Both the table name and resulting
column
name must adhere to
standard
naming conventions;
any column expression will need to be aliased. If the source table's
shard
key is used as the
columnName
, the result table will be sharded, in all other cases it
will be replicated. Sorting will properly function only if the result table
is
replicated or if there is only one processing node and should not be relied
upon
in other cases. Not available if the value of columnName
is an
unrestricted-length string.
Modifier and Type | Class and Description |
---|---|
static class |
AggregateUniqueRequest.Encoding
Specifies the encoding for returned records.
|
static class |
AggregateUniqueRequest.Options
Optional parameters.
|
Constructor and Description |
---|
AggregateUniqueRequest()
Constructs an AggregateUniqueRequest object with default parameters.
|
AggregateUniqueRequest(String tableName,
String columnName,
long offset,
long limit,
Map<String,String> options)
Constructs an AggregateUniqueRequest object with the specified
parameters.
|
AggregateUniqueRequest(String tableName,
String columnName,
long offset,
long limit,
String encoding,
Map<String,String> options)
Constructs an AggregateUniqueRequest object with the specified
parameters.
|
Modifier and Type | Method and Description |
---|---|
boolean |
equals(Object obj) |
Object |
get(int index)
This method supports the Avro framework and is not intended to be called
directly by the user.
|
static org.apache.avro.Schema |
getClassSchema()
This method supports the Avro framework and is not intended to be called
directly by the user.
|
String |
getColumnName() |
String |
getEncoding() |
long |
getLimit() |
long |
getOffset() |
Map<String,String> |
getOptions() |
org.apache.avro.Schema |
getSchema()
This method supports the Avro framework and is not intended to be called
directly by the user.
|
String |
getTableName() |
int |
hashCode() |
void |
put(int index,
Object value)
This method supports the Avro framework and is not intended to be called
directly by the user.
|
AggregateUniqueRequest |
setColumnName(String columnName) |
AggregateUniqueRequest |
setEncoding(String encoding) |
AggregateUniqueRequest |
setLimit(long limit) |
AggregateUniqueRequest |
setOffset(long offset) |
AggregateUniqueRequest |
setOptions(Map<String,String> options) |
AggregateUniqueRequest |
setTableName(String tableName) |
String |
toString() |
public AggregateUniqueRequest()
public AggregateUniqueRequest(String tableName, String columnName, long offset, long limit, Map<String,String> options)
tableName
- Name of an existing table or view on which the
operation will be performed, in
[schema_name.]table_name format, using standard name resolution rules.columnName
- Name of the column or an expression containing one or
more column names on which the unique function would
be applied.offset
- A positive integer indicating the number of initial
results to skip (this can be useful for paging through
the results). The default value is 0.The minimum allowed
value is 0. The maximum allowed value is MAX_INT.limit
- A positive integer indicating the maximum number of
results to be returned, or
END_OF_SET (-9999) to indicate that the maximum number of
results allowed by the server should be
returned. The number of records returned will never
exceed the server's own limit, defined by the
max_get_records_size parameter in the
server configuration.
Use hasMoreRecords
to see if more records exist in
the result to be fetched, and
offset
& limit
to request subsequent pages
of results. The default value is -9999.options
- Optional parameters.
CREATE_TEMP_TABLE
: If true
, a unique temporary
table name will be generated in the sys_temp schema and
used in place of result_table
. If result_table_persist
is false
(or unspecified),
then this is always allowed even if the caller does not
have permission to create tables. The generated name is
returned in qualified_result_table_name
.
Supported values:
The default value is FALSE
.
COLLECTION_NAME
: [DEPRECATED--please specify the
containing schema as part of result_table
and
use GPUdb.createSchema(CreateSchemaRequest)
to
create the schema if non-existent] Name of a schema
which is to contain the table specified in result_table
. If the schema provided is non-existent,
it will be automatically created.
EXPRESSION
: Optional filter expression to apply to the
table.
SORT_ORDER
: String indicating how the returned values
should be sorted.
Supported values:
The default value is ASCENDING
.
RESULT_TABLE
: The name of the table used to store the
results, in [schema_name.]table_name format, using
standard name resolution rules and meeting table naming criteria. If present, no
results are returned in the response. Not available if
columnName
is an unrestricted-length string.
RESULT_TABLE_PERSIST
: If true
, then the result
table specified in result_table
will be
persisted and will not expire unless a ttl
is
specified. If false
, then the result table
will be an in-memory table and will expire unless a
ttl
is specified otherwise.
Supported values:
The default value is FALSE
.
RESULT_TABLE_FORCE_REPLICATED
: Force the result table
to be replicated (ignores any sharding). Must be used in
combination with the result_table
option.
Supported values:
The default value is FALSE
.
RESULT_TABLE_GENERATE_PK
: If true
then set a
primary key for the result table. Must be used in
combination with the result_table
option.
Supported values:
The default value is FALSE
.
TTL
: Sets the TTL of the table specified in result_table
.
CHUNK_SIZE
: Indicates the number of records per chunk
to be used for the result table. Must be used in
combination with the result_table
option.
VIEW_ID
: ID of view of which the result table will be a
member. The default value is ''.
Map
.public AggregateUniqueRequest(String tableName, String columnName, long offset, long limit, String encoding, Map<String,String> options)
tableName
- Name of an existing table or view on which the
operation will be performed, in
[schema_name.]table_name format, using standard name resolution rules.columnName
- Name of the column or an expression containing one or
more column names on which the unique function would
be applied.offset
- A positive integer indicating the number of initial
results to skip (this can be useful for paging through
the results). The default value is 0.The minimum allowed
value is 0. The maximum allowed value is MAX_INT.limit
- A positive integer indicating the maximum number of
results to be returned, or
END_OF_SET (-9999) to indicate that the maximum number of
results allowed by the server should be
returned. The number of records returned will never
exceed the server's own limit, defined by the
max_get_records_size parameter in the
server configuration.
Use hasMoreRecords
to see if more records exist in
the result to be fetched, and
offset
& limit
to request subsequent pages
of results. The default value is -9999.encoding
- Specifies the encoding for returned records.
Supported values:
BINARY
: Indicates that the returned records should be
binary encoded.
JSON
: Indicates that the returned records should be
json encoded.
BINARY
.options
- Optional parameters.
CREATE_TEMP_TABLE
: If true
, a unique temporary
table name will be generated in the sys_temp schema and
used in place of result_table
. If result_table_persist
is false
(or unspecified),
then this is always allowed even if the caller does not
have permission to create tables. The generated name is
returned in qualified_result_table_name
.
Supported values:
The default value is FALSE
.
COLLECTION_NAME
: [DEPRECATED--please specify the
containing schema as part of result_table
and
use GPUdb.createSchema(CreateSchemaRequest)
to
create the schema if non-existent] Name of a schema
which is to contain the table specified in result_table
. If the schema provided is non-existent,
it will be automatically created.
EXPRESSION
: Optional filter expression to apply to the
table.
SORT_ORDER
: String indicating how the returned values
should be sorted.
Supported values:
The default value is ASCENDING
.
RESULT_TABLE
: The name of the table used to store the
results, in [schema_name.]table_name format, using
standard name resolution rules and meeting table naming criteria. If present, no
results are returned in the response. Not available if
columnName
is an unrestricted-length string.
RESULT_TABLE_PERSIST
: If true
, then the result
table specified in result_table
will be
persisted and will not expire unless a ttl
is
specified. If false
, then the result table
will be an in-memory table and will expire unless a
ttl
is specified otherwise.
Supported values:
The default value is FALSE
.
RESULT_TABLE_FORCE_REPLICATED
: Force the result table
to be replicated (ignores any sharding). Must be used in
combination with the result_table
option.
Supported values:
The default value is FALSE
.
RESULT_TABLE_GENERATE_PK
: If true
then set a
primary key for the result table. Must be used in
combination with the result_table
option.
Supported values:
The default value is FALSE
.
TTL
: Sets the TTL of the table specified in result_table
.
CHUNK_SIZE
: Indicates the number of records per chunk
to be used for the result table. Must be used in
combination with the result_table
option.
VIEW_ID
: ID of view of which the result table will be a
member. The default value is ''.
Map
.public static org.apache.avro.Schema getClassSchema()
public String getTableName()
public AggregateUniqueRequest setTableName(String tableName)
tableName
- Name of an existing table or view on which the
operation will be performed, in
[schema_name.]table_name format, using standard name resolution rules.this
to mimic the builder pattern.public String getColumnName()
public AggregateUniqueRequest setColumnName(String columnName)
columnName
- Name of the column or an expression containing one or
more column names on which the unique function would
be applied.this
to mimic the builder pattern.public long getOffset()
public AggregateUniqueRequest setOffset(long offset)
offset
- A positive integer indicating the number of initial
results to skip (this can be useful for paging through
the results). The default value is 0.The minimum allowed
value is 0. The maximum allowed value is MAX_INT.this
to mimic the builder pattern.public long getLimit()
hasMoreRecords
to see if more records exist in the
result to be fetched, and
offset
& limit
to request subsequent pages of
results. The default value is -9999.public AggregateUniqueRequest setLimit(long limit)
limit
- A positive integer indicating the maximum number of
results to be returned, or
END_OF_SET (-9999) to indicate that the maximum number of
results allowed by the server should be
returned. The number of records returned will never
exceed the server's own limit, defined by the
max_get_records_size parameter in the
server configuration.
Use hasMoreRecords
to see if more records exist in
the result to be fetched, and
offset
& limit
to request subsequent pages
of results. The default value is -9999.this
to mimic the builder pattern.public String getEncoding()
public AggregateUniqueRequest setEncoding(String encoding)
public Map<String,String> getOptions()
CREATE_TEMP_TABLE
: If true
, a unique temporary table
name will be generated in the sys_temp schema and used in place
of result_table
. If result_table_persist
is
false
(or unspecified), then this is always allowed even
if the caller does not have permission to create tables. The
generated name is returned in qualified_result_table_name
.
Supported values:
The default value is FALSE
.
COLLECTION_NAME
: [DEPRECATED--please specify the containing
schema as part of result_table
and use GPUdb.createSchema(CreateSchemaRequest)
to create the
schema if non-existent] Name of a schema which is to contain
the table specified in result_table
. If the schema
provided is non-existent, it will be automatically created.
EXPRESSION
: Optional filter expression to apply to the table.
SORT_ORDER
: String indicating how the returned values should be
sorted.
Supported values:
The default value is ASCENDING
.
RESULT_TABLE
: The name of the table used to store the results,
in [schema_name.]table_name format, using standard name resolution rules and meeting table naming criteria. If present, no results
are returned in the response. Not available if columnName
is an unrestricted-length string.
RESULT_TABLE_PERSIST
: If true
, then the result table
specified in result_table
will be persisted and will not
expire unless a ttl
is specified. If false
,
then the result table will be an in-memory table and will expire
unless a ttl
is specified otherwise.
Supported values:
The default value is FALSE
.
RESULT_TABLE_FORCE_REPLICATED
: Force the result table to be
replicated (ignores any sharding). Must be used in combination
with the result_table
option.
Supported values:
The default value is FALSE
.
RESULT_TABLE_GENERATE_PK
: If true
then set a primary
key for the result table. Must be used in combination with the
result_table
option.
Supported values:
The default value is FALSE
.
TTL
: Sets
the TTL of the table specified in result_table
.
CHUNK_SIZE
: Indicates the number of records per chunk to be
used for the result table. Must be used in combination with the
result_table
option.
VIEW_ID
: ID of view of which the result table will be a member.
The default value is ''.
Map
.public AggregateUniqueRequest setOptions(Map<String,String> options)
options
- Optional parameters.
CREATE_TEMP_TABLE
: If true
, a unique temporary
table name will be generated in the sys_temp schema and
used in place of result_table
. If result_table_persist
is false
(or unspecified),
then this is always allowed even if the caller does not
have permission to create tables. The generated name is
returned in qualified_result_table_name
.
Supported values:
The default value is FALSE
.
COLLECTION_NAME
: [DEPRECATED--please specify the
containing schema as part of result_table
and
use GPUdb.createSchema(CreateSchemaRequest)
to
create the schema if non-existent] Name of a schema
which is to contain the table specified in result_table
. If the schema provided is non-existent,
it will be automatically created.
EXPRESSION
: Optional filter expression to apply to the
table.
SORT_ORDER
: String indicating how the returned values
should be sorted.
Supported values:
The default value is ASCENDING
.
RESULT_TABLE
: The name of the table used to store the
results, in [schema_name.]table_name format, using
standard name resolution rules and meeting table naming criteria. If present, no
results are returned in the response. Not available if
columnName
is an unrestricted-length string.
RESULT_TABLE_PERSIST
: If true
, then the result
table specified in result_table
will be
persisted and will not expire unless a ttl
is
specified. If false
, then the result table
will be an in-memory table and will expire unless a
ttl
is specified otherwise.
Supported values:
The default value is FALSE
.
RESULT_TABLE_FORCE_REPLICATED
: Force the result table
to be replicated (ignores any sharding). Must be used in
combination with the result_table
option.
Supported values:
The default value is FALSE
.
RESULT_TABLE_GENERATE_PK
: If true
then set a
primary key for the result table. Must be used in
combination with the result_table
option.
Supported values:
The default value is FALSE
.
TTL
: Sets the TTL of the table specified in result_table
.
CHUNK_SIZE
: Indicates the number of records per chunk
to be used for the result table. Must be used in
combination with the result_table
option.
VIEW_ID
: ID of view of which the result table will be a
member. The default value is ''.
Map
.this
to mimic the builder pattern.public org.apache.avro.Schema getSchema()
getSchema
in interface org.apache.avro.generic.GenericContainer
public Object get(int index)
get
in interface org.apache.avro.generic.IndexedRecord
index
- the position of the field to getIndexOutOfBoundsException
public void put(int index, Object value)
put
in interface org.apache.avro.generic.IndexedRecord
index
- the position of the field to setvalue
- the value to setIndexOutOfBoundsException
Copyright © 2024. All rights reserved.