GPUdb API object that provides access to GPUdb server functions.
Constructor
new GPUdb(url [, options])
Creates a GPUdb API object for the specified URL using the given options.
Once created, all options are immutable; to use a different URL or change
options, create a new instance. (Creating a new instance does not
communicate with the server and should not cause performance concerns.)
Parameters:
Name
Type
Attributes
Description
url
String
|
Array.<String>
The URL of the GPUdb server (e.g.,
http://hostname:9191). May also be specified as
a list of URLs; all URLs in the list must be well-formed.
options
Object
<optional>
A set of configurable options for the GPUdb API.
Properties
Name
Type
Attributes
Description
username
String
<optional>
The username to be used for authentication
to GPUdb. This username will be sent with every GPUdb request
made via the API along with the specified password and may be
used for authorization decisions by the server if it is so
configured. If neither username nor password is specified, no
authentication will be performed.
password
String
<optional>
The password to be used for authentication
to GPUdb. This password will be sent with every GPUdb request
made via the API along with the specified username and may be
used for authorization decisions by the server if it is so
configured. If neither username nor password is specified, no
authentication will be performed.
timeout
Number
<optional>
The timeout value, in milliseconds, after
which requests to GPUdb will be aborted. A timeout value of
zero is interpreted as an infinite timeout. Note that timeout
is not supported for synchronous requests, which will not
return until a response is received and cannot be aborted.
The timeout value, in milliseconds, after which requests to GPUdb
will be aborted. A timeout of zero is interpreted as an infinite
timeout. Will be zero if none was provided to the GPUdb constructor.
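A minimal construction sketch (host names, credentials, and timeout values below are placeholders):

var gpudb = new GPUdb("http://gpudb-host:9191", {
    username: "a_user",      // hypothetical credentials
    password: "a_password",
    timeout: 60000           // abort requests after 60 seconds
});

// Multiple URLs may be passed as an array:
var gpudb_multi = new GPUdb(["http://host-a:9191", "http://host-b:9191"]);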
Decodes a JSON string, or array of JSON strings, returned from GPUdb into
JSON object(s). Special treatment for quoted "Infinity", "-Infinity",
and "NaN". Catches those and converts to null. This is significantly
slower than the regular decode function.
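As a rough sketch of the technique described above (not the library's actual implementation), the quoted non-finite tokens can be rewritten to null before parsing:

// Hypothetical helper: converts quoted "Infinity", "-Infinity", and
// "NaN" values to null, then parses. The extra regex pass is what
// makes this approach slower than a plain JSON.parse.
function decodeNoInfNan(json) {
    return JSON.parse(json.replace(/"-?Infinity"|"NaN"/g, "null"));
}

decodeNoInfNan('{"x": "Infinity", "y": 2}');  // => { x: null, y: 2 }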
Perform the requested action on a list of one or more job(s). Based on the
type of job and the current state of execution, the action may not be
successfully executed. The final result of the attempted actions for each
specified job is returned in the status array of the response. See Job Manager for
more information.
Parameters:
Name
Type
Description
job_ids
Array.<Number>
Jobs to be modified.
action
String
Action to be performed on the jobs specified by
job_ids.
Supported values:
Perform the requested action on a list of one or more job(s). Based on the
type of job and the current state of execution, the action may not be
successfully executed. The final result of the attempted actions for each
specified job is returned in the status array of the response. See Job Manager for
more information.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
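A usage sketch of the request-object form (the job ID is hypothetical, 'cancel' is assumed to be a supported action, and the error-first callback convention is assumed):

gpudb.admin_alter_jobs_request(
    { job_ids: [12345], action: "cancel", options: {} },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.status);  // final result per attempted action
    }
);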
Retrieves a list of the most recent alerts generated. The number of alerts
to retrieve is specified in this request.
Returns lists of alert data, from earliest to latest.
Parameters:
Name
Type
Description
num_alerts
Number
Number of most recent alerts to request. The
response will return num_alerts
alerts, or fewer if fewer exist in the system.
A value of 0 returns all stored alerts.
Retrieves a list of the most recent alerts generated. The number of alerts
to retrieve is specified in this request.
Returns lists of alert data, from earliest to latest.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Response object containing the results of the
operation.
Type
Object
admin_show_shards(options, callback) → {Object}
Show the mapping of shards to the corresponding rank and tom. The response
message contains a list of 16384 (total number of shards in the system) Rank
and TOM numbers corresponding to each shard.
Show the mapping of shards to the corresponding rank and tom. The response
message contains a list of 16384 (total number of shards in the system) Rank
and TOM numbers corresponding to each shard.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Response object containing the results of the
operation.
Type
Object
admin_verify_db(options, callback) → {Object}
Verify that the database is in a consistent state. When inconsistencies or errors
are found, the verified_ok flag in the response is set to false and the list
of errors found is provided in the error_list.
Verify that the database is in a consistent state. When inconsistencies or errors
are found, the verified_ok flag in the response is set to false and the list
of errors found is provided in the error_list.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates unique combinations (groups) of values for the given columns in a
given table/view/collection and computes aggregates on each unique
combination. This is somewhat analogous to an SQL-style SELECT...GROUP BY.
Any column(s) can be grouped on, and all column types except
unrestricted-length strings may be used for computing applicable aggregates;
columns marked as store-only are unable to be used in grouping or
aggregation.
The results can be paged via the offset and limit
parameters. For example, to get 10 groups with the largest counts the inputs
would be: limit=10, options={"sort_order":"descending", "sort_by":"value"}.
options can be used to customize behavior of this call e.g.
filtering or sorting the results.
To group by columns 'x' and 'y' and compute the number of objects within
each group, use: column_names=['x','y','count(*)'].
To also compute the sum of 'z' over each group, use:
column_names=['x','y','count(*)','sum(z)'].
Available aggregation functions are: count(*), sum, min, max, avg,
mean, stddev, stddev_pop, stddev_samp, var, var_pop, var_samp, arg_min,
arg_max and count_distinct.
If a result_table name is specified in the
options, the results are stored in a new table with that
name--no results are returned in the response. Both the table name and
resulting column names must adhere to standard naming
conventions; column/aggregation expressions will need to be aliased. If
the source table's shard key is used as the grouping column(s) and all result
records are selected (offset is 0 and limit is
-9999), the result table will be sharded; in all other cases it will be
replicated. Sorting will properly function only if the result table is
replicated or if there is only one processing node and should not be relied
upon in other cases. Not available when any of the values of
column_names is an unrestricted-length string.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the operation will be
performed. Must be an existing
table/view/collection.
column_names
Array.<String>
List of one or more column names,
expressions, and aggregate expressions.
offset
Number
A positive integer indicating the number of initial
results to skip (this can be useful for paging
through the results).
limit
Number
A positive integer indicating the maximum number of
results to be returned, or END_OF_SET (-9999) to
indicate that the max number of results should be
returned.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the table specified
in result_table. If the collection
provided is non-existent, the collection will be
automatically created. If empty, then the table
will be a top-level table. Additionally this
option is invalid if table_name is a
collection.
'expression': Filter expression to
apply to the table prior to computing the aggregate
group by.
'having': Filter expression to apply
to the aggregated results.
'sort_order': String indicating how
the returned values should be sorted - ascending or
descending.
Supported values:
'ascending': Indicates that the
returned values should be sorted in ascending
order.
'descending': Indicates that the
returned values should be sorted in descending
order.
The default value is 'ascending'.
'sort_by': String determining how the
results are sorted.
Supported values:
'key': Indicates that the returned
values should be sorted by key, which corresponds
to the grouping columns. If you have multiple
grouping columns (and are sorting by key), it will
first sort the first grouping column, then the
second grouping column, etc.
'value': Indicates that the returned
values should be sorted by value, which corresponds
to the aggregates. If you have multiple aggregates
(and are sorting by value), it will first sort by
the first aggregate, then the second aggregate,
etc.
The default value is 'value'.
'result_table': The name of the table
used to store the results. Has the same naming
restrictions as tables. Column names (group-by
and aggregate fields) need to be given aliases e.g.
["FChar256 as fchar256", "sum(FDouble) as sfd"].
If present, no results are returned in the
response. This option is not available if one of
the grouping attributes is an unrestricted string
(i.e., not charN) type.
'result_table_persist': If
true, then the result table specified
in result_table will be persisted and
will not expire unless a ttl is
specified. If false, then the result
table will be an in-memory table and will expire
unless a ttl is specified otherwise.
Supported values:
'true'
'false'
The default value is 'false'.
'result_table_force_replicated': Force
the result table to be replicated (ignores any
sharding). Must be used in combination with the
result_table option.
Supported values:
'true'
'false'
The default value is 'false'.
'result_table_generate_pk': If 'true'
then set a primary key for the result table. Must
be used in combination with the
result_table option.
Supported values:
'true'
'false'
The default value is 'false'.
'ttl': Sets the TTL of the table specified in
result_table.
'chunk_size': Indicates the chunk size
to be used for the result table. Must be used in
combination with the result_table
option.
'create_indexes': Comma-separated list
of columns on which to create indexes on the result
table. Must be used in combination with the
result_table option.
'view_id': ID of the view this result table is
part of. The default value is ''.
'materialize_on_gpu': If
true then the columns of the groupby
result table will be cached on the GPU. Must be
used in combination with the
result_table option.
Supported values:
'true'
'false'
The default value is 'false'.
'pivot': The pivot column.
'pivot_values': The value list
provided will become the column headers in the
output. Should be the values from the pivot_column.
'grouping_sets': Customize the
grouping attribute sets to compute the aggregates.
These sets can include ROLLUP or CUBE operators.
The attribute sets should be enclosed in
parentheses and can include composite attributes.
All attributes specified in the grouping sets must
be present in the groupby attributes.
'rollup': This option is used to
specify the multilevel aggregates.
'cube': This option is used to specify
the multidimensional aggregates.
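Putting the paging and sorting options together, a sketch of a group-by call (table and column names are placeholders):

// Ten (x, y) groups with the largest counts, plus sum(z) per group.
gpudb.aggregate_group_by(
    "my_table",
    ["x", "y", "count(*)", "sum(z)"],
    0,     // offset: skip no initial results
    10,    // limit
    { sort_order: "descending", sort_by: "value" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    }
);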
Calculates unique combinations (groups) of values for the given columns in a
given table/view/collection and computes aggregates on each unique
combination. This is somewhat analogous to an SQL-style SELECT...GROUP BY.
Any column(s) can be grouped on, and all column types except
unrestricted-length strings may be used for computing applicable aggregates;
columns marked as store-only are unable to be used in grouping or
aggregation.
The results can be paged via the offset and limit
parameters. For example, to get 10 groups with the largest counts the inputs
would be: limit=10, options={"sort_order":"descending", "sort_by":"value"}.
options can be used to customize behavior of this call e.g.
filtering or sorting the results.
To group by columns 'x' and 'y' and compute the number of objects within
each group, use: column_names=['x','y','count(*)'].
To also compute the sum of 'z' over each group, use:
column_names=['x','y','count(*)','sum(z)'].
Available aggregation functions are: count(*), sum, min, max, avg,
mean, stddev, stddev_pop, stddev_samp, var, var_pop, var_samp, arg_min,
arg_max and count_distinct.
If a result_table name is specified in the
options, the results are stored in a new table with that
name--no results are returned in the response. Both the table name and
resulting column names must adhere to standard naming
conventions; column/aggregation expressions will need to be aliased. If
the source table's shard key is used as the grouping column(s) and all result
records are selected (offset is 0 and limit is
-9999), the result table will be sharded; in all other cases it will be
replicated. Sorting will properly function only if the result table is
replicated or if there is only one processing node and should not be relied
upon in other cases. Not available when any of the values of
column_names is an unrestricted-length string.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Performs a histogram calculation given a table, a column, and an interval
function. The interval is used to produce bins of that size and
the result, computed over the records falling within each bin, is returned.
For each bin, the start value is inclusive, but the end value is
exclusive--except for the very last bin for which the end value is also
inclusive. The value returned for each bin is the number of records in it,
except when a column name is provided as a value_column. In
this latter case the sum of the values corresponding to the
value_column is used as the result instead. The total number
of bins requested cannot exceed 10,000.
NOTE: The Kinetica instance being accessed must be running a CUDA
(GPU-based) build to service a request that specifies a
value_column option.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the operation will be
performed. Must be an existing table or
collection.
column_name
String
Name of a column or an expression of one or
more column names over which the histogram will
be calculated.
start
Number
Lower end value of the histogram interval, inclusive.
end
Number
Upper end value of the histogram interval, inclusive.
interval
Number
The size of each bin within the start and end
parameters.
options
Object
Optional parameters.
'value_column': The name of the column
to use when calculating the bin values (values are
summed). The column must be a numerical type (int,
double, long, float).
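A minimal sketch (table name, column, and bounds are illustrative):

// Count records of "my_table" in bins of width 10 over [0, 100].
gpudb.aggregate_histogram("my_table", "x", 0, 100, 10, {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);  // per-bin counts, per the description above
    });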
Performs a histogram calculation given a table, a column, and an interval
function. The interval is used to produce bins of that size and
the result, computed over the records falling within each bin, is returned.
For each bin, the start value is inclusive, but the end value is
exclusive--except for the very last bin for which the end value is also
inclusive. The value returned for each bin is the number of records in it,
except when a column name is provided as a value_column. In
this latter case the sum of the values corresponding to the
value_column is used as the result instead. The total number
of bins requested cannot exceed 10,000.
NOTE: The Kinetica instance being accessed must be running a CUDA
(GPU-based) build to service a request that specifies a
value_column option.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
This endpoint runs the k-means algorithm - a heuristic algorithm that
attempts to do k-means clustering. An ideal k-means clustering algorithm
selects k points such that the sum of the mean squared distances of each
member of the set to the nearest of the k points is minimized. The k-means
algorithm however does not necessarily produce such an ideal cluster. It
begins with a randomly selected set of k points and then refines the
location of the points iteratively and settles to a local minimum. Various
parameters and options are provided to control the heuristic search.
NOTE: The Kinetica instance being accessed must be running a CUDA
(GPU-based) build to service this request.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the operation will be
performed. Must be an existing table or
collection.
column_names
Array.<String>
List of column names on which the operation
would be performed. If n columns are
provided then each of the k result points
will have n dimensions corresponding to the
n columns.
k
Number
The number of mean points to be determined by the
algorithm.
tolerance
Number
Stop iterating when the distances between
successive points is less than the given
tolerance.
options
Object
Optional parameters.
'whiten': When set to 1 each of the
columns is first normalized by its stdv - default
is not to whiten.
'max_iters': Number of times to try to
hit the tolerance limit before giving up - default
is 10.
'num_tries': Number of times to run
the k-means algorithm with different randomly
selected starting points - helps avoid local
minima. Default is 1.
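A sketch using the parameters above (names and values are illustrative):

// Find k=3 cluster centers over columns 'x' and 'y', stopping once
// successive centers move less than 0.01; retry from 5 random starts.
gpudb.aggregate_k_means("my_table", ["x", "y"], 3, 0.01,
    { num_tries: "5" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });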
This endpoint runs the k-means algorithm - a heuristic algorithm that
attempts to do k-means clustering. An ideal k-means clustering algorithm
selects k points such that the sum of the mean squared distances of each
member of the set to the nearest of the k points is minimized. The k-means
algorithm however does not necessarily produce such an ideal cluster. It
begins with a randomly selected set of k points and then refines the
location of the points iteratively and settles to a local minimum. Various
parameters and options are provided to control the heuristic search.
NOTE: The Kinetica instance being accessed must be running a CUDA
(GPU-based) build to service this request.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates the requested statistics of the given column(s) in a given table.
The available statistics are count (number of total objects),
mean, stdv (standard deviation),
variance, skew, kurtosis,
sum, min, max,
weighted_average, cardinality (unique count),
estimated_cardinality, percentile and
percentile_rank.
Estimated cardinality is calculated by using the hyperloglog approximation
technique.
Percentiles and percentile ranks are approximate and are calculated using
the t-digest algorithm. They must include the desired
percentile/percentile_rank. To compute multiple
percentiles each value must be specified separately (e.g.
'percentile(75.0),percentile(99.0),percentile_rank(1234.56),percentile_rank(-5)').
A second, comma-separated value can be added to the percentile
statistic to calculate percentile resolution, e.g., a 50th percentile with
200 resolution would be 'percentile(50,200)'.
The weighted average statistic requires a weight_column_name to
be specified in options. The weighted average is then defined
as the sum of the products of column_name times the
weight_column_name values divided by the sum of the
weight_column_name values.
Additional columns can be used in the calculation of statistics via the
additional_column_names option. Values in these columns will
be included in the overall aggregate calculation--individual aggregates will
not be calculated per additional column. For instance, requesting the
count & mean of column_name x and
additional_column_names y & z, where x holds the numbers 1-10,
y holds 11-20, and z holds 21-30, would return the total number of x, y, & z
values (30), and the single average value across all x, y, & z values
(15.5).
The response includes a list of key/value pairs of each statistic requested
and its corresponding value.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the statistics
operation will be performed.
column_name
String
Name of the primary column for which the
statistics are to be calculated.
stats
String
Comma separated list of the statistics to calculate,
e.g. "sum,mean".
Supported values:
'count': Number of objects (independent
of the given column(s)).
'mean': Arithmetic mean (average),
equivalent to sum/count.
'stdv': Sample standard deviation
(denominator is count-1).
'variance': Unbiased sample variance
(denominator is count-1).
'weighted_average': Weighted arithmetic
mean (using the option
weight_column_name as the weighting
column).
'cardinality': Number of unique values
in the column(s).
'estimated_cardinality': Estimate (via
hyperloglog technique) of the number of unique values
in the column(s).
'percentile': Estimate (via t-digest) of
the given percentile of the column(s)
(percentile(50.0) will be an approximation of the
median). Add a second, comma-separated value to
calculate percentile resolution, e.g.,
'percentile(75,150)'
'percentile_rank': Estimate (via
t-digest) of the percentile rank of the given value
in the column(s) (if the given value is the median of
the column(s), percentile_rank() will return
approximately 50.0).
options
Object
Optional parameters.
'additional_column_names': A list of
comma separated column names over which statistics
can be accumulated along with the primary column.
All columns listed and column_name
must be of the same type. Must not include the
column specified in column_name and no
column can be listed twice.
'weight_column_name': Name of column
used as weighting attribute for the weighted
average statistic.
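A sketch requesting several statistics at once (names are placeholders):

// Count, mean, standard deviation, and a median estimate of column 'x'.
gpudb.aggregate_statistics("my_table", "x",
    "count,mean,stdv,percentile(50.0)", {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.stats);  // key/value pairs, per the description
    });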
Divides the given set into bins and calculates statistics of the values of a
value-column in each bin. The bins are based on the values of a given
binning-column. The statistics that may be requested are mean, stdv
(standard deviation), variance, skew, kurtosis, sum, min, max, first, last
and weighted average. In addition to the requested statistics the count of
total samples in each bin is returned. This counts vector is just the
histogram of the column used to divide the set members into bins. The
weighted average statistic requires a weight_column to be specified in
options. The weighted average is then defined as the sum of the
products of the value column times the weight column divided by the sum of
the weight column.
There are two methods for binning the set members. In the first, which can
be used for numeric valued binning-columns, a min, max and interval are
specified. The number of bins, nbins, is the integer upper bound of
(max-min)/interval. Values that fall in the range
[min+n*interval, min+(n+1)*interval) are placed in the nth bin where n ranges
from 0..nbins-2. The final bin is [min+(nbins-1)*interval, max]. In the second
method, options bin_values specifies a list of binning column
values. Binning-columns whose value matches the nth member of the bin_values
list are placed in the nth bin. When a list is provided the binning-column
must be of type string or int.
NOTE: The Kinetica instance being accessed must be running a CUDA
(GPU-based) build to service this request.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the ranged-statistics
operation will be performed.
select_expression
String
For a non-empty expression statistics are
calculated for those records for which
the expression is true.
column_name
String
Name of the binning-column used to divide the
set samples into bins.
value_column_name
String
Name of the value-column for which
statistics are to be computed.
stats
String
A comma-separated list of the statistics to
calculate, e.g. 'sum,mean'. Available statistics:
mean, stdv (standard deviation), variance, skew,
kurtosis, sum.
start
Number
The lower bound of the binning-column.
end
Number
The upper bound of the binning-column.
interval
Number
The interval of a bin. Set members fall into bin i
if the binning-column falls in the range
[start+interval*i, start+interval*(i+1)).
options
Object
Map of optional parameters:
'additional_column_names': A list of
comma separated value-column names over which
statistics can be accumulated along with the
primary value_column.
'bin_values': A list of comma
separated binning-column values. Values that match
the nth bin_values value are placed in the nth bin.
'weight_column_name': Name of the
column used as weighting column for the
weighted_average statistic.
'order_column_name': Name of the
column used for candlestick charting techniques.
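A sketch of a ranged-statistics call (all names and bounds are illustrative):

// Mean and sum of 'z' per bin, binning on 'x' over [0, 100) in steps
// of 10, with no pre-filter (empty select_expression).
gpudb.aggregate_statistics_by_range("my_table", "", "x", "z",
    "mean,sum", 0, 100, 10, {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });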
Divides the given set into bins and calculates statistics of the values of a
value-column in each bin. The bins are based on the values of a given
binning-column. The statistics that may be requested are mean, stdv
(standard deviation), variance, skew, kurtosis, sum, min, max, first, last
and weighted average. In addition to the requested statistics the count of
total samples in each bin is returned. This counts vector is just the
histogram of the column used to divide the set members into bins. The
weighted average statistic requires a weight_column to be specified in
options. The weighted average is then defined as the sum of the
products of the value column times the weight column divided by the sum of
the weight column.
There are two methods for binning the set members. In the first, which can
be used for numeric valued binning-columns, a min, max and interval are
specified. The number of bins, nbins, is the integer upper bound of
(max-min)/interval. Values that fall in the range
[min+n*interval, min+(n+1)*interval) are placed in the nth bin where n ranges
from 0..nbins-2. The final bin is [min+(nbins-1)*interval, max]. In the second
method, options bin_values specifies a list of binning column
values. Binning-columns whose value matches the nth member of the bin_values
list are placed in the nth bin. When a list is provided the binning-column
must be of type string or int.
NOTE: The Kinetica instance being accessed must be running a CUDA
(GPU-based) build to service this request.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates the requested statistics of the given column(s) in a given table.
The available statistics are count (number of total objects),
mean, stdv (standard deviation),
variance, skew, kurtosis,
sum, min, max,
weighted_average, cardinality (unique count),
estimated_cardinality, percentile and
percentile_rank.
Estimated cardinality is calculated by using the hyperloglog approximation
technique.
Percentiles and percentile ranks are approximate and are calculated using
the t-digest algorithm. They must include the desired
percentile/percentile_rank. To compute multiple
percentiles each value must be specified separately (e.g.
'percentile(75.0),percentile(99.0),percentile_rank(1234.56),percentile_rank(-5)').
A second, comma-separated value can be added to the percentile
statistic to calculate percentile resolution, e.g., a 50th percentile with
200 resolution would be 'percentile(50,200)'.
The weighted average statistic requires a weight_column_name to
be specified in options. The weighted average is then defined
as the sum of the products of column_name times the
weight_column_name values divided by the sum of the
weight_column_name values.
Additional columns can be used in the calculation of statistics via the
additional_column_names option. Values in these columns will
be included in the overall aggregate calculation--individual aggregates will
not be calculated per additional column. For instance, requesting the
count & mean of column_name x and
additional_column_names y & z, where x holds the numbers 1-10,
y holds 11-20, and z holds 21-30, would return the total number of x, y, & z
values (30), and the single average value across all x, y, & z values
(15.5).
The response includes a list of key/value pairs of each statistic requested
and its corresponding value.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Returns all the unique values from a particular column (specified by
column_name) of a particular table or collection (specified by
table_name). If column_name is a numeric column
the values will be in binary_encoded_response. Otherwise if
column_name is a string column the values will be in
json_encoded_response. The results can be paged via the
offset and limit parameters.
Columns marked as store-only are unable to be used with this function.
To get the first 10 unique values sorted in descending order, the inputs
would be: limit=10, options={"sort_order":"descending"}.
If a result_table name is specified in the
options, the results are stored in a new table with that
name--no results are returned in the response. Both the table name and
resulting column name must adhere to standard naming
conventions; any column expression will need to be aliased. If the
source table's shard key is used as the column_name, the
result table will be sharded; in all other cases it will be replicated.
Sorting will properly function only if the result table is replicated or if
there is only one processing node and should not be relied upon in other
cases. Not available if table_name is a collection or when the
value of column_name is an unrestricted-length string.
Parameters:
Name
Type
Description
table_name
String
Name of an existing table/collection on which
the operation will be performed.
column_name
String
Name of the column or an expression containing
one or more column names on which the unique
function would be applied.
offset
Number
A positive integer indicating the number of initial
results to skip (this can be useful for paging
through the results).
limit
Number
A positive integer indicating the maximum number of
results to be returned, or END_OF_SET (-9999) to
indicate that the max number of results should be
returned.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the table specified
in result_table. If the collection
provided is non-existent, the collection will be
automatically created. If empty, then the table
will be a top-level table. Additionally this
option is invalid if table_name is a
collection.
'expression': Optional filter
expression to apply to the table.
'sort_order': String indicating how
the returned values should be sorted.
Supported values:
'ascending'
'descending'
The default value is 'ascending'.
'result_table': The name of the table
used to store the results. If present, no results
are returned in the response. Has the same naming
restrictions as tables. Not available if
table_name is a collection or when
column_name is an unrestricted-length
string.
'result_table_persist': If
true, then the result table specified
in result_table will be persisted and
will not expire unless a ttl is
specified. If false, then the result
table will be an in-memory table and will expire
unless a ttl is specified otherwise.
Supported values:
'true'
'false'
The default value is 'false'.
'result_table_force_replicated': Force
the result table to be replicated (ignores any
sharding). Must be used in combination with the
result_table option.
Supported values:
'true'
'false'
The default value is 'false'.
'result_table_generate_pk': If 'true'
then set a primary key for the result table. Must
be used in combination with the
result_table option.
Supported values:
'true'
'false'
The default value is 'false'.
'ttl': Sets the TTL of the table specified in
result_table.
'chunk_size': Indicates the chunk size
to be used for the result table. Must be used in
combination with the result_table
option.
'view_id': ID of the view this result table is
part of. The default value is ''.
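A sketch of the descending-order example above ("my_table" and 'x' are placeholders):

// First 10 unique values of 'x', sorted descending.
gpudb.aggregate_unique("my_table", "x", 0, 10,
    { sort_order: "descending" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });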
Returns all the unique values from a particular column (specified by
column_name) of a particular table or collection (specified by
table_name). If column_name is a numeric column
the values will be in binary_encoded_response. Otherwise if
column_name is a string column the values will be in
json_encoded_response. The results can be paged via the
offset and limit parameters.
Columns marked as store-only are unable to be used with this function.
To get the first 10 unique values sorted in descending order, the inputs
would be: limit=10, options={"sort_order":"descending"}.
If a result_table name is specified in the
options, the results are stored in a new table with that
name--no results are returned in the response. Both the table name and
resulting column name must adhere to standard naming
conventions; any column expression will need to be aliased. If the
source table's shard key is used as the column_name, the
result table will be sharded; in all other cases it will be replicated.
Sorting will properly function only if the result table is replicated or if
there is only one processing node and should not be relied upon in other
cases. Not available if table_name is a collection or when the
value of column_name is an unrestricted-length string.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Unpivot is used to normalize tables that are built for cross tabular
reporting purposes. The unpivot operator rotates the column values for all
the pivoted columns. A variable column, value column and all columns from
the source table except the unpivot columns are projected into the result
table. The variable column and value columns in the result table indicate
the pivoted column name and values respectively.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the operation will be
performed. Must be an existing table/view.
column_names
Array.<String>
List of column names or expressions. A
wildcard '*' can be used to include all the
non-pivoted columns from the source table.
variable_column_name
String
Specifies the variable/parameter
column name.
value_column_name
String
Specifies the value column name.
pivoted_columns
Array.<String>
List of one or more values typically the
column names of the input table. All the
columns in the source table must have the
same data type.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the table specified
in result_table. If the collection
provided is non-existent, the collection will be
automatically created. If empty, then the table
will be a top-level table.
'result_table': The name of the table
used to store the results. Has the same naming
restrictions as tables. If present, no results
are returned in the response.
'result_table_persist': If
true, then the result table specified
in result_table will be persisted and
will not expire unless a ttl is
specified. If false, then the result
table will be an in-memory table and will expire
unless a ttl is specified otherwise.
Supported values:
'true'
'false'
The default value is 'false'.
'expression': Filter expression to
apply to the table prior to unpivot processing.
'order_by': Comma-separated list of
the columns to be sorted by; e.g. 'timestamp asc, x
desc'. The columns specified must be present in
input table. If any alias is given for any column
name, the alias must be used, rather than the
original column name. The default value is ''.
'chunk_size': Indicates the chunk size
to be used for the result table. Must be used in
combination with the result_table
option.
'limit': The number of records to
keep. The default value is ''.
'ttl': Sets the TTL of the table specified in
result_table.
'view_id': ID of the view this result table is
part of. The default value is ''.
'materialize_on_gpu': If
true then the output columns will be
cached on the GPU.
Supported values:
'true'
'false'
The default value is 'false'.
'create_indexes': Comma-separated list
of columns on which to create indexes on the table
specified in result_table. The columns
specified must be present in output column names.
If any alias is given for any column name, the
alias must be used, rather than the original column
name.
'result_table_force_replicated': Force
the result table to be replicated (ignores any
sharding). Must be used in combination with the
result_table option.
Supported values:
'true'
'false'
The default value is 'false'.
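A sketch assuming the positional form aggregate_unpivot(table_name, column_names, variable_column_name, value_column_name, pivoted_columns, options, callback); all names are hypothetical:

// Rotate quarterly sales columns into (quarter, sales) rows.
gpudb.aggregate_unpivot("sales_table", ["*"], "quarter", "sales",
    ["q1", "q2", "q3", "q4"], {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });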
Unpivot is used to normalize tables that are built for cross tabular
reporting purposes. The unpivot operator rotates the column values for all
the pivoted columns. A variable column, value column and all columns from
the source table except the unpivot columns are projected into the result
table. The variable column and value columns in the result table indicate
the pivoted column name and values respectively.
The GPUdb#alter_system_properties endpoint is primarily used
to simplify the testing of the system and is not expected to be used during
normal execution. Commands are given through the
property_updates_map whose keys are commands and values are
strings representing integer values (for example '8000') or boolean values
('true' or 'false').
Parameters:
Name
Type
Description
property_updates_map
Object
Map containing the properties of the
system to be updated. Error if empty.
'sm_omp_threads': Set the
number of OpenMP threads that will be
used to service filter & aggregation
requests against collections to the
specified integer value.
'kernel_omp_threads': Set
the number of kernel OpenMP threads to
the specified integer value.
'concurrent_kernel_execution': Enables
concurrent kernel execution if the
value is true and
disables it if the value is
false.
Supported values:
'true'
'false'
'chunk_size': Sets the
chunk size of all new sets to the
specified integer value.
'execution_mode': Sets
the execution_mode for kernel
executions to the specified string
value. Possible values are host,
device, default (engine decides) or an
integer value that indicates max chunk
size to exec on host
'flush_to_disk': Flushes
any changes to any tables to the
persistent store. These changes
include updates to the vector store,
object store, and text search store. Value
string is ignored.
'clear_cache': Clears
cached results. Useful to allow
repeated timing of endpoints. Value
string is ignored.
'communicator_test':
Invoke the communicator test and
report timing results. Value string is
a comma-separated list of
<key>=<value> expressions. Expressions are:
num_transactions=<num> where num is
the number of request reply
transactions to invoke per test;
message_size=<bytes> where bytes is
the size of the messages to send in
bytes; check_values=<enabled> where if
enabled is true the value of the
messages received are verified.
'set_message_timers_enabled': Enables
the communicator test to collect
additional timing statistics when the
value string is true.
Disables the collection when the value
string is false
Supported values:
'true'
'false'
'bulk_add_test': Invoke
the bulk add test and report timing
results. Value string is ignored.
'network_speed': Invoke
the network speed test and report
timing results. Value string is a
semicolon-separated list of
<key>=<value> expressions. Valid
expressions are: seconds=
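A sketch issuing one of the commands above (an options argument following the map is assumed, matching this API's usual shape):

// Flush pending table changes to the persistent store; the value
// string is ignored for this command.
gpudb.alter_system_properties({ flush_to_disk: "true" }, {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });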
The GPUdb#alter_system_properties endpoint is primarily used
to simplify the testing of the system and is not expected to be used during
normal execution. Commands are given through the
property_updates_map whose keys are commands and values are
strings representing integer values (for example '8000') or boolean values
('true' or 'false').
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Apply various modifications to a table, view, or collection. The available
modifications include the following:
Create or delete an index on a
particular column. This can speed up certain operations when using
expressions
containing equality or relational operators on indexed columns. This only
applies to tables.
Set the time-to-live
(TTL). This can be applied
to tables, views, or collections. When applied to collections, every
contained
table & view that is not protected will have its TTL set to the given value.
Set the global access mode (i.e. locking) for a table. This setting trumps
any
role-based access controls that may be in place; e.g., a user with write
access
to a table marked read-only will not be able to insert records into it. The
mode
can be set to read-only, write-only, read/write, and no access.
Change the protection mode to prevent or
allow automatic expiration. This can be applied to tables, views, and
collections.
Allow homogeneous tables within a collection.
Manage a table's columns--a column can be added, removed, or have its
type and properties
modified.
Parameters:
Name
Type
Description
table_name
String
Table on which the operation will be performed.
Must be an existing table, view, or collection.
action
String
Modification operation to be applied
Supported values:
'allow_homogeneous_tables': Sets
whether homogeneous tables are allowed in the given
collection. This action is only valid if
table_name is a collection. The
value must be either 'true' or 'false'.
'create_index': Creates an index on the column name specified
in value. If this column is already
indexed, an error will be returned.
'delete_index': Deletes an existing index on the column name specified
in value. If this column does not have
indexing turned on, an error will be returned.
'move_to_collection': Moves a table
into a collection value.
'protected': Sets whether the given
table_name should be protected or not. The
value must be either 'true' or 'false'.
'rename_table': Renames a table, view
or collection to value. Has the same
naming restrictions as tables.
'ttl': Sets the time-to-live in minutes of the
table, view, or collection specified in
table_name.
'memory_ttl': Sets the time-to-live in
minutes for the individual chunks of the columns of
the table, view, or collection specified in
table_name to free their memory if
unused longer than the given time. Specify an empty
string to restore the global memory_ttl setting and
a value of '-1' for an infinite timeout.
'add_column': Adds the column specified
in value to the table specified in
table_name. Use
column_type and
column_properties in
options to set the column's type and
properties, respectively.
'change_column': Changes type and
properties of the column specified in
value. Use column_type
and column_properties in
options to set the column's type and
properties, respectively. Note that primary key
and/or shard key columns cannot be changed. All
unchanging column properties must be listed for the
change to take place, e.g., to add dictionary
encoding to an existing 'char4' column, both 'char4'
and 'dict' must be specified in the
options map.
'set_column_compression': Modifies the
compression setting on the column
specified in value.
'delete_column': Deletes the column
specified in value from the table
specified in table_name.
'create_foreign_key': Creates a foreign key using the format
'(source_column_name [, ...]) references
target_table_name(primary_key_column_name [, ...])
[as foreign_key_name]'.
'delete_foreign_key': Deletes a foreign key. The
value should be the foreign_key_name
specified when creating the key or the complete
string used to define it.
'set_global_access_mode': Sets the
global access mode (i.e. locking) for the table
specified in table_name. Specify the
access mode in value. Valid modes are
'no_access', 'read_only', 'write_only' and
'read_write'.
'refresh': Replays all the table
creation commands required to create this materialized view.
'set_refresh_method': Sets the method
by which this materialized view is refreshed -
one of 'manual', 'periodic', 'on_change'.
'set_refresh_start_time': Sets the time
to start periodic refreshes of this materialized view to datetime
string with format 'YYYY-MM-DD HH:MM:SS'.
Subsequent refreshes occur at the specified time + N
* the refresh period.
'set_refresh_period': Sets the time
interval in seconds at which to refresh this materialized view. Also, sets the
refresh method to periodic if not already set.
'remove_text_search_attributes': Removes the
text_search attribute from all columns, if it exists.
value
String
The value of the modification. May be a column name,
'true' or 'false', a TTL, or the global access mode
depending on action.
options
Object
Optional parameters.
'column_default_value': When adding a
column, set a default value for existing records.
For nullable columns, the default value will be
null, regardless of data type.
'column_properties': When adding or
changing a column, set the column properties
(strings, separated by a comma: data, store_only,
text_search, char8, int8 etc).
'column_type': When adding or changing
a column, set the column type (strings, separated
by a comma: int, double, string, null etc).
'compression_type': When setting
column compression
(set_column_compression for
action), compression type to use:
none (to use no compression) or a
valid compression type.
Supported values:
'none'
'snappy'
'lz4'
'lz4hc'
The default value is 'snappy'.
'copy_values_from_column': Deprecated; please see
add_column_expression instead.
'rename_column': When changing a
column, specify new column name.
'validate_change_column': When
changing a column, validate the change before
applying it. If true, then validate
all values. A value too large (or too long) for the
new type will prevent any change. If
false, then when a value is too large
or long, it will be truncated.
Supported values:
'true': true
'false': false
The default value is 'true'.
'update_last_access_time': Indicates
whether need to update the last_access_time.
Supported values:
'true'
'false'
The default value is 'true'.
'add_column_expression': An expression for the new
column's values (optional with add_column). Any
valid expression may be used, including one
referencing existing columns.
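A sketch of a single modification ("my_table" and 'x' are placeholders):

// Index column 'x' of "my_table" to speed up filters against it.
gpudb.alter_table("my_table", "create_index", "x", {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });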
Updates (adds or changes) metadata for tables. The metadata key and values
must both be strings. This is an easy way to annotate whole tables rather
than single records within tables. Some examples of metadata are the owner of
the table, the table creation timestamp, etc.
Parameters:
Name
Type
Description
table_names
Array.<String>
Names of the tables whose metadata will be
updated. All specified tables must exist, or
an error will be returned.
metadata_map
Object
A map which contains the metadata of the
tables that are to be updated. Note that only
one map is provided for all the tables; so the
change will be applied to every table. If the
provided map is empty, then all existing
metadata for the table(s) will be cleared.
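A sketch applying one metadata map to two tables (names and values are hypothetical; a trailing options argument is assumed):

gpudb.alter_table_metadata(["table_a", "table_b"],
    { owner: "analytics_team" }, {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });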
Updates (adds or changes) metadata for tables. The metadata key and values
must both be strings. This is an easy way to annotate whole tables rather
than single records within tables. Some examples of metadata are the owner of
the table, the table creation timestamp, etc.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Response object containing the results of the
operation.
Type
Object
alter_table_request(request, callback) → {Object}
Apply various modifications to a table, view, or collection. The available
modifications include the following:
Create or delete an index on a
particular column. This can speed up certain operations when using
expressions
containing equality or relational operators on indexed columns. This only
applies to tables.
Set the time-to-live
(TTL). This can be applied
to tables, views, or collections. When applied to collections, every
contained
table & view that is not protected will have its TTL set to the given value.
Set the global access mode (i.e. locking) for a table. This setting trumps
any
role-based access controls that may be in place; e.g., a user with write
access
to a table marked read-only will not be able to insert records into it. The
mode
can be set to read-only, write-only, read/write, and no access.
Change the protection mode to prevent or
allow automatic expiration. This can be applied to tables, views, and
collections.
Allow homogeneous tables within a collection.
Manage a table's columns--a column can be added, removed, or have its
type and properties
modified.
Append (or insert) all records from a source table (specified by
source_table_name) to a particular target table (specified by
table_name). The field map (specified by
field_map) holds the user specified map of target table column
names with their mapped source column names.
Parameters:
Name
Type
Description
table_name
String
The table name for the records to be appended.
Must be an existing table.
source_table_name
String
The source table name to get records
from. Must be an existing table name.
field_map
Object
Contains the mapping of column names from the
target table (specified by
table_name) as the keys, and
corresponding column names or expressions (e.g.,
'col_name+1') from the source table (specified by
source_table_name). Must be existing
column names in source table and target table,
and their types must match. For details on
using expressions, see Expressions.
options
Object
Optional parameters.
'offset': A positive integer
indicating the number of initial results to skip
from source table (specified by
source_table_name). Default is 0. The
minimum allowed value is 0. The maximum allowed
value is MAX_INT. The default value is '0'.
'limit': A positive integer indicating
the maximum number of results to be returned from
source table (specified by
source_table_name). Or END_OF_SET
(-9999) to indicate that the max number of results
should be returned. The default value is '-9999'.
'expression': Optional filter
expression to apply to the source table (specified
by source_table_name). Empty by
default. The default value is ''.
'order_by': Comma-separated list of
the columns and expressions to be sorted by from
the source table (specified by
source_table_name); e.g. 'timestamp
asc, x desc'. The order_by columns do
not have to be present in field_map.
The default value is ''.
'update_on_existing_pk': Specifies the
record collision policy for inserting the source
table records (specified by
source_table_name) into the target
table (specified by table_name) table
with a primary key. If set to
true, any existing target table record
with primary key values that match those of a
source table record being inserted will be replaced
by that new record. If set to false,
any existing target table record with primary key
values that match those of a source table record
being inserted will remain unchanged and the new
record discarded. If the specified table does not
have a primary key, then this option is ignored.
Supported values:
'true'
'false'
The default value is 'false'.
'truncate_strings': If set to
true, it allows inserting unrestricted
length strings into charN string columns by
truncating the unrestricted length strings to fit.
Supported values:
'true'
'false'
The default value is 'false'.
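A sketch of a filtered append (all table and column names are placeholders):

// Copy matching rows from "source_table" into "target_table", mapping
// target column 'id' from source column 's_id' and 'val' from an
// expression.
gpudb.append_records("target_table", "source_table",
    { id: "s_id", val: "val+1" },
    { expression: "val > 0" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });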
Append (or insert) all records from a source table (specified by
source_table_name) to a particular target table (specified by
table_name). The field map (specified by
field_map) holds the user specified map of target table column
names with their mapped source column names.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Clears (drops) one or all tables in the database cluster. The operation is
synchronous meaning that the table will be cleared before the function
returns. The response payload returns the status of the operation along with
the name of the table that was cleared.
Parameters:
Name
Type
Description
table_name
String
Name of the table to be cleared. Must be an
existing table. Empty string clears all
available tables, though this behavior is
prevented by default via the gpudb.conf parameter
'disable_clear_all'.
authorization
String
No longer used. User can pass an empty
string.
options
Object
Optional parameters.
'no_error_if_not_exists': If
true and if the table specified in
table_name does not exist no error is
returned. If false and if the table
specified in table_name does not exist
then an error is returned.
Supported values:
'true'
'false'
The default value is 'false'.
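A sketch dropping a single table ("my_table" is a placeholder):

// Drop "my_table", tolerating its absence; authorization is unused.
gpudb.clear_table("my_table", "",
    { no_error_if_not_exists: "true" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });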
Response object containing the results of the
operation.
Type
Object
clear_table_request(request, callback) → {Object}
Clears (drops) one or all tables in the database cluster. The operation is
synchronous meaning that the table will be cleared before the function
returns. The response payload returns the status of the operation along with
the name of the table that was cleared.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Clears or cancels the trigger identified by the specified handle. The output
returns the handle of the trigger cleared as well as indicating success or
failure of the trigger deactivation.
Clears or cancels the trigger identified by the specified handle. The output
returns the handle of the trigger cleared as well as indicating success or
failure of the trigger deactivation.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
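A sketch assuming the positional form clear_trigger(trigger_id, options, callback), with a hypothetical handle:

gpudb.clear_trigger("trigger_handle_123", {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);
    });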
Create a job which will run asynchronously. The response returns a job ID,
which can be used to query the status and result of the job. The status and
the result of the job upon completion can be requested by
GPUdb#get_job.
Parameters:
Name
Type
Description
endpoint
String
Indicates which endpoint to execute, e.g.
'/alter/table'.
request_encoding
String
The encoding of the request payload for
the job.
Supported values:
'binary'
'json'
'snappy'
The default value is 'binary'.
data
String
Binary-encoded payload for the job to be run
asynchronously. The payload must contain the relevant
input parameters for the endpoint indicated in
endpoint. Please see the documentation
for the appropriate endpoint to see what values must
(or can) be specified. If this parameter is used,
then request_encoding must be
binary or snappy.
data_str
String
JSON-encoded payload for the job to be run
asynchronously. The payload must contain the
relevant input parameters for the endpoint
indicated in endpoint. Please see
the documentation for the appropriate endpoint to
see what values must (or can) be specified. If
this parameter is used, then
request_encoding must be
json.
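A sketch submitting an /alter/table call as an asynchronous job (payload contents are illustrative; the trailing options argument is assumed):

// JSON-encoded payload, so data is left empty and data_str is used.
var payload = JSON.stringify({
    table_name: "my_table", action: "ttl", value: "20", options: {}
});
gpudb.create_job("/alter/table", "json", "", payload, {},
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.job_id);  // poll later via get_job
    });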
Response object containing the results of the
operation.
Type
Object
create_job_request(request, callback) → {Object}
Create a job which will run asynchronously. The response returns a job ID,
which can be used to query the status and result of the job. The status and
the result of the job upon completion can be requested by
GPUdb#get_job.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Parameters:
Name
Type
Description
join_table_name
String
Name of the join table to be created. Has
the same naming restrictions as tables.
table_names
Array.<String>
The list of table names composing the join.
Corresponds to a SQL statement FROM clause.
column_names
Array.<String>
List of member table columns or column
expressions to be included in the join.
Columns can be prefixed with
'table_id.column_name', where 'table_id' is
the table name or alias. Columns can be
aliased via the syntax 'column_name as
alias'. Wild cards '*' can be used to
include all columns across member tables or
'table_id.*' for all of a single table's
columns. Columns and column expressions
composing the join must be uniquely named or
aliased--therefore, the '*' wild card cannot
be used if column names aren't unique across
all tables.
expressions
Array.<String>
An optional list of expressions to combine
and filter the joined tables. Corresponds to
a SQL statement WHERE clause. For details
see: expressions.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the join. If the
collection provided is non-existent, the collection
will be automatically created. If empty, then the
join will be at the top level. The default value
is ''.
'max_query_dimensions': The maximum
number of tables in a join that can be accessed by
a query and are not equated by a foreign-key to
primary-key equality predicate
'optimize_lookups': Use more memory to
speed up the joining of tables.
Supported values:
'true'
'false'
The default value is 'false'.
'refresh_method': Method by which the
join can be refreshed when the data in underlying
member tables have changed.
Supported values:
'manual': refresh only occurs when
manually requested by calling this endpoint with
refresh option set to refresh or
full_refresh
'on_query': incrementally refresh
(refresh just those records added) whenever a new
query is issued and new data is inserted into the
base table. A full refresh of all the records
occurs when a new query is issued and there have
been inserts to any non-base-tables since the last
query. TTL will be set to not expire;
any ttl specified will be ignored.
'on_insert': incrementally refresh
(refresh just those records added) whenever new
data is inserted into a base table. A full refresh
of all the records occurs when a new query is
issued and there have been inserts to any
non-base-tables since the last query. TTL will be set to not expire;
any ttl specified will be ignored.
The default value is 'manual'.
'refresh': Do a manual refresh of the
join if it exists - throws an error otherwise
Supported values:
'no_refresh': don't refresh
'refresh': incrementally refresh
(refresh just those records added) if new data has
been inserted into the base table. A full refresh
of all the records occurs if there have been
inserts to any non-base-tables since the last
refresh
'full_refresh': always refresh even if
no new records have been added. Only refresh
method guaranteed to do a full refresh (refresh all
the records) if a delete or update has occurred
since the last refresh.
The default value is 'no_refresh'.
'ttl': Sets the TTL of the join table specified
in join_table_name. Ignored if
refresh_method is either
on_insert or on_query.
'view_id': view this join table is
part of. The default value is ''.
'no_count': Return a count of 0 for
the join table for logging and for show_table; an
optimization needed for large, overlapped equi-join
stencils. The default value is 'false'.
'chunk_size': Maximum size of a
joined-chunk for this table. Defaults to the
gpudb.conf file chunk size
'allow_right_primary_key_join': When
true allows right joins from a key to a primary key
to be done as primary key joins. Such a join table
cannot be joined to other join tables. When false
the right join shall be done as an equi-join. The
default value is 'false'.
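By way of illustration, a call might look like the following sketch. The server URL, table names, and column names are hypothetical, and the Node-style (err, response) callback is an assumption about this API's callback convention:

var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_join_table(
    "customer_order_join",                  // join_table_name
    ["customer", "order"],                  // table_names (SQL FROM clause)
    ["customer.id as cust_id", "order.*"],  // column_names; aliased to stay unique
    ["customer.id = order.customer_id"],    // expressions (SQL WHERE clause)
    { "refresh_method": "on_query" },       // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);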
The response contains view_id, which is used to tag each
subsequent operation (projection, union, aggregation, filter, or join) that
will compose the view.
Parameters:
Name
Type
Description
table_name
String
Name of the table to be created that is the
top-level table of the materialized view.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created table will be a
top-level table.
'ttl': Sets the TTL of the table specified in
table_name.
'persist': If true, then
the materialized view specified in
table_name will be persisted and will
not expire unless a ttl is specified.
If false, then the materialized view
will be an in-memory table and will expire unless a
ttl is specified otherwise.
Supported values:
'true'
'false'
The default value is 'false'.
'refresh_method': Method by which the
materialized view can be refreshed when the data in
underlying member tables have changed.
Supported values:
'manual': Refresh only occurs when
manually requested by calling
GPUdb#alter_table with an 'action' of
'refresh'
'on_query': For future use.
'on_change': If possible,
incrementally refresh (refresh just those records
added) whenever an insert, update, delete or
refresh of input table is done. A full refresh is
done if an incremental refresh is not possible.
'periodic': Refresh table periodically
at rate specified by refresh_period
The default value is 'manual'.
'refresh_period': When
refresh_method is
periodic, specifies the period in
seconds at which refresh occurs
'refresh_start_time': When
refresh_method is
periodic, specifies the first time at
which a refresh is to be done. Value is a datetime
string with format 'YYYY-MM-DD HH:MM:SS'.
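A minimal sketch of creating a periodically refreshing materialized view; the table name is hypothetical and the gpudb instance follows the earlier sketch:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_materialized_view(
    "sales_mv",                                                // table_name (hypothetical)
    { "refresh_method": "periodic", "refresh_period": "60" },  // refresh every 60 seconds
    function(err, response) {
        if (!err) { console.log(response.view_id); }  // tags subsequent operations on the view
    }
);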
The response contains view_id, which is used to tag each
subsequent operation (projection, union, aggregation, filter, or join) that
will compose the view.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Creates an instance (proc) of the user-defined function (UDF) specified by
the given command, options, and files, and makes it available for execution.
For details on UDFs, see: User-Defined Functions
Parameters:
Name
Type
Description
proc_name
String
Name of the proc to be created. Must not be the
name of a currently existing proc.
execution_mode
String
The execution mode of the proc.
Supported values:
'distributed': Input table data
will be divided into data segments that are
distributed across all nodes in the cluster,
and the proc command will be invoked once
per data segment in parallel. Output table
data from each invocation will be saved to
the same node as the corresponding input
data.
'nondistributed': The proc
command will be invoked only once per
execution, and will not have access to any
input or output table data.
The default value is 'distributed'.
files
Object
A map of the files that make up the proc. The keys of
the map are file names, and the values are the binary
contents of the files. The file names may include
subdirectory names (e.g. 'subdir/file') but must not
resolve to a directory above the root for the proc.
command
String
The command (excluding arguments) that will be
invoked when the proc is executed. It will be
invoked from the directory containing the proc
files and may be any command that can
be resolved from that directory. It need not refer
to a file actually in that directory; for example,
it could be 'java' if the proc is a Java
application; however, any necessary external
programs must be preinstalled on every database
node. If the command refers to a file in that
directory, it must be preceded with './' as per
Linux convention. If not specified, and exactly one
file is provided in files, that file
will be invoked.
args
Array.<String>
An array of command-line arguments that will be
passed to command when the proc is
executed.
options
Object
Optional parameters.
'max_concurrency_per_node': The
maximum number of concurrent instances of the proc
that will be executed per node. 0 allows unlimited
concurrency. The default value is '0'.
Response object containing the method_codes of the
operation.
Type
Object
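A hedged sketch of registering a distributed UDF; the proc name, file name, and the use of Node's fs module to read the file contents are assumptions:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
var procSource = require("fs").readFileSync("sum_proc.py");  // binary file contents
gpudb.create_proc(
    "sum_proc",                     // proc_name; must not already exist
    "distributed",                  // execution_mode
    { "sum_proc.py": procSource },  // files: file name -> contents
    "python",                       // command; python must be preinstalled on every node
    ["sum_proc.py"],                // args passed to the command
    {},                             // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);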
create_proc_request(request, callback) → {Object}
Creates an instance (proc) of the user-defined function (UDF) specified by
the given command, options, and files, and makes it available for execution.
For details on UDFs, see: User-Defined Functions
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
A projection can be created with a different shard key
than the source table. By specifying shard_key, the projection
will be sharded according to the specified columns, regardless of how the
source table is sharded. The source table can even be unsharded or
replicated.
Parameters:
Name
Type
Description
table_name
String
Name of the existing table on which the
projection is to be applied.
projection_name
String
Name of the projection to be created. Has
the same naming restrictions as tables.
column_names
Array.<String>
List of columns from table_name
to be included in the projection. Can
include derived columns. Columns can be
aliased via the syntax 'column_name as
alias'.
options
Object
Optional parameters.
'collection_name': Name of a collection to which the
projection is to be assigned as a child. If the
collection provided is non-existent, the collection
will be automatically created. If empty, then the
projection will be at the top level. The default
value is ''.
'expression': An optional filter expression to be applied to the
source table prior to the projection. The default
value is ''.
'is_replicated': If true
then the projection will be replicated even if the
source table is not.
Supported values:
'true'
'false'
The default value is 'false'.
'limit': The number of records to
keep. The default value is ''.
'order_by': Comma-separated list of
the columns to be sorted by; e.g. 'timestamp asc, x
desc'. The columns specified must be present in
column_names. If any alias is given
for any column name, the alias must be used, rather
than the original column name. The default value
is ''.
'materialize_on_gpu': If
true then the columns of the
projection will be cached on the GPU.
Supported values:
'true'
'false'
The default value is 'false'.
'chunk_size': Indicates the chunk size
to be used for this table.
'create_indexes': Comma-separated list
of columns on which to create indexes on the output
table. The columns specified must be present in
column_names. If any alias is given
for any column name, the alias must be used, rather
than the original column name.
'ttl': Sets the TTL of the projection specified
in projection_name.
'shard_key': Comma-separated list of
the columns to be sharded on; e.g. 'column1,
column2'. The columns specified must be present in
column_names. If any alias is given
for any column name, the alias must be used, rather
than the original column name. The default value
is ''.
'persist': If true, then
the projection specified in
projection_name will be persisted and
will not expire unless a ttl is
specified. If false, then the
projection will be an in-memory table and will
expire unless a ttl is specified
otherwise.
Supported values:
'true'
'false'
The default value is 'false'.
'preserve_dict_encoding': If
true, then columns that were dict
encoded in the source table will be dict encoded in
the projection table.
Supported values:
'true'
'false'
The default value is 'false'.
'view_id': view this projection is
part of. The default value is ''.
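A brief sketch of a resharded, persisted projection; table and column names are hypothetical:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_projection(
    "taxi_trips",                                     // table_name (hypothetical)
    "trips_by_vendor",                                // projection_name
    ["vendor_id", "fare_amount as fare"],             // column_names, one aliased
    { "shard_key": "vendor_id", "persist": "true" },  // reshard and persist the result
    function(err, response) {
        if (err) { console.error(err); }
    }
);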
A projection can be created with a different shard key
than the source table. By specifying shard_key, the projection
will be sharded according to the specified columns, regardless of how the
source table is sharded. The source table can even be unsharded or
replicated.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Response object containing the method_codes of the
operation.
Type
Object
create_role(name, options, callback) → {Object}
Creates a new role.
Parameters:
Name
Type
Description
name
String
Name of the role to be created. Must contain only
lowercase letters, digits, and underscores, and cannot
begin with a digit. Must not be the same name as an
existing user or role.
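For example (role name hypothetical):

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_role(
    "analyst",   // name: lowercase letters, digits, and underscores only
    {},          // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);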
Creates a new table or collection. If a new table is being created, the type
of the table is given by type_id, which must be the ID of a
currently registered type (i.e. one created via
GPUdb#create_type). The table will be created inside a
collection if the option collection_name is specified. If that
collection does not already exist, it will be created.
To create a new collection, specify the name of the collection in
table_name and set the is_collection option to
true; type_id will be ignored.
Parameters:
Name
Type
Description
table_name
String
Name of the table to be created. Error for
requests with an existing table of the same name
and type ID may be suppressed by using the
no_error_if_exists option. See Tables for naming
restrictions.
type_id
String
ID of a currently registered type. All objects
added to the newly created table will be of this
type. Ignored if is_collection is
true.
options
Object
Optional parameters.
'no_error_if_exists': If
true, prevents an error from occurring
if the table already exists and is of the given
type. If a table with the same ID but a different
type exists, it is still an error.
Supported values:
'true'
'false'
The default value is 'false'.
'collection_name': Name of a
collection which is to contain the newly created
table. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created table will be a
top-level table.
'is_collection': Indicates whether the
new table to be created will be a collection.
Supported values:
'true'
'false'
The default value is 'false'.
'disallow_homogeneous_tables': For a
collection, indicates whether the collection
prohibits containment of multiple tables of exactly
the same data type.
Supported values:
'true'
'false'
The default value is 'false'.
'is_replicated': For a table,
indicates the distribution scheme for the
table's data. If true, the table will be replicated. If false, the table
will be sharded according to the shard key specified in the given
type_id, or randomly sharded, if no shard key
is specified.
Supported values:
'true'
'false'
The default value is 'false'.
'foreign_keys': Semicolon-separated
list of foreign keys, of the format
'(source_column_name [, ...]) references
target_table_name(primary_key_column_name [, ...])
[as foreign_key_name]'.
'foreign_shard_key': Foreign shard key
of the format 'source_column references
shard_by_column from
target_table(primary_key_column)'
'ttl': For a table, sets the TTL of the table specified in
table_name.
'chunk_size': Indicates the chunk size
to be used for this table.
'is_result_table': For a table,
indicates whether the table is an in-memory table.
A result table cannot contain store_only,
text_search, or string columns (charN columns are
acceptable), and it will not be retained if the
server is restarted.
Supported values:
'true'
'false'
The default value is 'false'.
Creates a monitor that watches for new records inserted into a particular
table (identified by table_name) and forwards copies to
subscribers via ZMQ. After this call completes, subscribe to the returned
topic_id on the ZMQ table monitor port (default 9002). Each
time an insert operation on the table completes, a multipart message is
published for that topic; the first part contains only the topic ID, and
each subsequent part contains one binary-encoded Avro object that was
inserted. The monitor will continue to run (regardless of whether or not
there are any subscribers) until deactivated with
GPUdb#clear_table_monitor.
Parameters:
Name
Type
Description
table_name
String
Name of the table to monitor. Must not refer to
a collection.
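A short sketch; the table name is hypothetical:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_table_monitor(
    "vehicle_locations",   // table_name (hypothetical); must not be a collection
    {},                    // options
    function(err, response) {
        // subscribe to response.topic_id on the ZMQ table monitor port (default 9002)
        if (!err) { console.log(response.topic_id); }
    }
);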
Creates a monitor that watches for new records inserted into a particular
table (identified by table_name) and forwards copies to
subscribers via ZMQ. After this call completes, subscribe to the returned
topic_id on the ZMQ table monitor port (default 9002). Each
time an insert operation on the table completes, a multipart message is
published for that topic; the first part contains only the topic ID, and
each subsequent part contains one binary-encoded Avro object that was
inserted. The monitor will continue to run (regardless of whether or not
there are any subscribers) until deactivated with
GPUdb#clear_table_monitor.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Creates a new table or collection. If a new table is being created, the type
of the table is given by type_id, which must be the ID of a
currently registered type (i.e. one created via
GPUdb#create_type). The table will be created inside a
collection if the option collection_name is specified. If that
collection does not already exist, it will be created.
To create a new collection, specify the name of the collection in
table_name and set the is_collection option to
true; type_id will be ignored.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
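A sketch of the named-parameter form; the table name is hypothetical and type_id is assumed to come from a prior create_type call:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
// and that type_id holds the ID returned by an earlier create_type call
gpudb.create_table(
    "weather",                         // table_name (hypothetical)
    type_id,                           // type_id of a currently registered type
    { "no_error_if_exists": "true" },  // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);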
Sets up an area trigger mechanism for two column_names for one or more
tables. (This function is essentially the two-dimensional version of
GPUdb#create_trigger_by_range.) Once the trigger has been
activated, any record added to the listed table(s) via
GPUdb#insert_records with the chosen columns' values falling
within the specified region will trip the trigger. All such records will be
queued at the trigger port (by default '9001' but able to be retrieved via
GPUdb#show_system_status) for any listening client to collect.
Active triggers can be cancelled by using the
GPUdb#clear_trigger endpoint or by clearing all relevant
tables.
The output returns the trigger handle as well as indicating success or
failure of the trigger activation.
Parameters:
Name
Type
Description
request_id
String
User-created ID for the trigger. The ID can be
alphanumeric, contain symbols, and must contain
at least one character.
table_names
Array.<String>
Names of the tables on which the trigger will
be activated and maintained.
x_column_name
String
Name of a numeric column on which the trigger
is activated. Usually 'x' for geospatial data
points.
x_vector
Array.<Number>
The respective coordinate values for the region
on which the trigger is activated. This usually
translates to the x-coordinates of a geospatial
region.
y_column_name
String
Name of a second numeric column on which the
trigger is activated. Usually 'y' for
geospatial data points.
y_vector
Array.<Number>
The respective coordinate values for the region
on which the trigger is activated. This usually
translates to the y-coordinates of a geospatial
region. Must be the same length as x_vector.
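A sketch with hypothetical table, columns, and polygon coordinates:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_trigger_by_area(
    "area_trigger_1",                    // request_id
    ["vehicle_locations"],               // table_names (hypothetical)
    "x", [-74.5, -74.0, -74.0, -74.5],   // x_column_name, x_vector (polygon x-coordinates)
    "y", [40.5, 40.5, 41.0, 41.0],       // y_column_name, y_vector (same length as x_vector)
    {},                                  // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);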
Sets up an area trigger mechanism for two column_names for one or more
tables. (This function is essentially the two-dimensional version of
GPUdb#create_trigger_by_range.) Once the trigger has been
activated, any record added to the listed table(s) via
GPUdb#insert_records with the chosen columns' values falling
within the specified region will trip the trigger. All such records will be
queued at the trigger port (by default '9001' but able to be retrieved via
GPUdb#show_system_status) for any listening client to collect.
Active triggers can be cancelled by using the
GPUdb#clear_trigger endpoint or by clearing all relevant
tables.
The output returns the trigger handle as well as indicating success or
failure of the trigger activation.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Sets up a simple range trigger for a column_name for one or more tables.
Once the trigger has been activated, any record added to the listed
table(s) via GPUdb#insert_records with the chosen
column_name's value falling within the specified range will trip the
trigger. All such records will be queued at the trigger port (by default
'9001' but able to be retrieved via GPUdb#show_system_status)
for any listening client to collect. Active triggers can be cancelled by
using the GPUdb#clear_trigger endpoint or by clearing all
relevant tables.
The output returns the trigger handle as well as indicating success or
failure of the trigger activation.
Parameters:
Name
Type
Description
request_id
String
User-created ID for the trigger. The ID can be
alphanumeric, contain symbols, and must contain
at least one character.
table_names
Array.<String>
Tables on which the trigger will be active.
column_name
String
Name of a numeric column_name on which the
trigger is activated.
min
Number
The lower bound (inclusive) for the trigger range.
max
Number
The upper bound (inclusive) for the trigger range.
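A sketch with hypothetical table, column, and bounds:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_trigger_by_range(
    "range_trigger_1",     // request_id
    ["sensor_readings"],   // table_names (hypothetical)
    "temperature",         // column_name
    90, 120,               // min, max (inclusive bounds)
    {},                    // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);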
Sets up a simple range trigger for a column_name for one or more tables.
Once the trigger has been activated, any record added to the listed
table(s) via GPUdb#insert_records with the chosen
column_name's value falling within the specified range will trip the
trigger. All such records will be queued at the trigger port (by default
'9001' but able to be retrieved via GPUdb#show_system_status)
for any listening client to collect. Active triggers can be cancelled by
using the GPUdb#clear_trigger endpoint or by clearing all
relevant tables.
The output returns the trigger handle as well as indicating success or
failure of the trigger activation.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Creates a new type describing the layout or schema of a table. The type
definition is a JSON string describing the fields (i.e. columns) of the
type. Each field consists of a name and a data type. Supported data types
are: double, float, int, long, string, and bytes. In addition one or more
properties can be specified for each column which customize the memory usage
and query availability of that column. Note that some properties are
mutually exclusive--i.e. they cannot be specified for any given column
simultaneously. One example of mutually exclusive properties are
data and store_only.
A single primary key and/or single shard key can
be set across one or more columns. If a primary key is specified, then a
uniqueness constraint is enforced, in that only a single object can exist
with a given primary key. When inserting
data into a table with a primary key, depending on the parameters in the
request, incoming objects with primary key values that match existing
objects will either overwrite (i.e. update) the existing object or will be
skipped and not added into the set.
Example of a type definition with some of the parameters:
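A representative Avro-style definition, with illustrative field names, might be:

{
  "type": "record",
  "name": "point",
  "fields": [
    {"name": "msg_id", "type": "string"},
    {"name": "x", "type": "double"},
    {"name": "y", "type": "double"},
    {"name": "TIMESTAMP", "type": "long"}
  ]
}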
Parameters:
Name
Type
Description
type_definition
String
A JSON string describing the columns of the
type to be registered.
label
String
A user-defined description string which can be used
to differentiate between tables and types with
otherwise identical schemas.
properties
Object
Each key-value pair specifies the properties to
use for a given column where the key is the
column name. All keys used must be relevant
column names for the given table. Specifying
any property overrides the default properties
for that column (which is based on the column's
data type).
Valid values are:
'data': Default property for all
numeric and string type columns; makes the
column available for GPU queries.
'text_search': Valid only for
'string' columns. Enables full text search for
string columns. Can be set independently of
data and store_only.
'store_only': Persist the column
value but do not make it available to queries
(e.g. GPUdb#filter)--i.e. it is
mutually exclusive to the data
property. Any 'bytes' type column must have a
store_only property. This property
reduces system memory usage.
'timestamp': Valid only for 'long'
columns. Indicates that this field represents a
timestamp and will be provided in milliseconds
since the Unix epoch: 00:00:00 Jan 1 1970.
Dates represented by a timestamp must fall
between the year 1000 and the year 2900.
'decimal': Valid only for 'string'
columns. It represents a SQL type NUMERIC(19,
4) data type. There can be up to 15 digits
before the decimal point and up to four digits
in the fractional part. The value can be
positive or negative (indicated by a minus sign
at the beginning). This property is mutually
exclusive with the text_search
property.
'date': Valid only for 'string'
columns. Indicates that this field represents a
date and will be provided in the format
'YYYY-MM-DD'. The allowable range is 1000-01-01
through 2900-01-01. This property is mutually
exclusive with the text_search
property.
'time': Valid only for 'string'
columns. Indicates that this field represents a
time-of-day and will be provided in the format
'HH:MM:SS.mmm'. The allowable range is
00:00:00.000 through 23:59:59.999. This
property is mutually exclusive with the
text_search property.
'datetime': Valid only for 'string'
columns. Indicates that this field represents a
datetime and will be provided in the format
'YYYY-MM-DD HH:MM:SS.mmm'. The allowable range
is 1000-01-01 00:00:00.000 through 2900-01-01
23:59:59.999. This property is mutually
exclusive with the text_search
property.
'char1': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 1 character.
'char2': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 2 characters.
'char4': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 4 characters.
'char8': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 8 characters.
'char16': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 16 characters.
'char32': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 32 characters.
'char64': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 64 characters.
'char128': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 128 characters.
'char256': This property provides
optimized memory, disk and query performance for
string columns. Strings with this property must
be no longer than 256 characters.
'int8': This property provides
optimized memory and query performance for int
columns. Ints with this property must be between
-128 and +127 (inclusive)
'int16': This property provides
optimized memory and query performance for int
columns. Ints with this property must be between
-32768 and +32767 (inclusive)
'ipv4': This property provides
optimized memory, disk and query performance for
string columns representing IPv4 addresses (i.e.
192.168.1.1). Strings with this property must be
of the form: A.B.C.D where A, B, C and D are in
the range of 0-255.
'wkt': Valid only for 'string' and
'bytes' columns. Indicates that this field
contains geospatial geometry objects in
Well-Known Text (WKT) or Well-Known Binary (WKB)
format.
'primary_key': This property
indicates that this column will be part of (or
the entire) primary key.
'shard_key': This property
indicates that this column will be part of (or
the entire) shard key.
'nullable': This property indicates
that this column is nullable. However, setting
this property is insufficient for making the
column nullable. The user must declare the type
of the column as a union between its regular
type and 'null' in the avro schema for the
record type in type_definition.
For example, if a column is of type integer and
is nullable, then the entry for the column in
the avro schema must be: ['int', 'null'].
The C++, C#, Java, and Python APIs have built-in
conveniences that set the avro schema
automatically. For those languages, one can
use this property as usual and not have to worry
about the avro schema for the record.
'dict': This property indicates
that this column should be dictionary encoded.
It can only be used in conjunction with string
columns marked with a charN or date property or
with int or long columns. This property is
appropriate for columns where the cardinality
(the number of unique values) is expected to be
low, and can save a large amount of memory.
'init_with_now': For columns with
attributes of date, time, datetime or timestamp,
at insert time, replace empty strings and
invalid timestamps with NOW()
Response object containing the method_codes of the
operation.
Type
Object
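A hedged end-to-end sketch; the field names, label, and properties here are illustrative:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
var typeDefinition = JSON.stringify({
    type: "record",
    name: "point",
    fields: [
        { name: "msg_id", type: "string" },
        { name: "x", type: "double" },
        { name: "y", type: "double" }
    ]
});
gpudb.create_type(
    typeDefinition,                               // type_definition
    "point_type",                                 // label
    { "msg_id": ["store_only", "text_search"] },  // properties per column
    {},                                           // options
    function(err, response) {
        // response.type_id can be passed to create_table
        if (err) { console.error(err); }
    }
);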
create_type_request(request, callback) → {Object}
Creates a new type describing the layout or schema of a table. The type
definition is a JSON string describing the fields (i.e. columns) of the
type. Each field consists of a name and a data type. Supported data types
are: double, float, int, long, string, and bytes. In addition one or more
properties can be specified for each column which customize the memory usage
and query availability of that column. Note that some properties are
mutually exclusive--i.e. they cannot be specified for any given column
simultaneously. One example of mutually exclusive properties are
data and store_only.
A single primary key and/or single shard key can
be set across one or more columns. If a primary key is specified, then a
uniqueness constraint is enforced, in that only a single object can exist
with a given primary key. When inserting
data into a table with a primary key, depending on the parameters in the
request, incoming objects with primary key values that match existing
objects will either overwrite (i.e. update) the existing object or will be
skipped and not added into the set.
Example of a type definition with some of the parameters:
INTERSECT (DISTINCT/ALL) - For data set intersection details and examples,
see Intersect.
For limitations, see Intersect Limitations.
EXCEPT (DISTINCT/ALL) - For data set subtraction details and examples, see
Except. For
limitations, see Except Limitations.
MERGE VIEWS - For a given set of filtered views
on a single table, creates a single filtered view containing all of the
unique records across all of the given filtered data sets.
Non-charN 'string' and 'bytes' column types cannot be merged, nor can
columns marked as store-only.
Parameters:
Name
Type
Description
table_name
String
Name of the table to be created. Has the same
naming restrictions as tables.
table_names
Array.<String>
The list of table names to merge. Must
contain the names of one or more existing
tables.
input_column_names
Array.<Array.<String>>
The list of columns from each of the
corresponding input tables.
output_column_names
Array.<String>
The list of names of the columns to
be stored in the output table.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the output table. If
the collection provided is non-existent, the
collection will be automatically created. If empty,
the output table will be a top-level table. The
default value is ''.
'materialize_on_gpu': If
true, then the columns of the output
table will be cached on the GPU.
Supported values:
'true'
'false'
The default value is 'false'.
'mode': If merge_views,
then this operation will merge the provided views.
All table_names must be views from the
same underlying base table.
Supported values:
'union_all': Retains all rows from the
specified tables.
'union': Retains all unique rows from
the specified tables (synonym for
union_distinct).
'union_distinct': Retains all unique
rows from the specified tables.
'except': Retains all unique rows from
the first table that do not appear in the second
table (only works on 2 tables).
'except_all': Retains all
rows (including duplicates) from the first table
that do not appear in the second table (only works
on 2 tables).
'intersect': Retains all unique rows
that appear in both of the specified tables (only
works on 2 tables).
'intersect_all': Retains all
rows (including duplicates) that appear in both of
the specified tables (only works on 2 tables).
'merge_views': Merge two or more views
(or views of views) of the same base data set into
a new view. If this mode is selected,
input_column_names AND
output_column_names must be empty. The
resulting view would match the results of a SQL OR
operation, e.g., if filter 1 creates a view using
the expression 'x = 20' and filter 2 creates a view
using the expression 'x <= 10', then the merge
views operation creates a new view using the
expression 'x = 20 OR x <= 10'.
The default value is 'union_all'.
'chunk_size': Indicates the chunk size
to be used for this table.
'create_indexes': Comma-separated list
of columns on which to create indexes on the output
table. The columns specified must be present in
output_column_names.
'ttl': Sets the TTL of the table specified in
table_name.
'persist': If true, then
the table specified in table_name will
be persisted and will not expire unless a
ttl is specified. If
false, then the table will be an
in-memory table and will expire unless a
ttl is specified otherwise.
Supported values:
'true'
'false'
The default value is 'false'.
'view_id': view the output table will
be a part of. The default value is ''.
'force_replicated': If
true, then the table specified in
table_name will be replicated even if
the source tables are not.
Supported values:
'true'
'false'
The default value is 'false'.
INTERSECT (DISTINCT/ALL) - For data set intersection details and examples,
see Intersect.
For limitations, see Intersect Limitations.
EXCEPT (DISTINCT/ALL) - For data set subtraction details and examples, see
Except. For
limitations, see Except Limitations.
MERGE VIEWS - For a given set of filtered views
on a single table, creates a single filtered view containing all of the
unique records across all of the given filtered data sets.
Non-charN 'string' and 'bytes' column types cannot be merged, nor can
columns marked as store-only.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
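A sketch of a UNION DISTINCT across two hypothetical tables:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_union(
    "all_trips",                                 // table_name of the output
    ["trips_2016", "trips_2017"],                // table_names to combine
    [["trip_id", "fare"], ["trip_id", "fare"]],  // input_column_names, one list per input table
    ["trip_id", "fare"],                         // output_column_names
    { "mode": "union_distinct" },                // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);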
Creates a new external user (a user whose credentials are managed by an
external LDAP).
Parameters:
Name
Type
Description
name
String
Name of the user to be created. Must exactly match the
user's name in the external LDAP, prefixed with a @.
Must not be the same name as an existing user.
Creates a new internal user (a user whose credentials are managed by the
database system).
Parameters:
Name
Type
Description
name
String
Name of the user to be created. Must contain only
lowercase letters, digits, and underscores, and cannot
begin with a digit. Must not be the same name as an
existing user or role.
password
String
Initial password of the user to be created. May be
an empty string for no password.
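For example (name and password hypothetical):

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.create_user_internal(
    "jdoe",     // name (hypothetical)
    "secret",   // initial password
    {},         // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);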
Deletes record(s) matching the provided criteria from the given table. The
record selection criteria can either be one or more
expressions (matching multiple records), a single record
identified by record_id options, or all records when using
delete_all_records. Note that the three selection criteria are
mutually exclusive. This operation cannot be run on a collection or a view.
The operation is synchronous, meaning that a response will not be available
until the request is completely processed and all the matching records are
deleted.
Parameters:
Name
Type
Description
table_name
String
Name of the table from which to delete records.
The set must be a currently existing table and
not a collection or a view.
expressions
Array.<String>
A list of the actual predicates, one for each
select; format should follow the guidelines
provided here. Specifying one or
more expressions is mutually
exclusive to specifying
record_id in the
options.
options
Object
Optional parameters.
'global_expression': An optional
global expression to reduce the search space of the
expressions. The default value is ''.
'delete_all_records': If set to
true, all records in the table will be
deleted. If set to false, then the
option is effectively ignored.
Supported values:
'true'
'false'
The default value is 'false'.
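A sketch deleting by expression from a hypothetical table; count_deleted as the response field is an assumption:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.delete_records(
    "sensor_readings",      // table_name (hypothetical); not a collection or view
    ["temperature > 150"],  // expressions selecting the records to delete
    {},                     // options; record_id and delete_all_records are the alternatives
    function(err, response) {
        if (!err) { console.log(response.count_deleted); }  // assumed response field
    }
);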
Deletes record(s) matching the provided criteria from the given table. The
record selection criteria can either be one or more
expressions (matching multiple records), a single record
identified by record_id options, or all records when using
delete_all_records. Note that the three selection criteria are
mutually exclusive. This operation cannot be run on a collection or a view.
The operation is synchronous, meaning that a response will not be available
until the request is completely processed and all the matching records are
deleted.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Executes a proc. This endpoint is asynchronous and does not wait for the
proc to complete before returning.
Parameters:
Name
Type
Description
proc_name
String
Name of the proc to execute. Must be the name of
a currently existing proc.
params
Object
A map containing named parameters to pass to the
proc. Each key/value pair specifies the name of a
parameter and its value.
bin_params
Object
A map containing named binary parameters to pass
to the proc. Each key/value pair specifies the
name of a parameter and its value.
input_table_names
Array.<String>
Names of the tables containing data to
be passed to the proc. Each name
specified must be the name of a
currently existing table. If no table
names are specified, no data will be
passed to the proc.
input_column_names
Object
Map of table names from
input_table_names to lists
of names of columns from those tables
that will be passed to the proc. Each
column name specified must be the name
of an existing column in the
corresponding table. If a table name
from input_table_names is
not included, all columns from that
table will be passed to the proc.
output_table_names
Array.<String>
Names of the tables to which output
data from the proc will be written. If
a specified table does not exist, it
will automatically be created with the
same schema as the corresponding table
(by order) from
input_table_names,
excluding any primary and shard keys.
If a specified table is a
non-persistent result table, it must
not have primary or shard keys. If no
table names are specified, no output
data can be returned from the proc.
options
Object
Optional parameters.
'cache_input': A comma-delimited list
of table names from input_table_names
from which input data will be cached for use in
subsequent calls to
GPUdb#execute_proc with the
use_cached_input option. Cached input
data will be retained until the proc status is
cleared with the
clear_complete
option of GPUdb#show_proc_status and
all proc instances using the cached data have
completed. The default value is ''.
'use_cached_input': A comma-delimited
list of run IDs (as returned from prior calls to
GPUdb#execute_proc) of running or
completed proc instances from which input data
cached using the cache_input option
will be used. Cached input data will not be used
for any tables specified in
input_table_names, but data from all
other tables cached for the specified run IDs will
be passed to the proc. If the same table was cached
for multiple specified run IDs, the cached data
from the first run ID specified in the list that
includes that table will be used. The default
value is ''.
'kifs_input_dirs': A comma-delimited
list of KiFS directories whose local files will be
made directly accessible to the proc through the
API. (All KiFS files, local or not, are also
accessible through the file system below the KiFS
mount point.) Each name specified must be the name of
an existing KiFS directory. The default value is
''.
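A hedged sketch of running the proc registered earlier; table and parameter names are hypothetical, and run_id as the response field for the returned run ID is an assumption:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.execute_proc(
    "sum_proc",                      // proc_name registered via create_proc
    { "param1": "42" },              // params
    {},                              // bin_params
    ["weather"],                     // input_table_names (hypothetical)
    { "weather": ["temperature"] },  // input_column_names per input table
    ["weather_out"],                 // output_table_names, auto-created if absent
    {},                              // options
    function(err, response) {
        // poll the run via show_proc_status
        if (!err) { console.log(response.run_id); }  // assumed response field
    }
);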
The response message contains the number of points for which the expression
evaluated to be true, which is equivalent to the size of the result view.
Parameters:
Name
Type
Description
table_name
String
Name of the table to filter. This may be the ID
of a collection, table or a result set (for
chaining queries). If filtering a collection,
all child tables where the filter expression is
valid will be filtered; the filtered result
tables will then be placed in a collection
specified by view_name.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
expression
String
The select expression to filter the specified
table. For details see Expressions.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
'view_id': view this filtered-view is
part of. The default value is ''.
'ttl': Sets the TTL of the view specified in
view_name.
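A minimal sketch with a hypothetical table and expression:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.filter(
    "weather",           // table_name (hypothetical)
    "hot_days",          // view_name for the result view
    "temperature > 90",  // expression
    {},                  // options
    function(err, response) {
        if (!err) { console.log(response.count); }  // size of the result view
    }
);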
Calculates which objects from a table are within a named area of interest
(NAI/polygon). The operation is synchronous, meaning that a response will
not be returned until all the matching objects are fully available. The
response payload provides the count of the resulting set. A new resultant
set (view) which satisfies the input NAI restriction specification is
created with the name view_name passed in as part of the input.
Parameters:
Name
Type
Description
table_name
String
Name of the table to filter. This may be the
name of a collection, a table or a view (when
chaining queries). If filtering a collection,
all child tables where the filter expression is
valid will be filtered; the filtered result
tables will then be placed in a collection
specified by view_name.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
x_column_name
String
Name of the column containing the x values to
be filtered.
x_vector
Array.<Number>
List of x coordinates of the vertices of the
polygon representing the area to be filtered.
y_column_name
String
Name of the column containing the y values to
be filtered.
y_vector
Array.<Number>
List of y coordinates of the vertices of the
polygon representing the area to be filtered.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
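A sketch with hypothetical table, columns, and polygon vertices:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.filter_by_area(
    "vehicle_locations",                // table_name (hypothetical)
    "in_region",                        // view_name
    "x", [-74.5, -74.0, -74.0, -74.5],  // x_column_name, polygon x-coordinates
    "y", [40.5, 40.5, 41.0, 41.0],      // y_column_name, polygon y-coordinates
    {},                                 // options
    function(err, response) {
        if (!err) { console.log(response.count); }
    }
);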
Calculates which geospatial geometry objects from a table intersect a named
area of interest (NAI/polygon). The operation is synchronous, meaning that a
response will not be returned until all the matching objects are fully
available. The response payload provides the count of the resulting set. A
new resultant set (view) which satisfies the input NAI restriction
specification is created with the name view_name passed in as
part of the input.
Parameters:
Name
Type
Description
table_name
String
Name of the table to filter. This may be the
name of a collection, a table or a view (when
chaining queries). If filtering a collection,
all child tables where the filter expression is
valid will be filtered; the filtered result
tables will then be placed in a collection
specified by view_name.
view_name
String
If provided, then this will be the name of the
view containing the results. Must not be an
already existing collection, table or view.
column_name
String
Name of the geospatial geometry column to be
filtered.
x_vector
Array.<Number>
List of x coordinates of the vertices of the
polygon representing the area to be filtered.
y_vector
Array.<Number>
List of y coordinates of the vertices of the
polygon representing the area to be filtered.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
Calculates which geospatial geometry objects from a table intersect a named
area of interest (NAI/polygon). The operation is synchronous, meaning that a
response will not be returned until all the matching objects are fully
available. The response payload provides the count of the resulting set. A
new resultant set (view) which satisfies the input NAI restriction
specification is created with the name view_name passed in as
part of the input.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates which objects from a table are within a named area of interest
(NAI/polygon). The operation is synchronous, meaning that a response will
not be returned until all the matching objects are fully available. The
response payload provides the count of the resulting set. A new resultant
set (view) which satisfies the input NAI restriction specification is
created with the name view_name passed in as part of the input.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates how many objects within the given table lie in a rectangular box.
The operation is synchronous, meaning that a response will not be returned
until all the objects are fully available. The response payload provides the
count of the resulting set. A new resultant set which satisfies the input
NAI restriction specification is also created when a view_name
is passed in as part of the input payload.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the bounding box
operation will be performed. Must be an existing
table.
view_name
String
Optional name of the result view that will be
created containing the results of the query. Has
the same naming restrictions as tables.
x_column_name
String
Name of the column on which to perform the
bounding box query. Must be a valid numeric
column.
min_x
Number
Lower bound for the column chosen by
x_column_name. Must be less than or
equal to max_x.
max_x
Number
Upper bound for x_column_name. Must be
greater than or equal to min_x.
y_column_name
String
Name of a column on which to perform the
bounding box query. Must be a valid numeric
column.
min_y
Number
Lower bound for y_column_name. Must be
less than or equal to max_y.
max_y
Number
Upper bound for y_column_name. Must be
greater than or equal to min_y.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
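A sketch with hypothetical table, columns, and bounds:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.filter_by_box(
    "vehicle_locations", "in_box",  // table_name, view_name (hypothetical)
    "x", -74.5, -74.0,              // x_column_name, min_x, max_x
    "y", 40.5, 41.0,                // y_column_name, min_y, max_y
    {},                             // options
    function(err, response) {
        if (!err) { console.log(response.count); }
    }
);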
Calculates which geospatial geometry objects from a table intersect a
rectangular box. The operation is synchronous, meaning that a response will
not be returned until all the objects are fully available. The response
payload provides the count of the resulting set. A new resultant set which
satisfies the input NAI restriction specification is also created when a
view_name is passed in as part of the input payload.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the bounding box
operation will be performed. Must be an existing
table.
view_name
String
Optional name of the result view that will be
created containing the results of the query. Must
not be an already existing collection, table or
view.
column_name
String
Name of the geospatial geometry column to be
filtered.
min_x
Number
Lower bound for the x-coordinate of the rectangular
box. Must be less than or equal to
max_x.
max_x
Number
Upper bound for the x-coordinate of the rectangular
box. Must be greater than or equal to
min_x.
min_y
Number
Lower bound for the y-coordinate of the rectangular
box. Must be less than or equal to
max_y.
max_y
Number
Upper bound for the y-coordinate of the rectangular
box. Must be greater than or equal to
min_y.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
Calculates which geospatial geometry objects from a table intersect a
rectangular box. The operation is synchronous, meaning that a response will
not be returned until all the objects are fully available. The response
payload provides the count of the resulting set. A new resultant set which
satisfies the input NAI restriction specification is also created when a
view_name is passed in as part of the input payload.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates how many objects within the given table lie in a rectangular box.
The operation is synchronous, meaning that a response will not be returned
until all the objects are fully available. The response payload provides the
count of the resulting set. A new resultant set which satisfies the input
NAI restriction specification is also created when a view_name
is passed in as part of the input payload.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Applies a geometry filter against a geospatial geometry column in a given
table, collection or view. The filtering geometry is provided by
input_wkt.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the filter by
geometry will be performed. Must be an existing
table, collection or view containing a
geospatial geometry column.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
column_name
String
Name of the column to be used in the filter.
Must be a geospatial geometry column.
input_wkt
String
A geometry in WKT format that will be used to
filter the objects in table_name.
operation
String
The geometric filtering operation to perform
Supported values:
'contains': Matches records that
contain the given WKT in input_wkt,
i.e. the given WKT is within the bounds of a
record's geometry.
'crosses': Matches records that
cross the given WKT.
'disjoint': Matches records that are
disjoint from the given WKT.
'equals': Matches records that are
the same as the given WKT.
'intersects': Matches records that
intersect the given WKT.
'overlaps': Matches records that
overlap the given WKT.
'touches': Matches records that
touch the given WKT.
'within': Matches records that are
within the given WKT.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
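A sketch with a hypothetical table and WKT polygon:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.filter_by_geometry(
    "parcels", "parcels_hit",  // table_name, view_name (hypothetical)
    "geom",                    // geospatial geometry column
    "POLYGON((-74.5 40.5, -74.0 40.5, -74.0 41.0, -74.5 41.0, -74.5 40.5))",  // input_wkt
    "intersects",              // operation
    {},                        // options
    function(err, response) {
        if (err) { console.error(err); }
    }
);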
Calculates which records from a table have values in the given list for the
corresponding column. The operation is synchronous, meaning that a response
will not be returned until all the objects are fully available. The response
payload provides the count of the resulting set. A new resultant set (view)
which satisfies the input filter specification is also created if a
view_name is passed in as part of the request.
For example, if a type definition has the columns 'x' and 'y', then a filter
by list query with the column map {"x":["10.1", "2.3"], "y":["0.0", "-31.5",
"42.0"]} will return the count of all data points whose x and y values match
both in the respective x- and y-lists, e.g., "x = 10.1 and y = 0.0", "x =
2.3 and y = -31.5", etc. However, a record with "x = 10.1 and y = -31.5" or
"x = 2.3 and y = 0.0" would not be returned because the values in the given
lists do not correspond.
Parameters:
Name
Type
Description
table_name
String
Name of the table to filter. This may be the ID
of a collection, table or a result set (for
chaining queries). If filtering a collection,
all child tables where the filter expression is
valid will be filtered; the filtered result
tables will then be placed in a collection
specified by view_name.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
column_values_map
Object
List of values for the corresponding
column in the table
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
'filter_mode': String indicating the
filter mode, either 'in_list' or 'not_in_list'.
Supported values:
'in_list': The filter will match all
items that are in the provided list(s).
'not_in_list': The filter will match
all items that are not in the provided list(s).
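A sketch reusing the x/y lists from the example above; the table name is hypothetical:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.filter_by_list(
    "points", "matched",                              // table_name, view_name (hypothetical)
    { "x": ["10.1", "2.3"], "y": ["0.0", "-31.5"] },  // column_values_map; values are strings
    { "filter_mode": "in_list" },                     // options
    function(err, response) {
        if (!err) { console.log(response.count); }
    }
);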
Calculates which records from a table have values in the given list for the
corresponding column. The operation is synchronous, meaning that a response
will not be returned until all the objects are fully available. The response
payload provides the count of the resulting set. A new resultant set (view)
which satisfies the input filter specification is also created if a
view_name is passed in as part of the request.
For example, if a type definition has the columns 'x' and 'y', then a filter
by list query with the column map {"x":["10.1", "2.3"], "y":["0.0", "-31.5",
"42.0"]} will return the count of all data points whose x and y values match
both in the respective x- and y-lists, e.g., "x = 10.1 and y = 0.0", "x =
2.3 and y = -31.5", etc. However, a record with "x = 10.1 and y = -31.5" or
"x = 2.3 and y = 0.0" would not be returned because the values in the given
lists do not correspond.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates which objects from a table lie within a circle with the given
radius and center point (i.e. circular NAI). The operation is synchronous,
meaning that a response will not be returned until all the objects are fully
available. The response payload provides the count of the resulting set. A
new resultant set (view) which satisfies the input circular NAI restriction
specification is also created if a view_name is passed in as
part of the request.
For track data, all track points that lie within the circle plus one point
on either side of the circle (if the track goes beyond the circle) will be
included in the result.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the filter by radius
operation will be performed. Must be an
existing table.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
x_column_name
String
Name of the column to be used for the
x-coordinate (the longitude) of the center.
x_center
Number
Value of the longitude of the center. Must be
within [-180.0, 180.0].
y_column_name
String
Name of the column to be used for the
y-coordinate (the latitude) of the center.
y_center
Number
Value of the latitude of the center. Must be
within [-90.0, 90.0].
radius
Number
The radius of the circle within which the search
will be performed. Must be a non-zero positive
value. It is in meters; so, for example, a value of
'42000' means 42 km.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
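A sketch with hypothetical table, columns, and a 42 km radius:

// assumes: var gpudb = new GPUdb("http://localhost:9191");
gpudb.filter_by_radius(
    "vehicle_locations", "near_depot",  // table_name, view_name (hypothetical)
    "x", -74.25,                        // x_column_name, x_center (longitude)
    "y", 40.7,                          // y_column_name, y_center (latitude)
    42000,                              // radius in meters (42 km)
    {},                                 // options
    function(err, response) {
        if (!err) { console.log(response.count); }
    }
);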
Calculates which geospatial geometry objects from a table intersect a circle
with the given radius and center point (i.e. circular NAI). The operation is
synchronous, meaning that a response will not be returned until all the
objects are fully available. The response payload provides the count of the
resulting set. A new resultant set (view) which satisfies the input circular
NAI restriction specification is also created if a view_name is
passed in as part of the request.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the filter by radius
operation will be performed. Must be an
existing table.
view_name
String
If provided, then this will be the name of the
view containing the results. Must not be an
already existing collection, table or view.
column_name
String
Name of the geospatial geometry column to be
filtered.
x_center
Number
Value of the longitude of the center. Must be
within [-180.0, 180.0].
y_center
Number
Value of the latitude of the center. Must be
within [-90.0, 90.0].
radius
Number
The radius of the circle within which the search
will be performed. Must be a non-zero positive
value. It is in meters; so, for example, a value of
'42000' means 42 km.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
Calculates which geospatial geometry objects from a table intersect a circle
with the given radius and center point (i.e. circular NAI). The operation is
synchronous, meaning that a response will not be returned until all the
objects are fully available. The response payload provides the count of the
resulting set. A new resultant set (view) which satisfies the input circular
NAI restriction specification is also created if a view_name is
passed in as part of the request.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates which objects from a table lie within a circle with the given
radius and center point (i.e. circular NAI). The operation is synchronous,
meaning that a response will not be returned until all the objects are fully
available. The response payload provides the count of the resulting set. A
new resultant set (view) which satisfies the input circular NAI restriction
specification is also created if a view_name is passed in as
part of the request.
For track data, all track points that lie within the circle plus one point
on either side of the circle (if the track goes beyond the circle) will be
included in the result.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates which objects from a table have a column that is within the given
bounds. An object from the table identified by table_name is
added to the view view_name if its column is within
[lower_bound, upper_bound] (inclusive). The
operation is synchronous. The response provides a count of the number of
objects which passed the bound filter. Although this functionality can also
be accomplished with the standard filter function, it is more efficient.
For track objects, the count reflects how many points fall within the given
bounds (which may not include all the track points of any given track).
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the filter by range
operation will be performed. Must be an
existing table.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
column_name
String
Name of a column on which the operation would
be applied.
lower_bound
Number
Value of the lower bound (inclusive).
upper_bound
Number
Value of the upper bound (inclusive).
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
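Example (a minimal sketch; the table and column names are hypothetical, and a connected GPUdb instance gpudb and the Node-style (err, response) callback convention are assumed):
gpudb.filter_by_range(
    "sensor_readings",    // table_name: must be an existing table
    "readings_in_band",   // view_name: will hold the filtered results
    "temperature",        // column_name to which the bounds apply
    20.0,                 // lower_bound (inclusive)
    30.0,                 // upper_bound (inclusive)
    {},                   // options
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.count + " records passed the bound filter");
    }
);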
Calculates which objects from a table have a column that is within the given
bounds. An object from the table identified by table_name is
added to the view view_name if its column is within
[lower_bound, upper_bound] (inclusive). The
operation is synchronous. The response provides a count of the number of
objects which passed the bound filter. Although this functionality can also
be accomplished with the standard filter function, it is more efficient.
For track objects, the count reflects how many points fall within the given
bounds (which may not include all the track points of any given track).
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Filters objects matching all points of the given track (works only on track
type data). It allows users to specify a particular track to find all other
points in the table that fall within specified spatial and temporal ranges
of all points of the given track. Additionally, the user can
specify another track to see if the two intersect (or go close to each other
within the specified ranges). The user also has the flexibility of using
different metrics for the spatial distance calculation: Euclidean (flat
geometry) or Great Circle (spherical geometry to approximate the Earth's
surface distances). The filtered points are stored in a newly created result
set. The return value of the function is the number of points in the
resultant set (view).
This operation is synchronous, meaning that a response will not be returned
until all the objects are fully available.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the filter by track
operation will be performed. Must be a currently
existing table with a track present.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
track_id
String
The ID of the track which will act as the
filtering points. Must be an existing track within
the given table.
target_track_ids
Array.<String>
Up to one track ID to intersect with the
"filter" track. If provided, it must
be a valid track ID within the given
set.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
'spatial_radius': A positive number
passed as a string representing the radius of the
search area centered around each track point's
geospatial coordinates. The value is interpreted in
meters. Required parameter.
'time_radius': A positive number
passed as a string representing the maximum
allowable time difference between the timestamps of
a filtered object and the given track's points. The
value is interpreted in seconds. Required
parameter.
'spatial_distance_metric': A string
representing the coordinate system to use for the
spatial search criteria. Acceptable values are
'euclidean' and 'great_circle'. Optional parameter;
default is 'euclidean'.
Supported values:
'euclidean'
'great_circle'
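Example (a minimal sketch; the table and track ID are hypothetical, and a connected GPUdb instance gpudb and the Node-style (err, response) callback convention are assumed):
gpudb.filter_by_track(
    "flights",           // table_name: track-type table
    "near_flight_123",   // view_name: will hold the filtered results
    "FLIGHT123",         // track_id: the "filter" track
    [],                  // target_track_ids: empty searches all points
    {
        spatial_radius: "5000",   // required; meters, passed as a string
        time_radius: "300",       // required; seconds, passed as a string
        spatial_distance_metric: "great_circle"
    },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.count + " points in the resultant view");
    }
);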
Filters objects matching all points of the given track (works only on track
type data). It allows users to specify a particular track to find all other
points in the table that fall within specified spatial and temporal ranges
of all points of the given track. Additionally, the user can
specify another track to see if the two intersect (or go close to each other
within the specified ranges). The user also has the flexibility of using
different metrics for the spatial distance calculation: Euclidean (flat
geometry) or Great Circle (spherical geometry to approximate the Earth's
surface distances). The filtered points are stored in a newly created result
set. The return value of the function is the number of points in the
resultant set (view).
This operation is synchronous, meaning that a response will not be returned
until all the objects are fully available.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates which objects from a table, collection, or view match a string
expression for the given string columns. The option 'case_sensitive' can be
used to modify the behavior for all modes except 'search'. For 'search' mode
details and limitations, see Full Text Search.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which the filter operation
will be performed. Must be an existing table,
collection or view.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
expression
String
The expression with which to filter the table.
mode
String
The string filtering mode to apply. See below for
details.
Supported values:
'search': Full text search query with
wildcards and boolean operators. Note that for this
mode, no column can be specified in
column_names; all string columns of the
table that have text search enabled will be searched.
'equals': Exact whole-string match
(accelerated).
'contains': Partial substring match (not
accelerated). If the column is a string type
(non-charN) and the number of records is too large, it
will return 0.
'starts_with': Strings that start with
the given expression (not accelerated). If the column
is a string type (non-charN) and the number of records
is too large, it will return 0.
'regex': Full regular expression search
(not accelerated). If the column is a string type
(non-charN) and the number of records is too large, it
will return 0.
column_names
Array.<String>
List of columns on which to apply the
filter. Ignored for 'search' mode.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
'case_sensitive': If 'false' then
string filtering will ignore case. Does not apply
to 'search' mode.
Supported values:
'true'
'false'
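Example (a minimal sketch; the table and column names are hypothetical, and a connected GPUdb instance gpudb and the Node-style (err, response) callback convention are assumed):
gpudb.filter_by_string(
    "customers",               // table_name
    "customers_named_smith",   // view_name
    "Smith",                   // expression to filter with
    "equals",                  // mode: exact whole-string match
    ["last_name"],             // column_names (ignored for 'search' mode)
    { case_sensitive: "false" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.count);
    }
);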
Calculates which objects from a table, collection, or view match a string
expression for the given string columns. The option 'case_sensitive' can be
used to modify the behavior for all modes except 'search'. For 'search' mode
details and limitations, see Full Text Search.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Filters objects in one table based on objects in another table. The user
must specify matching column types from the two tables (i.e. the target
table from which objects will be filtered and the source table based on
which the filter will be created); the column names need not be the same. If
a view_name is specified, then the filtered objects will then
be put in a newly created view. The operation is synchronous, meaning that a
response will not be returned until all objects are fully available in the
result view. The return value contains the count (i.e. the size) of the
resulting view.
Parameters:
Name
Type
Description
table_name
String
Name of the table whose data will be filtered.
Must be an existing table.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
column_name
String
Name of the column by whose value the data will
be filtered from the table designated by
table_name.
source_table_name
String
Name of the table whose data will be
compared against in the table called
table_name. Must be an
existing table.
source_table_column_name
String
Name of the column in the
source_table_name
whose values will be used as the
filter for table
table_name. Must be a
geospatial geometry column if in
'spatial' mode; otherwise, must
match the type of the
column_name.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
'filter_mode': String indicating the
filter mode, either in_table or
not_in_table.
Supported values:
'in_table'
'not_in_table'
The default value is 'in_table'.
'mode': Mode - should be either
spatial or normal.
Supported values:
'normal'
'spatial'
The default value is 'normal'.
'buffer': Buffer size, in meters. Only
relevant for spatial mode. The
default value is '0'.
'buffer_method': Method used to buffer
polygons. Only relevant for spatial
mode.
Supported values:
'normal'
'geos': Use geos 1 edge per corner
algorithm
The default value is 'normal'.
'max_partition_size': Maximum number
of points in a partition. Only relevant for
spatial mode. The default value is
'0'.
'max_partition_score': Maximum number
of points * edges in a partition. Only relevant for
spatial mode. The default value is
'8000000'.
'x_column_name': Name of column
containing x value of point being filtered in
spatial mode. The default value is
'x'.
'y_column_name': Name of column
containing y value of point being filtered in
spatial mode. The default value is
'y'.
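Example (a minimal sketch; all table and column names are hypothetical, and a connected GPUdb instance gpudb and the Node-style (err, response) callback convention are assumed):
gpudb.filter_by_table(
    "orders",           // table_name: table whose data will be filtered
    "orders_of_vips",   // view_name: will hold the filtered results
    "customer_id",      // column_name in 'orders'
    "vip_customers",    // source_table_name
    "id",               // source_table_column_name; type must match customer_id
    { filter_mode: "in_table" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.count + " records in the result view");
    }
);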
Filters objects in one table based on objects in another table. The user
must specify matching column types from the two tables (i.e. the target
table from which objects will be filtered and the source table based on
which the filter will be created); the column names need not be the same. If
a view_name is specified, then the filtered objects will then
be put in a newly created view. The operation is synchronous, meaning that a
response will not be returned until all objects are fully available in the
result view. The return value contains the count (i.e. the size) of the
resulting view.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Calculates which objects from a table have a particular value for a
particular column. The input parameters provide a way to specify either a
String or a Double valued column and a desired value for the column on which
the filter is performed. The operation is synchronous, meaning that a
response will not be returned until all the objects are fully available. The
response payload provides the count of the resulting set. A new result view
which satisfies the input filter restriction specification is also created
with a view name passed in as part of the input payload. Although this
functionality can also be accomplished with the standard filter function, it
is more efficient.
Parameters:
Name
Type
Description
table_name
String
Name of an existing table on which to perform
the calculation.
view_name
String
If provided, then this will be the name of the
view containing the results. Has the same naming
restrictions as tables.
is_string
Boolean
Indicates whether the value being searched for
is string or numeric.
value
Number
The value to search for.
value_str
String
The string value to search for.
column_name
String
Name of a column on which the filter by value
would be applied.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
view. If the collection provided is non-existent,
the collection will be automatically created. If
empty, then the newly created view will be
top-level.
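Example (a minimal sketch; the table and column names are hypothetical, and a connected GPUdb instance gpudb and the Node-style (err, response) callback convention are assumed; for a string search the numeric value argument is unused):
gpudb.filter_by_value(
    "events",         // table_name
    "error_events",   // view_name
    true,             // is_string: the searched column is a string column
    0,                // value: unused when is_string is true
    "ERROR",          // value_str: the string value to search for
    "severity",       // column_name on which the filter is applied
    {},               // options
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.count);
    }
);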
Calculates which objects from a table have a particular value for a
particular column. The input parameters provide a way to specify either a
String or a Double valued column and a desired value for the column on which
the filter is performed. The operation is synchronous, meaning that a
response will not be returned until all the objects are fully available. The
response payload provides the count of the resulting set. A new result view
which satisfies the input filter restriction specification is also created
with a view name passed in as part of the input payload. Although this
functionality can also be accomplished with the standard filter function, it
is more efficient.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Retrieves records as GeoJSON from a given table, optionally filtered by an expression
and/or sorted by a column. This operation can be performed on tables, views,
or on homogeneous collections (collections containing tables of all the same
type). Records can be returned encoded as binary, json or geojson.
This operation supports paging through the data via the offset
and limit parameters. Note that when paging through a table, if
the table (or the underlying table in case of a view) is updated (records
are inserted, deleted or modified) the records retrieved may differ between
calls based on the updates applied.
Parameters:
Name
Type
Description
table_name
String
Name of the table from which the records will be
fetched. Must be a table, view or homogeneous
collection.
offset
Number
A positive integer indicating the number of initial
results to skip (this can be useful for paging
through the results).
limit
Number
A positive integer indicating the maximum number of
results to be returned. Or END_OF_SET (-9999) to
indicate that the max number of results should be
returned.
options
Object
'expression': Optional filter
expression to apply to the table.
'fast_index_lookup': Indicates if
indexes should be used to perform the lookup for a
given expression if possible. Only applicable if
there is no sorting, the expression contains only
equivalence comparisons based on existing table
indexes and the range of requested values is from
[0 to END_OF_SET].
Supported values:
'true'
'false'
The default value is 'true'.
'sort_by': Optional column that the
data should be sorted by. Empty by default (i.e. no
sorting is applied).
'sort_order': String indicating how
the returned values should be sorted - ascending or
descending. If sort_order is provided, sort_by has
to be provided.
Supported values:
'ascending'
'descending'
The default value is 'ascending'.
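Example (a minimal paging sketch; it assumes this operation is exposed as get_geo_json, that gpudb is a connected instance, that the table 'roads' and its columns are hypothetical, and that the Node-style (err, response) callback convention applies):
gpudb.get_geo_json(
    "roads",   // table_name
    0,         // offset: skip no initial results
    1000,      // limit: return at most 1000 records
    {          // sort_order requires sort_by to be provided
        expression: "length > 100",
        sort_by: "length",
        sort_order: "descending"
    },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);  // GeoJSON-encoded records
    }
);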
Retrieves records from a given table, optionally filtered by an expression
and/or sorted by a column. This operation can be performed on tables, views,
or on homogeneous collections (collections containing tables of all the same
type). Records can be returned encoded as binary, json or geojson.
This operation supports paging through the data via the offset
and limit parameters. Note that when paging through a table, if
the table (or the underlying table in case of a view) is updated (records
are inserted, deleted or modified) the records retrieved may differ between
calls based on the updates applied.
Parameters:
Name
Type
Description
table_name
String
Name of the table from which the records will be
fetched. Must be a table, view or homogeneous
collection.
offset
Number
A positive integer indicating the number of initial
results to skip (this can be useful for paging
through the results).
limit
Number
A positive integer indicating the maximum number of
results to be returned. Or END_OF_SET (-9999) to
indicate that the max number of results should be
returned.
options
Object
'expression': Optional filter
expression to apply to the table.
'fast_index_lookup': Indicates if
indexes should be used to perform the lookup for a
given expression if possible. Only applicable if
there is no sorting, the expression contains only
equivalence comparisons based on existing table
indexes and the range of requested values is from
[0 to END_OF_SET].
Supported values:
'true'
'false'
The default value is 'true'.
'sort_by': Optional column that the
data should be sorted by. Empty by default (i.e. no
sorting is applied).
'sort_order': String indicating how
the returned values should be sorted - ascending or
descending. If sort_order is provided, sort_by has
to be provided.
Supported values:
'ascending'
'descending'
The default value is 'ascending'.
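Example (a minimal sketch; the table and expression are hypothetical, and a connected GPUdb instance gpudb, the Node-style (err, response) callback convention, and the decoded response.data field are assumptions):
gpudb.get_records(
    "weather",   // table_name
    0,           // offset
    100,         // limit
    { expression: "temperature > 30", sort_by: "timestamp" },
    function(err, response) {
        if (err) { console.error(err); return; }
        response.data.forEach(function(record) {  // decoded records (assumed field)
            console.log(record);
        });
    }
);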
For a given table, retrieves the values from the requested column(s). Maps
of column name to the array of values as well as the column data type are
returned. This endpoint supports pagination with the offset and
limit parameters.
When using pagination, if the table (or the underlying table in the case of
a view) is modified (records are inserted, updated, or deleted) during a
call to the endpoint, the records or values retrieved may differ between
calls based on the type of the update, e.g., the contiguity across pages
cannot be relied upon.
Parameters:
Name
Type
Description
table_name
String
Name of the table on which this operation will
be performed. The table cannot be a parent set.
column_names
Array.<String>
The list of column values to retrieve.
offset
Number
A positive integer indicating the number of initial
results to skip (this can be useful for paging
through the results).
limit
Number
A positive integer indicating the maximum number of
results to be returned (if not provided the default
is 10000), or END_OF_SET (-9999) to indicate that the
maximum number of results allowed by the server
should be returned.
options
Object
'expression': Optional filter
expression to apply to the table.
'sort_by': Optional column(s) that the
data should be sorted by. Empty by default (i.e. no
sorting is applied).
'sort_order': String indicating how
the returned values should be sorted - ascending or
descending. If sort_order is provided, sort_by has
to be provided.
Supported values:
'ascending'
'descending'
The default value is 'ascending'.
'order_by': Comma-separated list of
the columns to be sorted by; e.g. 'timestamp asc, x
desc'. The default value is ''.
'convert_wkts_to_wkbs': If true, then
WKT string columns will be returned as WKB bytes.
Supported values:
'true'
'false'
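Example (a minimal sketch; the table and column names are hypothetical, and a connected GPUdb instance gpudb, the Node-style (err, response) callback convention, and the response.data field are assumptions):
gpudb.get_records_by_column(
    "trades",              // table_name
    ["symbol", "price"],   // column_names to retrieve
    0,                     // offset
    10000,                 // limit (the documented default)
    { sort_by: "price", sort_order: "descending" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.data);  // map of column name to array of values (assumed field)
    }
);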
For a given table, retrieves the values from the requested column(s). Maps
of column name to the array of values as well as the column data type are
returned. This endpoint supports pagination with the offset and
limit parameters.
When using pagination, if the table (or the underlying table in the case of
a view) is modified (records are inserted, updated, or deleted) during a
call to the endpoint, the records or values retrieved may differ between
calls based on the type of the update, e.g., the contiguity across pages
cannot be relied upon.
Retrieves the complete series/track records from the given
world_table_name based on the partial track information
contained in the table_name.
This operation supports paging through the data via the offset
and limit parameters.
In contrast to GPUdb#get_records this returns records grouped
by series/track. So if offset is 0 and limit is 5
this operation would return the first 5 series/tracks in
table_name. Each series/track will be returned sorted by their
TIMESTAMP column.
Parameters:
Name
Type
Description
table_name
String
Name of the collection/table/view for which
series/tracks will be fetched.
world_table_name
String
Name of the table containing the complete
series/track information to be returned
for the tracks present in the
table_name. Typically this is
used when retrieving series/tracks from a
view (which contains partial
series/tracks) but the user wants to
retrieve the entire original
series/tracks. Can be blank.
offset
Number
A positive integer indicating the number of initial
series/tracks to skip (useful for paging through the
results).
limit
Number
A positive integer indicating the maximum number of
series/tracks to be returned. Or END_OF_SET (-9999)
to indicate that the max number of results should be
returned.
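Example (a minimal sketch; the view and table names are hypothetical, and a connected GPUdb instance gpudb, the trailing options argument, the Node-style (err, response) callback convention, and the response.data field are assumptions):
gpudb.get_records_by_series(
    "recent_flights",   // table_name: view containing partial tracks
    "flights",          // world_table_name: table with the complete tracks
    0,                  // offset: start at the first series/track
    5,                  // limit: return the first 5 series/tracks
    {},                 // options (assumed trailing parameter)
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.data);  // records grouped by series/track (assumed field)
    }
);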
Retrieves the complete series/track records from the given
world_table_name based on the partial track information
contained in the table_name.
This operation supports paging through the data via the offset
and limit parameters.
In contrast to GPUdb#get_records this returns records grouped
by series/track. So if offset is 0 and limit is 5
this operation would return the first 5 series/tracks in
table_name. Each series/track will be returned sorted by their
TIMESTAMP column.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Retrieves records from a collection. The operation can optionally return the
record IDs which can be used in certain queries such as
GPUdb#delete_records.
This operation supports paging through the data via the offset
and limit parameters.
Note that when using the Java API, it is not possible to retrieve records
from join tables using this operation.
Parameters:
Name
Type
Description
table_name
String
Name of the collection or table from which
records are to be retrieved. Must be an existing
collection or table.
offset
Number
A positive integer indicating the number of initial
results to skip (this can be useful for paging
through the results).
limit
Number
A positive integer indicating the maximum number of
results to be returned, or END_OF_SET (-9999) to
indicate that the max number of results should be
returned.
options
Object
'return_record_ids': If 'true' then
return the internal record ID along with each
returned record. Default is 'false'.
Supported values:
'true'
'false'
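Example (a minimal sketch; the collection name is hypothetical, and a connected GPUdb instance gpudb, the Node-style (err, response) callback convention, and the response field name are assumptions):
gpudb.get_records_from_collection(
    "vehicle_data",   // table_name: an existing collection or table
    0,                // offset
    100,              // limit
    { return_record_ids: "true" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.record_ids);  // internal record IDs (assumed field)
    }
);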
Retrieves records from a collection. The operation can optionally return the
record IDs which can be used in certain queries such as
GPUdb#delete_records.
This operation supports paging through the data via the offset
and limit parameters.
Note that when using the Java API, it is not possible to retrieve records
from join tables using this operation.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Response object containing the method_codes of the
operation.
Type
Object
get_records_request(request, callback) → {Object}
Retrieves records from a given table, optionally filtered by an expression
and/or sorted by a column. This operation can be performed on tables, views,
or on homogeneous collections (collections containing tables of all the same
type). Records can be returned encoded as binary, json or geojson.
This operation supports paging through the data via the offset
and limit parameters. Note that when paging through a table, if
the table (or the underlying table in case of a view) is updated (records
are inserted, deleted or modified) the records retrieved may differ between
calls based on the updates applied.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Grants a table-level permission to a user or role.
Parameters:
Name
Type
Description
name
String
Name of the user or role to which the permission will
be granted. Must be an existing user or role.
permission
String
Permission to grant to the user or role.
Supported values:
'table_admin': Full read/write and
administrative access to the table.
'table_insert': Insert access to
the table.
'table_update': Update access to
the table.
'table_delete': Delete access to
the table.
'table_read': Read access to the
table.
table_name
String
Name of the table to which the permission grants
access. Must be an existing table, collection,
or view. If a collection, the permission also
applies to tables and views in the collection.
Adds multiple records to the specified table. The operation is synchronous,
meaning that a response will not be returned until all the records are fully
inserted and available. The response payload provides the counts of the
number of records actually inserted and/or updated, and can provide the
unique identifier of each added record.
The options parameter can be used to customize this function's
behavior.
The update_on_existing_pk option specifies the record collision
policy for inserting into a table with a primary
key, but is ignored if no primary key exists.
The return_record_ids option indicates that the database should
return the unique identifiers of inserted records.
Parameters:
Name
Type
Description
table_name
String
Table to which the records are to be added. Must
be an existing table.
data
Array.<Object>
An array of JSON encoded data for the records to be
added. All records must be of the same type as that
of the table. Empty array if
list_encoding is binary.
options
Object
Optional parameters.
'update_on_existing_pk': Specifies the
record collision policy for inserting into a table
with a primary key. If set to
true, any existing table record with
primary key values that match those of a record
being inserted will be replaced by that new record.
If set to false, any existing table
record with primary key values that match those of
a record being inserted will remain unchanged and
the new record discarded. If the specified table
does not have a primary key, then this option is
ignored.
Supported values:
'true'
'false'
The default value is 'false'.
'return_record_ids': If
true then return the internal record
ID along with each inserted record.
Supported values:
'true'
'false'
The default value is 'false'.
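Example (a minimal sketch; the table and its columns are hypothetical, and a connected GPUdb instance gpudb and the Node-style (err, response) callback convention are assumed; the records must match the table's type):
gpudb.insert_records(
    "events",   // table_name: must be an existing table
    [           // data: JSON-encoded records of the table's type
        { timestamp: 1497000000000, severity: "INFO",  msg: "started" },
        { timestamp: 1497000001000, severity: "ERROR", msg: "failed" }
    ],
    { update_on_existing_pk: "false", return_record_ids: "true" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.count_inserted, response.count_updated);
    }
);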
Generates a specified number of random records and adds them to the given
table. There is an optional parameter that allows the user to customize the
ranges of the column values. It also allows the user to specify linear
profiles for some or all columns in which case linear values are generated
rather than random ones. Only individual tables are supported for this
operation.
This operation is synchronous, meaning that a response will not be returned
until all random records are fully available.
Parameters:
Name
Type
Description
table_name
String
Table to which random records will be added.
Must be an existing table. Also, must be an
individual table, not a collection of tables,
nor a view of a table.
count
Number
Number of records to generate.
options
Object
Optional parameter to pass in specifications for
the randomness of the values. This map is
different from the *options* parameter of most
other endpoints in that it is a map of string to
map of string to doubles, while most others are
maps of string to string. In this map, the top
level keys represent which column's parameters are
being specified, while the internal keys represents
which parameter is being specified. These
parameters take on different meanings depending on
the type of the column. Below follows a more
detailed description of the map:
'seed': If provided, the internal
random number generator will be initialized with
the given value. The minimum is 0. This allows
for the same set of random numbers to be generated
across invocations of this endpoint in case the user
wants to repeat the test. Since
options is a map of maps, we need an
internal map to provide the seed value. For
example, to pass 100 as the seed value through this
parameter, you need something equivalent to:
'options' = {'seed': { 'value': 100 } }
'value': Pass the seed value here.
'all': This key indicates that the
specifications relayed in the internal map are to
be applied to all columns of the records.
'min': For numerical columns, the
minimum of the generated values is set to this
value. Default is -99999. For point, shape, and
track columns, min for numeric 'x' and 'y' columns
needs to be within [-180, 180] and [-90, 90],
respectively. The default minimum possible values
for these columns in such cases are -180.0 and
-90.0. For the 'TIMESTAMP' column, the default
minimum corresponds to Jan 1, 2010.
For string columns, the minimum length of the
randomly generated strings is set to this value
(default is 0). If both minimum and maximum are
provided, minimum must be less than or equal to
max. Value needs to be within [0, 200].
If the min is outside the accepted ranges for
string columns and 'x' and 'y' columns for
point/shape/track, then those parameters will not
be set; however, an error will not be thrown in
such a case. It is the responsibility of the user
to use the all parameter judiciously.
'max': For numerical columns, the
maximum of the generated values is set to this
value. Default is 99999. For point, shape, and
track columns, max for numeric 'x' and 'y' columns
needs to be within [-180, 180] and [-90, 90],
respectively. The default maximum possible values
for these columns in such cases are 180.0 and 90.0.
For string columns, the maximum length of the
randomly generated strings is set to this value
(default is 200). If both minimum and maximum are
provided, *max* must be greater than or equal to
*min*. Value needs to be within [0, 200].
If the *max* is outside the accepted ranges for
string columns and 'x' and 'y' columns for
point/shape/track, then those parameters will not
be set; however, an error will not be thrown in
such a case. It is the responsibility of the user
to use the all parameter judiciously.
'interval': If specified, generate
values for all columns evenly spaced with the given
interval value. If a max value is specified for a
given column the data is randomly generated between
min and max and decimated down to the interval. If
no max is provided the data is linearly generated
starting at the minimum value (instead of
generating random data). For non-decimated
string-type columns the interval value is ignored.
Instead the values are generated following the
pattern: 'attrname_creationIndex#', i.e. the column
name suffixed with an underscore and a running
counter (starting at 0). For string types with
limited size (e.g., char4) the prefix is dropped. No
nulls will be generated for nullable columns.
'null_percentage': If specified, then
generate the given percentage of the count as nulls
for all nullable columns. This option will be
ignored for non-nullable columns. The value must
be within the range [0, 1.0]. The default value is
5% (0.05).
'cardinality': If specified, limit the
randomly generated values to a fixed set. Not
allowed on a column with interval specified, and is
not applicable to WKT or Track-specific columns.
The value must be greater than 0. This option is
disabled by default.
'attr_name': Use the desired column
name in place of attr_name, and set
the following parameters for the column specified.
This overrides any parameter set by
all.
'min': For numerical columns, the
minimum of the generated values is set to this
value. Default is -99999. For point, shape, and
track columns, min for numeric 'x' and 'y' columns
needs to be within [-180, 180] and [-90, 90],
respectively. The default minimum possible values
for these columns in such cases are -180.0 and
-90.0. For the 'TIMESTAMP' column, the default
minimum corresponds to Jan 1, 2010.
For string columns, the minimum length of the
randomly generated strings is set to this value
(default is 0). If both minimum and maximum are
provided, minimum must be less than or equal to
max. Value needs to be within [0, 200].
If the min is outside the accepted ranges for
string columns and 'x' and 'y' columns for
point/shape/track, then those parameters will not
be set; however, an error will not be thrown in
such a case. It is the responsibility of the user
to use the all parameter judiciously.
'max': For numerical columns, the
maximum of the generated values is set to this
value. Default is 99999. For point, shape, and
track columns, max for numeric 'x' and 'y' columns
needs to be within [-180, 180] and [-90, 90],
respectively. The default maximum possible values
for these columns in such cases are 180.0 and 90.0.
For string columns, the maximum length of the
randomly generated strings is set to this value
(default is 200). If both minimum and maximum are
provided, *max* must be greater than or equal to
*min*. Value needs to be within [0, 200].
If the *max* is outside the accepted ranges for
string columns and 'x' and 'y' columns for
point/shape/track, then those parameters will not
be set; however, an error will not be thrown in
such a case. It is the responsibility of the user
to use the all parameter judiciously.
'interval': If specified, generate
values for all columns evenly spaced with the given
interval value. If a max value is specified for a
given column the data is randomly generated between
min and max and decimated down to the interval. If
no max is provided the data is linearly generated
starting at the minimum value (instead of
generating random data). For non-decimated
string-type columns the interval value is ignored.
Instead the values are generated following the
pattern: 'attrname_creationIndex#', i.e. the column
name suffixed with an underscore and a running
counter (starting at 0). For string types with
limited size (e.g., char4) the prefix is dropped. No
nulls will be generated for nullable columns.
'null_percentage': If specified and if
this column is nullable, then generate the given
percentage of the count as nulls. This option will
result in an error if the column is not nullable.
The value must be within the range [0, 1.0]. The
default value is 5% (0.05).
'cardinality': If specified, limit the
randomly generated values to a fixed set. Not
allowed on a column with interval specified, and is
not applicable to WKT or Track-specific columns.
The value must be greater than 0. This option is
disabled by default.
'track_length': This key-map pair is
only valid for track data sets (an error is thrown
otherwise). No nulls would be generated for
nullable columns.
'min': Minimum possible length for
generated series; default is 100 records per
series. Must be an integral value within the range
[1, 500]. If both min and max are specified, min
must be less than or equal to max.
'max': Maximum possible length for
generated series; default is 500 records per
series. Must be an integral value within the range
[1, 500]. If both min and max are specified, max
must be greater than or equal to min.
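Example (a minimal sketch illustrating the map-of-maps options format; the table and column names are hypothetical, and a connected GPUdb instance gpudb, the Node-style (err, response) callback convention, and the response field name are assumptions):
gpudb.insert_records_random(
    "test_points",   // table_name: an existing individual table
    1000,            // count: number of records to generate
    {                // map of column name (or 'seed'/'all') to a map of doubles
        seed: { value: 100 },            // repeatable pseudo-random sequence
        all:  { null_percentage: 0.0 },  // no nulls in nullable columns
        x:    { min: -180, max: 180 },   // per-column override for column 'x'
        y:    { min: -90,  max: 90 }
    },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.count + " random records generated");  // assumed field
    }
);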
Generates a specified number of random records and adds them to the given
table. There is an optional parameter that allows the user to customize the
ranges of the column values. It also allows the user to specify linear
profiles for some or all columns in which case linear values are generated
rather than random ones. Only individual tables are supported for this
operation.
This operation is synchronous, meaning that a response will not be returned
until all random records are fully available.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Adds multiple records to the specified table. The operation is synchronous,
meaning that a response will not be returned until all the records are fully
inserted and available. The response payload provides the counts of the
number of records actually inserted and/or updated, and can provide the
unique identifier of each added record.
The options parameter can be used to customize this function's
behavior.
The update_on_existing_pk option specifies the record collision
policy for inserting into a table with a primary
key, but is ignored if no primary key exists.
The return_record_ids option indicates that the database should
return the unique identifiers of inserted records.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Adds a symbol or icon (i.e. an image) to represent data points when data is
rendered visually. Users must provide the symbol identifier (string), a
format (currently supported: 'svg' and 'svg_path'), the data for the symbol,
and any additional optional parameters (e.g. color). To have a symbol used
for rendering, create a table with a string column named 'SYMBOLCODE' (along
with 'x' or 'y', for example). Then, when the table is rendered (via WMS)
with the 'dosymbology' parameter set to 'true', the value of the 'SYMBOLCODE'
column is used to pick the symbol displayed for each point.
Parameters:
Name
Type
Description
symbol_id
String
The id of the symbol being added. This is the
same id that should be in the 'SYMBOLCODE' column
for objects using this symbol
symbol_format
String
Specifies the symbol format. Must be either
'svg' or 'svg_path'.
Supported values:
'svg'
'svg_path'
symbol_data
String
The actual symbol data. If
symbol_format is 'svg' then this
should be the raw bytes representing an svg
file. If symbol_format is svg path
then this should be an svg path string, for
example:
'M25.979,12.896,5.979,12.896,5.979,19.562,25.979,19.562z'
options
Object
Optional parameters.
'color': If symbol_format
is 'svg' this is ignored. If
symbol_format is 'svg_path' then this
option specifies the color (in RRGGBB hex format)
of the path. For example, to have the path rendered
in red, use 'FF0000'. If 'color' is not provided
then '00FF00' (i.e. green) is used by default.
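Example (a minimal sketch; the symbol ID is hypothetical, the path string is the one from the description above, and a connected GPUdb instance gpudb and the Node-style (err, response) callback convention are assumed):
gpudb.insert_symbol(
    "red_bar",    // symbol_id: matches values stored in the 'SYMBOLCODE' column
    "svg_path",   // symbol_format
    "M25.979,12.896,5.979,12.896,5.979,19.562,25.979,19.562z",  // symbol_data
    { color: "FF0000" },  // render the path in red
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log("symbol registered");
    }
);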
Adds a symbol or icon (i.e. an image) to represent data points when data is
rendered visually. Users must provide the symbol identifier (string), a
format (currently supported: 'svg' and 'svg_path'), the data for the symbol,
and any additional optional parameters (e.g. color). To have a symbol used
for rendering, create a table with a string column named 'SYMBOLCODE' (along
with 'x' or 'y', for example). Then, when the table is rendered (via WMS)
with the 'dosymbology' parameter set to 'true', the value of the 'SYMBOLCODE'
column is used to pick the symbol displayed for each point.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Response object containing the method_codes of the
operation.
Type
Object
kill_proc(run_id, options, callback) → {Object}
Kills a running proc instance.
Parameters:
Name
Type
Description
run_id
String
The run ID of the running proc instance. If the run
ID is not found or the proc instance has already
completed, this does nothing. If not specified, all
running proc instances will be killed.
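Example (a minimal sketch; the run ID is hypothetical, and a connected GPUdb instance gpudb, the Node-style (err, response) callback convention, and the response field name are assumptions):
gpudb.kill_proc(
    "12345",   // run_id: as returned by GPUdb#execute_proc
    {},        // options
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.run_ids);  // run IDs that were killed (assumed field)
    }
);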
Manages global access to a table's data. By default a table has a
lock_type of read_write, indicating all operations
are permitted. A user may request a read_only or a
write_only lock, after which only read or write operations,
respectively, are permitted on the table until the lock is removed. When
lock_type is no_access then no operations are
permitted on the table. The lock status can be queried by setting
lock_type to status.
Parameters:
Name
Type
Description
table_name
String
Name of the table to be locked. It must be a
currently existing table, collection, or view.
lock_type
String
The type of lock being applied to the table.
Setting it to status will return the
current lock status of the table without changing
it.
Supported values:
'status'
'read_only'
'write_only'
'read_write'
'no_access'
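Example (a minimal sketch; the table name is hypothetical, and a connected GPUdb instance gpudb, the trailing options argument, the Node-style (err, response) callback convention, and the response field name are assumptions):
gpudb.lock_table(
    "accounts",    // table_name: must be an existing table, collection, or view
    "read_only",   // lock_type: only reads permitted until the lock is removed
    {},            // options (assumed trailing parameter)
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.lock_type);  // the lock now in effect (assumed field)
    }
);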
Response object containing the method_codes of the
operation.
Type
Object
lock_table_request(request, callback) → {Object}
Manages global access to a table's data. By default a table has a
lock_type of read_write, indicating all operations
are permitted. A user may request a read_only or a
write_only lock, after which only read or write operations,
respectively, are permitted on the table until the lock is removed. When
lock_type is no_access then no operations are
permitted on the table. The lock status can be queried by setting
lock_type to status.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Create a new empty result table (specified by table_name), and
insert all records from source tables (specified by
source_table_names) based on the field mapping information
(specified by field_maps).
For merge records details and examples, see Merge Records.
For limitations, see Merge Records Limitations and Cautions.
The field map (specified by field_maps) holds the
user-specified maps of target table column names to source table columns.
The array of field_maps must match one-to-one with the
source_table_names, i.e., there's a map present in
field_maps for each table listed in
source_table_names.
Parameters:
Name
Type
Description
table_name
String
The new result table name for the records to be
merged. Must NOT be an existing table.
source_table_names
Array.<String>
The list of source table names to get
the records from. Must be existing
table names.
field_maps
Array.<Object>
Contains a list of source/target column
mappings, one mapping for each source table
listed in source_table_names
being merged into the target table specified
by table_name. Each mapping
contains the target column names (as keys)
that the data in the mapped source columns or
column expressions (as values) will
be merged into. All of the source columns
being merged into a given target column must
match in type, as that type will determine the
type of the new target column.
options
Object
Optional parameters.
'collection_name': Name of a
collection which is to contain the newly created
merged table specified by table_name.
If the collection provided is non-existent, the
collection will be automatically created. If empty,
then the newly created merged table will be a
top-level table.
'is_replicated': Indicates the distribution scheme for the data
of the merged table specified in
table_name. If true, the table will
be replicated. If false, the table
will be randomly sharded.
Supported values:
'true'
'false'
The default value is 'false'.
'ttl': Sets the TTL of the merged table specified
in table_name.
'persist': If true, then
the table specified in table_name will
be persisted and will not expire unless a
ttl is specified. If
false, then the table will be an
in-memory table and will expire unless a
ttl is specified.
Supported values:
'true'
'false'
The default value is 'true'.
'chunk_size': Indicates the chunk size
to be used for the merged table specified in
table_name.
'view_id': view this result table is
part of. The default value is ''.
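Example (a minimal sketch; all table and column names are hypothetical, and a connected GPUdb instance gpudb and the Node-style (err, response) callback convention are assumed; field_maps holds one map per source table, keyed by target column name):
gpudb.merge_records(
    "all_vehicles",       // table_name: new result table, must NOT exist
    ["cars", "trucks"],   // source_table_names
    [
        { id: "car_id",   kind: "'car'"   },   // map for 'cars'
        { id: "truck_id", kind: "'truck'" }    // map for 'trucks'
    ],
    { is_replicated: "false", persist: "true" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log("merged table created");
    }
);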
Create a new empty result table (specified by table_name), and
insert all records from source tables (specified by
source_table_names) based on the field mapping information
(specified by field_maps).
For merge records details and examples, see Merge Records.
For limitations, see Merge Records Limitations and Cautions.
The field map (specified by field_maps) holds the
user-specified maps of target table column names to source table columns.
The array of field_maps must match one-to-one with the
source_table_names, i.e., there's a map present in
field_maps for each table listed in
source_table_names.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Parameters:
Name
Type
Description
proc_name
String
Name of the proc to show information about. If
specified, must be the name of a currently
existing proc. If not specified, information
about all procs will be returned.
options
Object
Optional parameters.
'include_files': If set to
true, the files that make up the proc
will be returned. If set to false, the
files will not be returned.
Supported values:
'true'
'false'
Shows the statuses of running or completed proc instances. Results are
grouped by run ID (as returned from GPUdb#execute_proc) and
data segment ID (each invocation of the proc command on a data segment is
assigned a data segment ID).
Parameters:
Name
Type
Description
run_id
String
The run ID of a specific running or completed proc
instance for which the status will be returned. If
the run ID is not found, nothing will be returned.
If not specified, the statuses of all running and
completed proc instances will be returned.
options
Object
Optional parameters.
'clear_complete': If set to
true, if a proc instance has completed
(either successfully or unsuccessfully) then its
status will be cleared and no longer returned in
subsequent calls.
Supported values:
'true'
'false'
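Example (a minimal sketch; a connected GPUdb instance gpudb, the empty run_id meaning "all instances", the Node-style (err, response) callback convention, and the response field name are assumptions):
gpudb.show_proc_status(
    "",                           // run_id: empty returns all proc instances
    { clear_complete: "true" },   // clear completed statuses after this call
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.statuses);  // status per run ID / data segment ID (assumed field)
    }
);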
Shows the statuses of running or completed proc instances. Results are
grouped by run ID (as returned from GPUdb#execute_proc) and
data segment ID (each invocation of the proc command on a data segment is
assigned a data segment ID).
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Shows security information relating to users and/or roles. If the caller is
not a system administrator, only information relating to the caller and
their roles is returned.
Parameters:
Name
Type
Description
names
Array.<String>
A list of names of users and/or roles about which
security information is requested. If none are
provided, information about all users and roles
will be returned.
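Example (a minimal sketch; the user and role names are hypothetical, and a connected GPUdb instance gpudb, the trailing options argument, and the Node-style (err, response) callback convention are assumptions):
gpudb.show_security(
    ["alice", "analyst_role"],   // names: users and/or roles of interest
    {},                          // options (assumed trailing parameter)
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response);   // security information for the given names
    }
);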
Shows security information relating to users and/or roles. If the caller is
not a system administrator, only information relating to the caller and
their roles is returned.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Response object containing the method_codes of the
operation.
Type
Object
show_system_timing(options, callback) → {Object}
Returns the last 100 database requests along with the request timing and
internal job id. The admin tool uses it to present request timing
information to the user.
Returns the last 100 database requests along with the request timing and
internal job id. The admin tool uses it to present request timing
information to the user.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Retrieves detailed information about tables, views, and collections.
If table_name specifies a table or view, information specific
to that entity will be returned.
If table_name specifies a collection, the call can return
information about either the collection itself (setting the
show_children option to false) or the tables and
views it contains (setting show_children to true).
If table_name is empty, information about all collections and
top-level tables and views can be returned. Note:
show_children must be set to true.
If table_name is '*', information about all tables,
collections, and views will be returned. Note: show_children
must be set to true.
If the option get_sizes is set to true, then the
sizes (objects and elements) of each table are returned (in
sizes and full_sizes), along with the total number
of objects in the requested table (in total_size and
total_full_size).
Parameters:
Name
Type
Description
table_name
String
Name of the table for which to retrieve the
information. If blank, then information about
all collections and top-level tables and views
is returned.
options
Object
Optional parameters.
'force_synchronous': If
true then the table sizes will wait
for a read lock before returning.
Supported values:
'true'
'false'
The default value is 'true'.
'get_sizes': If true then
the table sizes will be returned; blank, otherwise.
Supported values:
'true'
'false'
The default value is 'false'.
'show_children': If
table_name is a collection, then
true will return information about the
children of the collection, while
false will return information about
the collection itself.
If table_name is empty or '*', then
show_children must be
true (or not specified); otherwise, no
results will be returned.
Supported values:
'true'
'false'
The default value is 'true'.
'no_error_if_not_exists': If
false will return an error if the
provided table_name does not exist. If
true then it will return an empty
result.
Supported values:
'true'
'false'
The default value is 'false'.
'get_column_info': If
true then column info (memory usage,
etc) will be returned.
Supported values:
'true'
'false'
Response object containing the method_codes of the
operation.
Type
Object
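Example (a minimal sketch; a connected GPUdb instance gpudb, the Node-style (err, response) callback convention, and the response.table_names field are assumptions; a blank table_name with show_children enabled lists all collections and top-level tables and views):
gpudb.show_table(
    "",   // table_name: blank for all collections and top-level tables/views
    { get_sizes: "true", show_children: "true" },
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.table_names, response.total_size);
    }
);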
show_table_request(request, callback) → {Object}
Retrieves detailed information about tables, views, and collections.
If table_name specifies a table or view, information specific
to that entity will be returned.
If table_name specifies a collection, the call can return
information about either the collection itself (setting the
show_children option to false) or the tables and
views it contains (setting show_children to true).
If table_name is empty, information about all collections and
top-level tables and views can be returned. Note:
show_children must be set to true.
If table_name is '*', information about all tables,
collections, and views will be returned. Note: show_children
must be set to true.
If the option get_sizes is set to true, then the
sizes (objects and elements) of each table are returned (in
sizes and full_sizes), along with the total number
of objects in the requested table (in total_size and
total_full_size).
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Gets names of the tables whose type matches the given criteria. Each table
has a particular type. This type comprises the schema and properties of the
table and sometimes a type label. This function allows a look up of the
existing tables based on full or partial type information. The operation is
synchronous.
Gets names of the tables whose type matches the given criteria. Each table
has a particular type. This type comprises the schema and properties of the
table and sometimes a type label. This function allows a look up of the
existing tables based on full or partial type information. The operation is
synchronous.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Retrieves information for the specified data type ID or type label. For all
data types that match the input criteria, the database returns the type ID,
the type schema, the label (if available), and the type's column properties.
Response object containing the method_codes of the
operation.
Type
Object
show_types_request(request, callback) → {Object}
Retrieves information for the specified data type ID or type label. For all
data types that match the input criteria, the database returns the type ID,
the type schema, the label (if available), and the type's column properties.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
If a callback function is provided, the request will be submitted
asynchronously, and the result (either a response or an error) will be passed
to the callback function upon completion.
If a callback function is not provided, the request will be submitted
synchronously and the response returned directly, and an exception will be
thrown if an error occurs.
In either case the function will attempt to cycle through available
GPUdb instances as provided in the constructor if an error occurs with the
server.
Runs multiple predicate-based updates in a single call. With the list of
given expressions, any matching record's column values will be updated as
provided in new_values_maps. There is also an optional
'upsert' capability where if a particular predicate doesn't match any
existing record, then a new record can be inserted.
Note that this operation can only be run on an original table and not on a
collection or a result view.
This operation can update primary key values. By default only 'pure primary
key' predicates are allowed when updating primary key values. If the primary
key for a table is the column 'attr1', then the operation will only accept
predicates of the form: "attr1 == 'foo'" if the attr1 column is being
updated. For a composite primary key (e.g. columns 'attr1' and 'attr2')
then this operation will only accept predicates of the form: "(attr1 ==
'foo') and (attr2 == 'bar')". Meaning, all primary key columns must appear
in an equality predicate in the expressions. Furthermore each 'pure primary
key' predicate must be unique within a given request. These restrictions
can be removed by utilizing some available options through
options.
Parameters:
Name
Type
Description
table_name
String
Table to be updated. Must be a currently
existing table and not a collection or view.
expressions
Array.<String>
A list of the actual predicates, one for each
update; format should follow the guidelines
here.
new_values_maps
Array.<Object>
List of new values for the matching
records. Each element is a map with
(key, value) pairs where the keys are the
names of the columns whose values are to
be updated; the values are the new
values. The number of elements in the
list should match the length of
expressions.
data
Array.<Object>
An optional list of new json-avro encoded objects to
insert, one for each update, to be added to the set
if the particular update did not affect any objects.
options
Object
Optional parameters.
'global_expression': An optional
global expression to reduce the search space of the
predicates listed in expressions. The
default value is ''.
'bypass_safety_checks': When set to
true, all predicates are available for
primary key updates. Keep in mind that it is
possible to destroy data in this case, since a
single predicate may match multiple objects
(potentially all records of a table), and then
updating all of those records to have the same
primary key will, due to the primary key uniqueness
constraints, effectively delete all but one of
those updated records.
Supported values:
'true'
'false'
The default value is 'false'.
'update_on_existing_pk': Can be used
to customize behavior when the updated primary key
value already exists as described in
GPUdb#insert_records.
Supported values:
'true'
'false'
The default value is 'false'.
'use_expressions_in_new_values_maps':
When set to true, all new values in
new_values_maps are considered as
expression values. When set to false,
all new values in new_values_maps are
considered as constants. NOTE: When
true, string constants will need to be
quoted to avoid being evaluated as expressions.
Supported values:
'true'
'false'
The default value is 'false'.
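Example (a minimal sketch; the table, columns, and predicates are hypothetical, and a connected GPUdb instance gpudb, the Node-style (err, response) callback convention, and the response field name are assumptions):
gpudb.update_records(
    "employees",                // table_name: an original table, not a view
    ["id == 42", "id == 43"],   // expressions: one predicate per update
    [                           // new_values_maps: one map per expression
        { title: "Manager" },
        { title: "Director" }
    ],
    [],                         // data: no upsert records for unmatched predicates
    {},                         // options
    function(err, response) {
        if (err) { console.error(err); return; }
        console.log(response.count_updated);  // assumed field name
    }
);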
Updates the view specified by table_name to include full series
(track) information from the world_table_name for the series
(tracks) present in the view_name.
Parameters:
Name
Type
Description
table_name
String
Name of the view on which the update operation
will be performed. Must be an existing view.
world_table_name
String
Name of the table containing the complete
series (track) information.
view_name
String
Optional name of the view containing the series
(tracks) which have to be updated.
Updates the view specified by table_name to include full series
(track) information from the world_table_name for the series
(tracks) present in the view_name.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Runs multiple predicate-based updates in a single call. With the list of
given expressions, any matching record's column values will be updated as
provided in new_values_maps. There is also an optional
'upsert' capability where if a particular predicate doesn't match any
existing record, then a new record can be inserted.
Note that this operation can only be run on an original table and not on a
collection or a result view.
This operation can update primary key values. By default only 'pure primary
key' predicates are allowed when updating primary key values. If the primary
key for a table is the column 'attr1', then the operation will only accept
predicates of the form: "attr1 == 'foo'" if the attr1 column is being
updated. For a composite primary key (e.g. columns 'attr1' and 'attr2')
then this operation will only accept predicates of the form: "(attr1 ==
'foo') and (attr2 == 'bar')". Meaning, all primary key columns must appear
in an equality predicate in the expressions. Furthermore each 'pure primary
key' predicate must be unique within a given request. These restrictions
can be removed by utilizing some available options through
options.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.
Scatter plot is the only plot type currently supported. A non-numeric column
can be specified as x or y column and jitters can be added to them to avoid
excessive overlapping. All color values must be in the format RRGGBB or
AARRGGBB (to specify the alpha value).
The image is contained in the image_data field.
Parameters:
Name
Type
Description
table_name
String
Name of the table containing the data to be
drawn as a chart.
x_column_names
Array.<String>
Names of the columns containing the data
mapped to the x axis of a chart.
y_column_names
Array.<String>
Names of the columns containing the data
mapped to the y axis of a chart.
min_x
Number
Lower bound for the x column values. For non-numeric
x column, each x column item is mapped to an integral
value starting from 0.
max_x
Number
Upper bound for the x column values. For non-numeric
x column, each x column item is mapped to an integral
value starting from 0.
min_y
Number
Lower bound for the y column values. For non-numeric
y column, each y column item is mapped to an integral
value starting from 0.
max_y
Number
Upper bound for the y column values. For non-numeric
y column, each y column item is mapped to an integral
value starting from 0.
width
Number
Width of the generated image in pixels.
height
Number
Height of the generated image in pixels.
bg_color
String
Background color of the generated image.
style_options
Object
Rendering style options for a chart.
'pointcolor': The color of
points in the plot represented as a
hexadecimal number. The default value is
'0000FF'.
'pointsize': The size of points
in the plot represented as number of pixels.
The default value is '3'.
'pointshape': The shape of
points in the plot.
Supported values:
'none'
'circle'
'square'
'diamond'
'hollowcircle'
'hollowsquare'
'hollowdiamond'
The default value is 'square'.
'cb_pointcolors': Point color
class break information consisting of three
entries: class-break attribute, class-break
values/ranges, and point color values. This
option overrides the pointcolor option if
both are provided. Class-break ranges are
represented in the form of "min:max".
Class-break values/ranges and point color
values are separated by cb_delimiter, e.g.
{"price", "20:30;30:40;40:50",
"0xFF0000;0x00FF00;0x0000FF"}.
'cb_pointsizes': Point size
class break information consisting of three
entries: class-break attribute, class-break
values/ranges, and point size values. This
option overrides the pointsize option if both
are provided. Class-break ranges are
represented in the form of "min:max".
Class-break values/ranges and point size
values are separated by cb_delimiter, e.g.
{"states", "NY;TX;CA", "3;5;7"}.
'cb_pointshapes': Point shape
class break information consisting of three
entries: class-break attribute, class-break
values/ranges, and point shape names. This
option overrides the pointshape option if
both are provided. Class-break ranges are
represented in the form of "min:max".
Class-break values/ranges and point shape
names are separated by cb_delimiter, e.g.
{"states", "NY;TX;CA",
"circle;square;diamond"}.
'cb_delimiter': A character or
string which separates per-class values in a
class-break style option string. The default
value is ';'.
'x_order_by': An expression or
aggregate expression by which non-numeric x
column values are sorted, e.g. "avg(price)
descending".
'y_order_by': An expression or
aggregate expression by which non-numeric y
column values are sorted, e.g. "avg(price)",
which defaults to "avg(price) ascending".
'scale_type_x': Type of x axis
scale.
Supported values:
'none': No scale is applied to
the x axis.
'log': A base-10 log scale is
applied to the x axis.
The default value is 'none'.
'scale_type_y': Type of y axis
scale.
Supported values:
'none': No scale is applied to
the y axis.
'log': A base-10 log scale is
applied to the y axis.
The default value is 'none'.
'min_max_scaled': If this
option is set to "false", this endpoint
expects the request's min/max values are not yet
scaled. They will be scaled according to
scale_type_x or scale_type_y for the response. If
this option is set to "true", this endpoint
expects the request's min/max values are already
scaled according to
scale_type_x/scale_type_y. The response's min/max
values will be equal to the request's min/max
values. The default value is 'false'.
'jitter_x': Amplitude of
horizontal jitter applied to non-numeric x
column values. The default value is '0.0'.
'jitter_y': Amplitude of
vertical jitter applied to non-numeric y
column values. The default value is '0.0'.
'plot_all': If this option is
set to "true", all non-numeric column values
are plotted ignoring min_x, max_x, min_y and
max_y parameters. The default value is
'false'.
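Example (a minimal sketch; the table and column names are hypothetical, and a connected GPUdb instance gpudb, the trailing options argument, and the Node-style (err, response) callback convention are assumptions; the image_data response field is documented above):
gpudb.visualize_image_chart(
    "trades",     // table_name
    ["volume"],   // x_column_names
    ["price"],    // y_column_names
    0, 10000,     // min_x, max_x
    0, 500,       // min_y, max_y
    800, 600,     // width, height in pixels
    "FFFFFF",     // bg_color
    { pointcolor: "0000FF", pointsize: "3", pointshape: "circle" },  // style_options
    {},           // options (assumed trailing parameter)
    function(err, response) {
        if (err) { console.error(err); return; }
        // response.image_data holds the rendered chart
    }
);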
Scatter plot is the only plot type currently supported. A non-numeric column
can be specified as x or y column and jitters can be added to them to avoid
excessive overlapping. All color values must be in the format RRGGBB or
AARRGGBB (to specify the alpha value).
The image is contained in the image_data field.
Parameters:
Name
Type
Description
request
Object
Request object containing the parameters for the
operation.