batch_size |
Specifies number of records to process before inserting. |
column_formats |
For each target column specified, applies the column-property-bound format to the source data loaded into that column. Each column format will contain a mapping of one or more of its column properties to an appropriate format for each property. Currently supported column properties include date, time, & datetime. The parameter value must be formatted as a JSON string of maps of column names to maps of column properties to their corresponding column formats, e.g., { "order_date" : { "date" : "%Y.%m.%d" }, "order_time" : { "time" : "%H:%M:%S" } }. See default_column_formats for valid format syntax. |
columns_to_load |
For delimited_text file_type only. Specifies a comma-delimited list of column positions or names to load instead of loading all columns in the file(s); if more than one file is being loaded, the list of columns will apply to all files. Column numbers can be specified discretely or as a range, e.g., a value of '5,7,1..3' will create a table with the first column in the table being the fifth column in the file, followed by seventh column in the file, then the first column through the fourth column in the file. |
default_column_formats |
Specifies the default format to be applied to source data loaded into columns with the corresponding column property. This default column-property-bound format can be overridden by specifying a column property & format for a given target column in column_formats. For each specified annotation, the format will apply to all columns with that annotation unless a custom column_formats for that annotation is specified. The parameter value must be formatted as a JSON string that is a map of column properties to their respective column formats, e.g., { "date" : "%Y.%m.%d", "time" : "%H:%M:%S" }. Column formats are specified as a string of control characters and plain text. The supported control characters are 'Y', 'm', 'd', 'H', 'M', 'S', and 's', which follow the Linux 'strptime()' specification, as well as 's', which specifies seconds and fractional seconds (though the fractional component will be truncated past milliseconds). Formats for the 'date' annotation must include the 'Y', 'm', and 'd' control characters. Formats for the 'time' annotation must include the 'H', 'M', and either 'S' or 's' (but not both) control characters. Formats for the 'datetime' annotation meet both the 'date' and 'time' control character requirements. For example, '{"datetime" : "%m/%d/%Y %H:%M:%S" }' would be used to interpret text as "05/04/2000 12:12:11" |
dry_run |
If set to true, no data will be inserted but the file will be read with the applied error_handling mode and the number of valid records that would be normally inserted are returned. The default value is false. The supported values are:
|
error_handling |
Specifies how errors should be handled upon insertion. The default value is Permissive.
Supported Values |
Description |
permissive |
Records with missing columns are populated with nulls if possible; otherwise, the malformed records are skipped. |
ignore_bad_records |
Malformed records are skipped. |
abort |
Stops current insertion and aborts entire operation when an error is encountered. |
|
file_type |
File type for the file(s). The default value is delimited_text.
Supported
Values |
Description |
delimited_text |
Indicates the file(s) are in delimited text format, e.g., CSV, TSV, PSV, etc. |
|
loading_mode |
Specifies how to divide data loading among nodes. The default value is head.
Supported Values |
Description |
head |
The head node loads all data. All files must be available on the head node. |
distributed_shared |
The worker nodes coordinate loading a set of files that are available to all of them. All files must be available on all nodes. This option is best when there is a shared file system. |
distributed_local |
Each worker node loads all files that are available to it. This option is best when each worker node has its own file system. |
|
text_comment_string |
For delimited_text file_type only. All lines in the file(s) starting with the provided string are ignored. The comment string has no effect unless it appears at the beginning of a line. The default value is '#'. |
text_delimiter |
For delimited_text file_type only. Specifies the delimiter for values and columns in the header row (if present). Must be a single character. The default value is ','. |
text_escape_character |
For delimited_text file_type only. The character used in the file(s) to escape certain character sequences in text. For example, the escape character followed by a literal 'n' escapes to a newline character within the field. Can be used within quoted string to escape a quote character. An empty value for this option does not specify an escape character. |
text_has_header |
For delimited_text file_type only. Indicates whether the delimited text files have a header row. The default value is true. The supported values are:
|
text_header_property_delimiter |
For delimited_text file_type only. Specifies the delimiter for column properties in the header row (if present). Cannot be set to same value as text_delimiter. The default value is '|'. |
text_null_string |
For delimited_text file_type only. The value in the file(s) to treat as a null value in the database. The default value is ''. |
text_quote_character |
For delimited_text file_type only. The quote character used in the file(s), typically encompassing a field value. The character must appear at beginning and end of field to take effect. Delimiters within quoted fields are not treated as delimiters. Within a quoted field, double quotes (") can be used to escape a single literal quote character. To not have a quote character, specify an empty string (""). The default value is '"'. |
truncate_table |
If set to true, truncates the table specified by input parameter table_name prior to loading the file(s). The default value is false. The supported values are:
|
num_tasks_per_rank |
Optional: number of tasks for reading file per rank. Default will be external_file_reader_num_tasks |