- Azure (Microsoft blob storage)
- GCS (Google Cloud Storage)
- HDFS (Apache Hadoop Distributed File System)
- JDBC (Java Database Connectivity, using a user-supplied driver or one of the drivers on the supported list)
- Kafka (streaming feed)
  - Apache
  - Confluent
- S3 (Amazon S3 Bucket)
The following default hosts are used for Azure, GCS, & S3, but can be
overridden in the location parameter:
- Azure: <service_account_name>.blob.core.windows.net
- GCS: storage.googleapis.com
- S3: <region>.amazonaws.com
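As a rough illustration of how these defaults resolve, the sketch below maps each provider to its default host pattern and lets an explicit host take precedence. The helper `default_host` and its parameters are hypothetical names for this example only; they are not part of any Kinetica API.

```python
# Default host patterns, as listed above; placeholders are filled in per provider.
DEFAULT_HOSTS = {
    "azure": "{account}.blob.core.windows.net",
    "gcs": "storage.googleapis.com",
    "s3": "{region}.amazonaws.com",
}


def default_host(provider, account="", region="", override=None):
    """Return the host for a provider, preferring an explicit override."""
    if override:
        return override
    # Raises KeyError for providers that have no default host listed.
    pattern = DEFAULT_HOSTS[provider.lower()]
    return pattern.format(account=account, region=region)


print(default_host("s3", region="us-east-1"))
print(default_host("azure", account="myaccount"))
print(default_host("gcs", override="storage.example.internal"))
```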
Data sources can be used by:
- External tables (see also the CREATE EXTERNAL TABLE command in SQL)
- Insert records (from files) API calls (see also the LOAD DATA command in SQL)
By default, a data source is validated upon creation; creation fails
if an authorized connection cannot be established.
Managing Data Sources
A data source can be managed using the following API endpoint calls. For managing data sources in SQL, see CREATE DATA SOURCE.

| API Call | Description |
|---|---|
| /create/datasource | Creates a data source, given a location and connection information |
| /alter/datasource | Modifies the properties of a data source, validating the new connection |
| /drop/datasource | Removes the data source reference from the database; will not modify the external source data |
| /show/datasource | Outputs the data source properties; passwords are redacted |
| /grant/permission/datasource | Grants the permission for a user to connect to a data source |
| /revoke/permission/datasource | Revokes the permission for a user to connect to a data source |
Creating a Data Source
To create a data source, kin_ds, that connects to an Amazon S3 bucket,
kinetica_ds, in the US East (N. Virginia) region:
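A minimal sketch of such a request body for /create/datasource might look like the following. It assumes the endpoint accepts `name`, `location`, `user_name`, `password`, and an `options` map; the `"s3"` location value and the option keys `s3_bucket_name` and `s3_region` are assumptions, the builder function is a hypothetical helper, and the credentials are placeholders.

```python
import json


def build_s3_datasource_request(name, bucket, region, access_id, access_key):
    """Build a /create/datasource request body for an S3 bucket (assumed field names)."""
    return {
        "name": name,
        "location": "s3",              # assumed: provider only, so the default host applies
        "user_name": access_id,        # AWS Access ID
        "password": access_key,        # AWS Access Key
        "options": {
            "s3_bucket_name": bucket,  # assumed option key
            "s3_region": region,       # assumed option key
        },
    }


request = build_s3_datasource_request(
    name="kin_ds",
    bucket="kinetica_ds",
    region="us-east-1",                # US East (N. Virginia)
    access_id="<aws_access_id>",       # placeholder
    access_key="<aws_access_key>",     # placeholder
)
print(json.dumps(request, indent=2))
```

The resulting body would then be sent to the database's /create/datasource endpoint with whatever HTTP or API client is in use.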
For Amazon S3 connections, the user_name & password parameters refer
to the AWS Access ID & Key, respectively.
Provider-Specific Syntax
Several authentication schemes across multiple providers are supported.
Azure
GCS
HDFS
JDBC
Kafka (Apache)
The location can be a comma-delimited list of Kafka URLs to be
used for high availability; only one of them will be streamed from
at any given time.
Kafka (Confluent)
The location can be a comma-delimited list of Kafka URLs to be
used for high availability; only one of them will be streamed from
at any given time.
S3
Limitations
- Azure anonymous data sources are only supported when both the container and the contained objects allow anonymous access.
- HDFS systems with wire encryption are not supported.
- Kafka data sources require an associated credential object for authentication.
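To make the Kafka notes concrete (a comma-delimited location for high availability, and authentication through a credential object), here is a hedged sketch of a request body for /create/datasource. The `kafka://host:port` URL form, the option keys `kafka_topic_name` and `credential`, the builder function, and all example values are assumptions for illustration.

```python
def build_kafka_datasource_request(name, broker_urls, topic, credential=None):
    """Build a /create/datasource body for Kafka (assumed field and option names)."""
    if not broker_urls:
        raise ValueError("at least one Kafka URL is required")
    options = {"kafka_topic_name": topic}      # assumed option key
    if credential:
        # Authentication goes through a named credential object,
        # not through inline user_name/password values.
        options["credential"] = credential     # assumed option key
    return {
        "name": name,
        # Several URLs may be listed for high availability;
        # only one is streamed from at any given time.
        "location": ",".join(broker_urls),
        "user_name": "",
        "password": "",
        "options": options,
    }


kafka_request = build_kafka_datasource_request(
    name="kafka_ds",                           # hypothetical data source name
    broker_urls=[
        "kafka://broker1.example.com:9092",    # hypothetical brokers
        "kafka://broker2.example.com:9092",
    ],
    topic="orders",                            # hypothetical topic
    credential="kafka_cred",                   # hypothetical credential object name
)
print(kafka_request["location"])
```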