An ingest is a means for getting data into Kinetica to be used in models.
The Ingest Details page provides a detailed look at a given ingest, including configuration information, state, and a data preview (if the ingest has a destination table configured).
A batch ingest uses the Kinetica Input/Output (KIO) tool to ingest a set of data at once.
To create a batch ingest:
Provide a name for the ingest
Optionally, provide a description for the ingest, then click Next.
Select a Source Type.
Fill the rest of the Source and Destination fields, then click Next.
Important
All sources require a credential; consult Security for more information.
Review the summary, then click Create.
A streaming ingest uses Kafka to continuously stream data into Kinetica for use in AAW.
To create a streaming ingest:
Provide a name for the ingest
Optionally, provide a description for the ingest, then click Next.
Fill the Source and Destination fields, then click Next.
Important
All sources require a credential; consult Security for more information.
Review the summary, then click Create.
A BYOC (bring your own container) ingest uses a Docker container to continuously ingest data into Kinetica for use in AAW.
To create a BYOC ingest:
Provide a Docker container source URI, e.g.,
<repo-name>/<image-name>:<tag-name>
Tip
Optionally, click Inspect to preview container metadata and pre-populate some fields.
Provide a name for the ingest
Optionally, provide a description for the ingest
Add environment variable key value pairs:
Provide a destination table name for the ingest
Click Import
Datasets are used for providing the training and test data for models.
The Dataset Details page provides a detailed look at a given dataset, including configuration information and state.
To create a new dataset:
Provide a Name for the dataset.
Optionally, provide a Description.
Select a Source Table.
Note
The list of source tables is populated with tables in the Kinetica installation associated with this instance of AAW.
Select one or more Columns from the table, or click Select All to select all columns.
Optionally, provide a filter expression for the columns.
Click Create.
Feature Sets transform columns (features) from datasets using inline and relational transforms.
The Feature Set Details page provides a detailed look at a given feature set, including state, features, function, and lambda function information.
To create a new feature set:
Provide a Name for the feature set.
Optionally, provide a Description.
Select a transform type:
Inline Transforms -- Transforms static data using provided functions to prepare the data for use in models for all deployment types
Select an existing Dataset.
Create new features using one of the following methods:
Click Transform Feature to create new features one at a time.
Click the Advanced Input slider and paste a JSON-formatted feature list into the text field.
Tip
When first enabling Advanced Input, an example input is displayed to help get you started on writing a JSON-formatted feature list
Relational Transform -- Allows for constantly changing data via a materialized view to be used in models for continuous deployments
Click Create.