Ingests
An ingest is a means for getting data into Kinetica to be used in models.
- Point to + Add New Ingest then click one of the options to begin creating an ingest.
- If an ingest has an output table, click the output table name to view a preview of the table
- Type into Filter to filter down the ingests
- Click to display any archived ingests
- Click to refresh the table
- Click to export the table's values as JSON or CSV
- Click an existing ingest to display additional actions:
- Click View Selection to open the Ingest Details page
- Click Start to start the ingest job
- Click Export Entity to export the ingest as a JSON object
- Click Terminate to stop the ingest
- Click Clone to clone the ingest
- Click Archive to archive the ingest; it will be stopped (if not already) and hidden from the list of ingests
- Click Description / Config to review summary information for the ingest
Details
The Ingest Details page provides a detailed look at a given ingest, including configuration information, state, and a data preview (if the ingest has a destination table configured).
- Click Back to return to the Ingests page
- Click Start to start the ingest job (if it's not running currently)
- Click Export Entity to export the ingest as a JSON object
- Click Terminate to remove the ingest
- Click Clone to clone the ingest
- Click Archive to archive the ingest; it will be stopped (if not already) and hidden from the list of ingests
Batch
A batch ingest uses an internal mechanism to ingest a set of data at once.
To create a batch ingest:
Provide a name for the ingest
Optionally, provide a description for the ingest, then click Next.
Select a Source Type.
Fill the rest of the Source and Destination fields, then click Next.
Important
All sources require a credential; consult Security for more information.
Review the summary, then click Create.
Streaming
A streaming ingest uses Kafka to continuously stream data into Kinetica for use in AAW.
To create a streaming ingest:
Provide a name for the ingest
Optionally, provide a description for the ingest, then click Next.
Fill the Source and Destination fields, then click Next.
Important
All sources require a credential; consult Security for more information.
Review the summary, then click Create.
Import BYOC
A BYOC (bring your own container) ingest uses a Docker container to continuously ingest data into Kinetica for use in AAW.
To create a BYOC ingest:
Provide a Docker container source URI, e.g.,
<repo-name>/<image-name>:<tag-name>
Tip
Optionally, click Inspect to preview container metadata and pre-populate some fields.
Provide a name for the ingest
Optionally, provide a description for the ingest
Add environment variable key value pairs:
- Click Add to create as many environment variables as necessary
- Provide a key name for each environment variable
- Provide a value for each environment variable
Provide a destination table name for the ingest
Click Import
Datasets
Datasets are used for providing the training and test data for models.
- Click + Add New Dataset to begin the dataset setup process.
- Click the Source Table name to display the table data
- Type into Filter to filter down the datasets
- Click to display any archived datasets.
- Click to refresh the table
- Click to export the table's values as JSON or CSV
- Click an existing dataset to display additional actions:
- Click View Selection to open the Dataset Details page
- Click Preview Data to open a dataset preview
- Click Description / Config to review summary information for the dataset
- Click Clone to clone the dataset
- Click Export Entity to export the dataset as a JSON object
- Click Archive to archive the dataset; it will be hidden from the list of datasets
- Click Description / Config to review summary information for the dataset
Details
The Dataset Details page provides a detailed look at a given dataset, including configuration information and state.
- In the Configuration table, click View to view the columns for the dataset
- Click Back to return to the Datasets page
- Click Preview Data to open a dataset preview
- Click Clone to clone the dataset
- Click Export Entity to export the dataset as a JSON object
- Click Archive to archive the dataset; it will be hidden from the list of datasets
New Dataset
To create a new dataset:
Provide a Name for the dataset.
Optionally, provide a Description.
Select a Source Table.
Note
The list of source tables is populated with tables in the Kinetica installation associated with this instance of AAW.
Select one or more Columns from the table, or click Select All to select all columns.
Optionally, provide a filter expression for the columns.
Click Create.
Feature Sets
Feature Sets transform columns (features) from datasets using inline and relational transforms.
- Click + Add New Feature Set to begin the feature set setup process.
- Click the Dataset name to display the Dataset Details for the selected dataset
- Type into Filter to filter down the feature sets
- Click to display any archived feature sets.
- Click to refresh the table
- Click to export the table's values as JSON or CSV
- Click an existing feature set to display additional actions:
- Click View Selection to open the Feature Set Details page
- Click Clone to clone the feature set
- Click Export Entity to export the feature set as a JSON object
- Click Archive to archive the feature set; it will be hidden from the list of feature set
Details
The Feature Set Details page provides a detailed look at a given feature set, including state, features, function, and lambda function information.
- Click Back to return to the Feature Sets page
- Click Clone to clone the feature set
- Click Export Entity to export the feature set as a JSON object
- Click Archive to archive the feature set; it will be hidden from the list of feature set
New Feature Set
To create a new feature set:
Provide a Name for the feature set.
Optionally, provide a Description.
Select a transform type:
Inline Transforms -- Transforms static data using provided functions to prepare the data for use in models for all deployment types
Select an existing Dataset.
Create new features using one of the following methods:
Click Transform Feature to create new features one at a time.
- Provide a Feature Name.
- Select a Column from the selected dataset.
- Select one or more Function(s) to transform the column.
Click the Advanced Input slider and paste a JSON-formatted feature list into the text field.
Tip
When first enabling Advanced Input, an example input is displayed to help get you started on writing a JSON-formatted feature list
Relational Transform -- Allows for constantly changing data via a materialized view to be used in models for continuous deployments
- Provide a name for existing materialized view.
- Provide the name of a timestamp column in the materialized view.
Click Create.