Kinetica provides the Active Analytics Workbench (AAW) with the goal of simplifying and accelerating data science and machine learning in a scalable fashion. With AAW, users can ingest data, train models, make inferences (answers/output from models), and even audit models with a few endpoints (or clicks via the UI). The AAW package can be automatically installed via KAgent and coexists with the database--meaning easy access to one's data and GPUs. AAW leverages Kubernetes to deploy, train, and test models.
The AAW workflow is defined by four key concepts:
Data -- comprises ingests, datasets, and feature sets
Models -- functions, statistical models, regressions, data models, and more that are deployed to enable inferencing capabilities. AAW can deploy any number of replicas of the model, allowing for scalability and better resource management. AAW currently only supports Blackbox models, which are models where implementation details are abstracted and housed in Docker containers. Input and output are the only available interface; they also don't require a training dataset.
Important
Blackbox models rely on the Kinetica Blackbox SDK to fetch AAW-compatible output from a custom Blackbox. Kinetica can assist in Blackbox model container creation if necessary. Consult New Blackbox Model for more information.
Deployments represent all models that have been deployed. Deployed models can have inference tests run manually, automatically, or in batches depending on the type of deployment. Currently there are three types of deployments:
Audits represent the ability to audit a model deployment to ensure its training, testing, and inferencing are untampered. Audits can drill into specific inferences from a deployment and filter the inference by input parameter, process status, and more.
Kinetica recommends installing AAW using KAgent, as KAgent can install everything AAW requires and can preconfigure everything for the best out-of-the-box experience. Consult Cluster for AAW package install instructions using KAgent. Since AAW relies on Kubernetes, there are two official methods of setting up Kubernetes with KAgent:
After the installation is finished, a kml
service is available to manage the
AAW user interface and API.
Important
AAW will inherit existing login information from the cluster, so logging into the AAW user interface will use the same credentials as GAdmin
Kinetica recommends users opt for an external Kubernetes cluster for the following reasons:
Amazon Web Services (AWS) has an Elastic Kubernetes Service (EKS) and Microsoft Azure has an Azure Kubernetes Service that are fully managed services that can be used with AAW.
If users still wish to opt for the embedded Kubernetes setup, please consider the following limitations:
/opt/gpudb/kml/utils
that can assist with
some of the cluster managementImportant
AAW will inherit existing login information from the cluster, so logging into the AAW user interface will use the same credentials as GAdmin
All the main logs for the AAW service and API are located in
/opt/gpudb/kml/logs
. All logs for the AAW user interface are located
in /opt/gpudb/kml/ui/logs
.