FluxCD
This section describes how to use FluxCD for GitOps-based deployment of Dynamo inference graphs. GitOps enables you to manage your Dynamo deployments declaratively using Git as the source of truth. We’ll use the aggregated vLLM example to demonstrate the workflow.
Prerequisites
- A Kubernetes cluster with Dynamo Kubernetes Platform installed
- FluxCD installed in your cluster
- A Git repository to store your deployment configurations
Workflow Overview
The GitOps workflow for Dynamo deployments consists of three main steps:
- Build and push the Dynamo Operator
- Create and commit a DynamoGraphDeployment custom resource for initial deployment
- Update the graph by building a new version and updating the CR for subsequent updates
Step 1: Build and Push Dynamo Operator
First, follow to See Install Dynamo Kubernetes Platform.
Step 2: Create Initial Deployment
Create a new file in your Git repository (e.g., deployments/llm-agg.yaml) with the following content:
Commit and push this file to your Git repository. FluxCD will detect the new CR and create the initial Dynamo deployment in your cluster.
Step 3: Update Existing Deployment
To update your pipeline, just update the associated DynamoGraphDeployment CRD
The Dynamo operator will automatically reconcile it.
Monitoring the Deployment
You can monitor the deployment status using: