Frontend
The Dynamo Frontend is the API gateway for serving LLM inference requests. It provides OpenAI-compatible HTTP endpoints and KServe gRPC endpoints, handling request preprocessing, routing, and response formatting.
Feature Matrix
Quick Start
Prerequisites
- Dynamo platform installed
etcdandnats-server -jsrunning- At least one backend worker registered
HTTP Frontend
This starts an OpenAI-compatible HTTP server with integrated preprocessing and routing. Backends are auto-discovered when they call register_llm.
KServe gRPC Frontend
See the Frontend Guide for KServe-specific configuration and message formats.
Kubernetes
Configuration
See the Frontend Guide for full configuration options.