Installation#

This guide describes how to install AIBrix manifests in different platforms.

Currently, AIBrix installation does rely on other cloud specific features. It’s fully compatible with vanilla Kubernetes.

Install AIBrix on Cloud Kubernetes Clusters#

Attention

AIBrix will install Envoy Gateway and KubeRay in your environment. If you already have these components installed, you can use corresponding manifest to skip them.

Stable Version#

# Install component dependencies
kubectl create -f https://github.com/vllm-project/aibrix/releases/download/v0.3.0/aibrix-dependency-v0.3.0.yaml

# Install aibrix components
kubectl create -f https://github.com/vllm-project/aibrix/releases/download/v0.3.0/aibrix-core-v0.3.0.yaml

# For custom configurations
git clone https://github.com/vllm-project/aibrix.git
cd aibrix
kubectl apply -k config/overlays/release

Nightly Version#

# clone the latest repo
git clone https://github.com/vllm-project/aibrix.git
cd aibrix

# Install component dependencies
kubectl apply -k config/dependency --server-side
kubectl apply -k config/default

Install AIBrix in testing Environments#

Install Individual AIBrix Components#

Autoscaler#

kubectl apply -k config/standalone/autoscaler-controller/

Distributed Inference#

kubectl apply -k config/standalone/distributed-inference-controller/

Model Adapter(Lora)#

kubectl apply -k config/standalone/model-adapter-controller

KV Cache#

kubectl apply -k config/standalone/kv-cache-controller