Autoscaling# Overview of AIBrix Autoscaler# Autoscaling is crucial for deploying Large Language Model (LLM) services on Kubernetes (K8s), as timely scaling up handles peaks in request traffic, and scaling down conserves resources when demand wanes. Metric-based Autoscaling Optimizer-based Autoscaler