Build and run multi-tenant load and chaos experiments for LLM inference platforms, including skewed traffic mixes, burst events, and post-mortem report generation. Use when creating load generators, designing tenant profiles, validating shedding behavior, or analyzing breaking points from benchmark runs.
Run load testing in controlled phases:
scripts/chaos_runner.py as the base CLI for load scenarios.