Benchmark ATOM vLLM Performance

atom-vllm-benchmark-guidecommandsetup L1★104

What it does

Run and analyze ATOM vLLM benchmark performance across concurrency/throughput metrics

Best for

Performance A/B testing of ATOM vLLM plugin when precise, reproducible metrics across concurrency levels and throughput profiles are required.

Inputs

Outputs

Requires

Preconditions

Failure modes

· VRAM exhaustion = OOM during benchmark, incomplete results
· Server not ready = curl http://localhost:8000/v1/models fails
· Container reuse across concurrency points = timing artifacts, unreliable comparison
· Log level too verbose = rocm-smi output polluted, metrics hard to parse
· Recipe not found = fallback to standard config, may not be model-optimal

Trust signals

· Enforces fresh container per concurrency point (eliminates state carryover)
· Cites recipe-first policy (use model-specific settings if available)
· Includes environment checklist (cache clear, VRAM verification, log levels)
· Provides full benchmark matrix (6 concurrency levels × 3 ISL/OSL combinations)