vllm-deployment

Installation

SKILL.md

vLLM Model Serving and Inference

Quick Start

Docker (CPU)

docker run --rm -p 8000:8000 \
  --shm-size=4g \
  --cap-add SYS_NICE \
  --security-opt seccomp=unconfined \
  -e VLLM_CPU_KVCACHE_SPACE=4 \
  <vllm-cpu-image> \
  --model <model-name> \
  --dtype float32
# Access: http://localhost:8000

Docker (GPU)

Installs

Repository

stakpak/community-paks

GitHub Stars

First Seen

Feb 10, 2026

Security Audits

Gen Agent Trust HubFail

SocketPass

SnykWarn

vllm-deployment — stakpak/community-paks