vllm-deploy-simple
vLLM Simple Deployment
A simple skill to quickly install vLLM, start a server, and validate the OpenAI-compatible API.
What this skill does
This skill provides a streamlined workflow to:
- Detect hardware backend (NVIDIA CUDA, AMD ROCm, Google TPU, or CPU); see the detection sketch after this list
- Install vLLM with appropriate backend support
- Start the vLLM server with configurable model and port
- Test the OpenAI-compatible API endpoint
- Validate the deployment is working correctly (an end-to-end sketch follows the Prerequisites list)
- Support virtual environment isolation
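A minimal sketch of how the backend-detection step might work, assuming PyTorch is importable. The `torch_xla` check for TPU and the fallback order are illustrative assumptions, not the skill's exact logic:

```python
# Hypothetical backend detection: CUDA/ROCm via torch, TPU via torch_xla,
# CPU as the fallback. Illustrative only.
import importlib.util

import torch

def detect_backend() -> str:
    if torch.cuda.is_available():
        # PyTorch ROCm builds set torch.version.hip; CUDA builds leave it None.
        return "rocm" if torch.version.hip else "cuda"
    if importlib.util.find_spec("torch_xla") is not None:
        return "tpu"  # an installed torch_xla is a common TPU signal
    return "cpu"

print(detect_backend())
```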
Prerequisites
- Python 3.10+
- NVIDIA CUDA or AMD ROCm GPU (recommended), Google TPU, or CPU
- pip or uv package manager
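Once the prerequisites are in place, the whole workflow can be smoke-tested in a few lines. The sketch below is illustrative rather than the skill's implementation: the model name and port are placeholder defaults, and it assumes the `vllm`, `openai`, and `requests` packages are installed. `vllm serve` and the `/health` route are standard parts of vLLM's OpenAI-compatible server.

```python
# Start a vLLM server, wait for it to become healthy, then exercise the
# OpenAI-compatible API. Model and port are example values; swap in your own.
import subprocess
import time

import requests
from openai import OpenAI

MODEL = "Qwen/Qwen2.5-0.5B-Instruct"  # example model; any vLLM-supported HF model works
PORT = 8000

# `vllm serve` is the standard CLI entrypoint for the OpenAI-compatible server.
server = subprocess.Popen(["vllm", "serve", MODEL, "--port", str(PORT)])

# Poll /health until the server is ready (model loading can take a while).
base = f"http://localhost:{PORT}"
for _ in range(120):
    try:
        if requests.get(f"{base}/health", timeout=2).status_code == 200:
            break
    except requests.ConnectionError:
        pass
    time.sleep(5)
else:
    server.terminate()
    raise RuntimeError("vLLM server did not become healthy in time")

# Smoke-test the chat completions route; vLLM accepts any API key by default.
client = OpenAI(base_url=f"{base}/v1", api_key="EMPTY")
reply = client.chat.completions.create(
    model=MODEL,
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
    max_tokens=32,
)
print(reply.choices[0].message.content)
```

For virtual environment isolation, run the same steps inside an environment created with `python -m venv` or `uv venv` before installing vLLM.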
More from vllm-project/vllm-skills
vllm-deploy-docker
Deploy vLLM using Docker (pre-built images or build-from-source) with NVIDIA GPU support and run the OpenAI-compatible server.
vllm-deploy-k8s
Deploy vLLM to Kubernetes (K8s) with GPU support, health probes, and OpenAI-compatible API endpoint. Use this skill whenever the user wants to deploy, run, or serve vLLM on a Kubernetes cluster, including creating deployments, services, checking existing deployments, or managing vLLM on K8s.
vllm-bench-serve
Benchmark vLLM or OpenAI-compatible serving endpoints using vllm bench serve. Supports multiple datasets (random, sharegpt, sonnet, HF), backends (openai, openai-chat, vllm-pooling, embeddings), throughput/latency testing with request-rate control, and result saving. Use when benchmarking LLM serving performance, measuring TTFT/TPOT, or load testing inference APIs.
vllm-bench-random-synthetic
Run vLLM performance benchmark using synthetic random data to measure throughput, TTFT (Time to First Token), TPOT (Time per Output Token), and other key performance metrics. Use when the user wants to quickly test vLLM serving performance without downloading external datasets.
vllm-prefix-cache-bench
Benchmark the efficiency of automatic prefix caching in vLLM using fixed prompts, real-world datasets, or synthetic prefix/suffix patterns. Use when the user asks to benchmark prefix caching hit rate, caching efficiency, or repeated-prompt performance in vLLM.