verify-ai-server-with-device-env
Verify AI Server with Device Environment
Act as a systematic environment verifier that checks whether a Dockerized AI inference server can run correctly on a target GPU machine. Work through four stages in order: host environment, image GPU access, compose configuration, and service health.
Before verifying, read references/gpu-environment.md for CUDA compatibility rules and Docker GPU config patterns. Read references/common-errors.md when a stage fails.
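As a rough illustration of the host-environment stage, the sketch below compares the CUDA version reported in the `nvidia-smi` header (the highest CUDA version the installed driver supports) with the CUDA runtime version of the image. It assumes a deliberately conservative rule, namely that the container's runtime must not be newer than the driver-supported version. The actual rules live in references/gpu-environment.md and are not reproduced here. The function names `parse_cuda_version` and `driver_supports_runtime` are hypothetical.

```python
import re


def parse_cuda_version(text):
    """Extract (major, minor) from text containing 'CUDA Version: X.Y'.

    Returns None when no such field is present, for example when
    nvidia-smi failed or the driver is not installed.
    """
    match = re.search(r"CUDA Version:\s*(\d+)\.(\d+)", text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def driver_supports_runtime(host_cuda, container_cuda):
    """Conservative check (an assumption, not the skill's own rule):
    the container's CUDA runtime must not be newer than the CUDA
    version the host driver reports.
    """
    return container_cuda <= host_cuda


if __name__ == "__main__":
    # Sample header line in the shape nvidia-smi prints; values are illustrative.
    header = "| NVIDIA-SMI 550.54.14   Driver Version: 550.54.14   CUDA Version: 12.4 |"
    host = parse_cuda_version(header)
    print(host, driver_supports_runtime(host, (11, 8)))
```

In practice the header text would come from running `nvidia-smi` on the host, and the container version from the image's base tag or its documentation. A `None` result from the parser is itself a stage-one failure worth reporting before any comparison is attempted.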
Capabilities and Boundaries
Can help:
- Check that the host GPU driver and its supported CUDA version are compatible with the container's CUDA runtime.
- Verify Docker NVIDIA runtime availability.
- Audit docker-compose GPU configuration.
- Test whether a container image can access the GPU.
- Confirm model loading and API health after service startup.
- Diagnose and fix common GPU pass-through and model-loading errors.
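The compose audit in the list above can be pictured with a small sketch. It assumes the compose file has already been parsed into a Python dictionary (for example with a YAML library) and checks the two common ways a service asks for an NVIDIA GPU: the Compose `deploy.resources.reservations.devices` entry with the `gpu` capability, or the older `runtime: nvidia` setting. The patterns the skill actually expects are in references/gpu-environment.md. The helper name `service_requests_gpu` is hypothetical.

```python
def service_requests_gpu(service):
    """Return True if a parsed Compose service mapping requests a GPU.

    Accepts either the legacy `runtime: nvidia` key or a device
    reservation whose capabilities include "gpu".
    """
    if service.get("runtime") == "nvidia":
        return True
    devices = (
        service.get("deploy", {})
        .get("resources", {})
        .get("reservations", {})
        .get("devices", [])
    )
    return any("gpu" in device.get("capabilities", []) for device in devices)


if __name__ == "__main__":
    # Illustrative service definition; the image name is a placeholder.
    service = {
        "image": "example/ai-server:latest",
        "deploy": {
            "resources": {
                "reservations": {
                    "devices": [
                        {"driver": "nvidia", "count": 1, "capabilities": ["gpu"]}
                    ]
                }
            }
        },
    }
    print(service_requests_gpu(service))
```

A service for which this returns False would start without GPU access even on a correctly configured host, which is why the compose stage is checked separately from the host and image stages.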
Cannot help: