huawei-cloud-ascend-models-deploy

Installation
SKILL.md

Huawei Cloud Ascend Models Deploy

Deploy and test large language models on Huawei Cloud Ascend DevServer (910B series). Supports single-machine and dual-machine deployment, model inference testing, and deployment monitoring.

Overview

This skill deploys and tests large language models on Huawei Cloud Ascend DevServer (910B series). Supports single-machine and dual-machine deployment for LLM, VL, Embedding, and Rerank models.

Related Skills (Agent orchestrated, no direct call, Rule 3):

  • huawei-cloud-ascend-remote-connect - SSH connection to DevServer (prerequisite for deployment)
  • huawei-cloud-ascend-command - NPU status check and monitoring (prerequisite and post-deploy monitoring)

Capabilities:

  • Model deployment (single-node, dual-node)
  • Inference testing (LLM chat, VL multimodal, Embedding, Rerank)
  • Deployment log and status monitoring
  • Model catalog and script auto-matching
Installs
26
GitHub Stars
8
First Seen
Jun 4, 2026
huawei-cloud-ascend-models-deploy — huaweicloud/huaweicloud-skills