pytorch-onnx


Overview

ONNX (Open Neural Network Exchange) is an open format built to represent machine learning models. Exporting PyTorch models to ONNX allows them to be executed in environments without Python or PyTorch, using high-performance engines like ONNX Runtime.
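As a rough illustration of that export-then-run flow, the sketch below exports a toy model and runs it with ONNX Runtime from Python. The model, the file name tiny_model.onnx, the tensor names "input" and "output", and the helper names export_and_run and max_abs_diff are all made up for this example. It assumes the torch and onnxruntime packages are installed (recent PyTorch releases may also need onnxscript for export).

```python
def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two flat sequences."""
    return max(abs(x - y) for x, y in zip(a, b))


def export_and_run(onnx_path="tiny_model.onnx"):
    """Export a toy PyTorch model to ONNX, then run it with ONNX Runtime.

    Returns the largest difference between the PyTorch and ONNX Runtime outputs.
    """
    # Imported inside the function so the helper above works without these packages.
    import torch
    import onnxruntime as ort

    # Hypothetical stand-in for a real model.
    model = torch.nn.Sequential(torch.nn.Linear(4, 2), torch.nn.ReLU())
    model.eval()
    example = torch.randn(1, 4)

    # Trace the model with the example input and write the ONNX file.
    torch.onnx.export(
        model,
        (example,),
        onnx_path,
        input_names=["input"],
        output_names=["output"],
    )

    # Load the exported file; PyTorch is not needed from this point on.
    session = ort.InferenceSession(onnx_path, providers=["CPUExecutionProvider"])
    (onnx_out,) = session.run(None, {"input": example.numpy()})

    with torch.no_grad():
        torch_out = model(example).numpy()

    return max_abs_diff(torch_out.flatten().tolist(), onnx_out.flatten().tolist())
```

Calling export_and_run() should return a value close to zero if the exported graph matches the original model. Small floating-point differences are normal.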

When to Use

Use ONNX for cross-language deployment (C++, Java, C#), edge deployment (mobile/IoT), or to target hardware-specific inference engines (such as NVIDIA TensorRT) that accept ONNX as an input format.

Decision Tree

  1. Does your model accept variable batch sizes?
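    • SPECIFY: dynamic_axes in the torch.onnx.export call, so the batch dimension is not fixed to the size of the example input.
  2. Do you need the fastest possible inference on a CPU?
    • APPLY: Quantization using the ONNX Runtime quantization tool, then re-check accuracy, since lower-precision weights can change outputs.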
  3. Are you deploying to a C++ environment without Python?
    • EXPORT: To ONNX and load using the ONNX Runtime C++ API.
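The first two branches of the tree can be sketched in Python as follows. This is a minimal sketch, not a definitive recipe. The helper names (batch_dynamic_axes, export_with_dynamic_batch, quantize_for_cpu), the tensor names "input" and "output", and the axis label "batch_size" are hypothetical. It assumes a single-input, single-output model whose batch dimension is axis 0, and it uses dynamic quantization with 8-bit weights as one possible ONNX Runtime quantization mode.

```python
def batch_dynamic_axes(input_names, output_names, axis_name="batch_size"):
    """Build a dynamic_axes mapping that marks axis 0 of every tensor as dynamic."""
    return {name: {0: axis_name} for name in [*input_names, *output_names]}


def export_with_dynamic_batch(model, example_input, onnx_path):
    """Export a model so the ONNX graph accepts any batch size (branch 1)."""
    import torch  # imported here so batch_dynamic_axes works without PyTorch

    model.eval()
    torch.onnx.export(
        model,
        (example_input,),
        onnx_path,
        input_names=["input"],
        output_names=["output"],
        dynamic_axes=batch_dynamic_axes(["input"], ["output"]),
    )


def quantize_for_cpu(onnx_path, quantized_path):
    """Write a dynamically quantized copy of an ONNX model (branch 2)."""
    from onnxruntime.quantization import QuantType, quantize_dynamic

    # Weights are stored as signed 8-bit integers; activations are
    # quantized at run time.
    quantize_dynamic(onnx_path, quantized_path, weight_type=QuantType.QInt8)
```

For the third branch, the .onnx file written by either function is the artifact that the ONNX Runtime C++ API would load. No Python is required on the deployment side.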

Workflows

First Seen
Feb 9, 2026