C++ Reinforcement Learning
Overview
This skill covers implementing reinforcement learning algorithms in C++ using LibTorch (PyTorch C++ frontend) and modern C++17/20 features. It provides patterns for building high-performance RL systems suitable for production deployment, robotics, game AI, and real-time applications.
When to Use
- Implementing DQN, PPO, SAC, or other RL algorithms in C++
- Building performance-critical RL training pipelines
- Creating efficient replay buffers with proper memory management (see the sketch after this list)
- Deploying trained models with ONNX Runtime
- Parallelizing environment rollouts across threads
- Integrating RL with existing C++ codebases (games, robotics, simulations)
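The replay-buffer item above is where explicit C++ memory control pays off most directly. Below is a minimal sketch of a fixed-capacity ring buffer with uniform sampling in standard C++17; the `Transition` layout, the `ReplayBuffer` name, and all other identifiers are illustrative assumptions rather than an API this skill prescribes, and the sketch is single-threaded (a production buffer would add locking or per-actor sharding).

```cpp
#include <cstddef>
#include <random>
#include <vector>

// One environment transition; field shapes are task-dependent (illustrative).
struct Transition {
  std::vector<float> state, next_state;
  int action;
  float reward;
  bool done;
};

// Fixed-capacity ring buffer with uniform sampling -- a minimal,
// single-threaded sketch, not a prescribed implementation.
class ReplayBuffer {
 public:
  explicit ReplayBuffer(std::size_t capacity) : capacity_(capacity) {
    storage_.reserve(capacity);
  }

  void push(Transition t) {
    if (storage_.size() < capacity_) {
      storage_.push_back(std::move(t));   // still filling up
    } else {
      storage_[next_] = std::move(t);     // overwrite the oldest slot
    }
    next_ = (next_ + 1) % capacity_;
  }

  // Uniform random minibatch; assumes the buffer is non-empty.
  std::vector<Transition> sample(std::size_t batch_size) {
    std::uniform_int_distribution<std::size_t> pick(0, storage_.size() - 1);
    std::vector<Transition> batch;
    batch.reserve(batch_size);
    for (std::size_t i = 0; i < batch_size; ++i)
      batch.push_back(storage_[pick(rng_)]);
    return batch;
  }

  std::size_t size() const { return storage_.size(); }

 private:
  std::size_t capacity_;
  std::size_t next_ = 0;
  std::vector<Transition> storage_;
  std::mt19937 rng_{std::random_device{}()};
};
```

Overwriting the oldest slot in place keeps the buffer at a constant memory footprint after warm-up, with no per-transition allocation on the hot path.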
Core Libraries
Primary: LibTorch (PyTorch C++ Frontend)
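To show the flavor of the LibTorch API these patterns build on, here is a hedged sketch of a small Q-network written as a `torch::nn::Module` subclass. The architecture (two 128-unit hidden layers) and the `QNetwork` name are arbitrary illustrative choices, not something this skill specifies.

```cpp
#include <torch/torch.h>
#include <iostream>

// A small MLP Q-network: maps a state vector to one Q-value per action.
// Layer sizes are illustrative, not prescribed.
struct QNetworkImpl : torch::nn::Module {
  QNetworkImpl(int64_t state_dim, int64_t n_actions) {
    fc1 = register_module("fc1", torch::nn::Linear(state_dim, 128));
    fc2 = register_module("fc2", torch::nn::Linear(128, 128));
    out = register_module("out", torch::nn::Linear(128, n_actions));
  }

  torch::Tensor forward(torch::Tensor x) {
    x = torch::relu(fc1->forward(x));
    x = torch::relu(fc2->forward(x));
    return out->forward(x);  // raw Q-values, no output activation
  }

  torch::nn::Linear fc1{nullptr}, fc2{nullptr}, out{nullptr};
};
TORCH_MODULE(QNetwork);  // generates QNetwork, a shared_ptr-backed holder

int main() {
  QNetwork net(/*state_dim=*/4, /*n_actions=*/2);
  torch::optim::Adam opt(net->parameters(), torch::optim::AdamOptions(1e-3));
  auto q = net->forward(torch::randn({32, 4}));  // Q-values for 32 states
  std::cout << q.sizes() << '\n';                // expect [32, 2]
}
```

The `TORCH_MODULE` macro wraps `QNetworkImpl` in a `shared_ptr`-backed holder, which is the C++ frontend's idiomatic way to pass modules around by value.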