RL Env From Description

Convert a plain-English description of an RL training environment into runnable code across OpenEnv, OpenReward (ORS), Verifiers, and NeMo Gym. Two other framework variants (SkyRL Gym, GEM) are secondary and only relevant for text-action-with-tag-parsing envs — produce them only if the user asks.

When to use

A user describes an env in plain English (a goal, an action surface, or a reward shape) and wants code.
A user asks to "build an env for X", "scaffold an RL env", "port this env to OpenEnv/ORS/Verifiers/NeMo Gym".
A user already has a runnable env in one framework and wants the same env in others.
A user asks "what's the right way to design my reward / state / tool surface for this task" — start with the interview below, then implement.

Do not use for: training runs (TRL/GRPO config), evaluation harness work, or general agent-design questions that don't end with new env code.

Recommended layout (suggest, don't impose)

A clean shape that scales well — but the user gets to pick the actual paths:

rl-env-from-description

RL Env From Description

When to use

Recommended layout (suggest, don't impose)

More from adithya-s-k/rl_envs_101

generate-verifiers-env

generate-openenv-env

generate-nemo-gym-env

generate-ors-env