generate-nemo-gym-env
Build the NeMo Gym variant of an env. NeMo Gym is NVIDIA's RL gym layer, optimized for Ray-based orchestration and post-episode grading. The Python package is `nemo_gym` (installed via `pip install git+https://github.com/NVIDIA-NeMo/Gym`).
Concept
NeMo Gym is NVIDIA's RL gym layer for LLM agents. It's built on Ray and ships a FastAPI-based `SimpleResourcesServer` that exposes one `POST /<tool>` endpoint per tool, plus the standard `/seed_session` (cookie-based session bootstrap) and `/verify` (post-episode grader). Reference docs: docs.nvidia.com/nemo/gym/latest.
When the user has a shared domain module (`<domain>.py`) and wants a NeMo Gym variant, wrap it. Don't duplicate logic.
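To make the protocol concrete, here is a minimal sketch of the endpoint shape described above, written against plain FastAPI rather than the real `SimpleResourcesServer` base class (whose exact signatures aren't shown here); the `GuessGame` domain logic is a hypothetical stand-in for the user's shared `<domain>.py`:

```python
# Minimal FastAPI sketch of the endpoint shape: one POST per tool, plus
# /seed_session and /verify. This is NOT the real nemo_gym
# SimpleResourcesServer API -- it only illustrates the protocol.
import uuid

from fastapi import Cookie, FastAPI, Response
from pydantic import BaseModel

app = FastAPI()


# In the real variant this logic lives in the shared <domain>.py and is
# imported, never duplicated; it's inlined here only to keep the sketch
# self-contained.
class GuessGame:
    def __init__(self, secret: str = "nemo") -> None:
        self.secret = secret

    def guess(self, word: str) -> str:
        return "correct" if word == self.secret else "try again"


sessions: dict[str, GuessGame] = {}  # per-session state, keyed by cookie


class GuessRequest(BaseModel):
    guess: str


class VerifyRequest(BaseModel):
    response: str
    ground_truth: str


@app.post("/seed_session")
def seed_session(response: Response) -> dict:
    # Cookie-based session bootstrap: mint an id and stash fresh game state.
    session_id = str(uuid.uuid4())
    sessions[session_id] = GuessGame()
    response.set_cookie("session_id", session_id)
    return {"ok": True}


@app.post("/guess")  # one POST endpoint per tool
def guess(req: GuessRequest, session_id: str = Cookie(...)) -> dict:
    # Delegate to the domain object instead of re-implementing game logic.
    return {"feedback": sessions[session_id].guess(req.guess)}


@app.post("/verify")
def verify(req: VerifyRequest) -> dict:
    # Post-episode grader: substring match against ground_truth.
    return {"reward": 1.0 if req.ground_truth in req.response else 0.0}
```

Run it with `uvicorn server:app` and call `/seed_session` first so the cookie exists before any tool call.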
Archetypes
| Archetype | Hallmarks |
|---|---|
| Pure-Python game | Single tool endpoint; `/verify` does a substring match against `ground_truth`. |
| Stateful sandbox | Per-session sandbox in `self.sessions`; lazy init on the first tool call (see the sketch after this table). |
| Vision / computer-use | One endpoint per action; `/verify` rewards trajectories that called `terminate(success)`. |
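For the stateful-sandbox archetype, the key move is lazy initialization: `/seed_session` only registers the session, and the sandbox itself is created when the first tool call arrives. A minimal sketch of that pattern, with `Sandbox` as a hypothetical stand-in for whatever the shared domain module provides:

```python
# Lazy per-session sandbox init (stateful-sandbox archetype). Sandbox is a
# hypothetical stand-in for whatever the shared domain module provides.
class Sandbox:
    def run(self, cmd: str) -> str:
        return f"ran: {cmd}"  # placeholder for real sandboxed execution


class StatefulResources:
    def __init__(self) -> None:
        self.sessions: dict[str, Sandbox] = {}

    def _sandbox(self, session_id: str) -> Sandbox:
        # Create the sandbox on the first tool call for this session, so
        # /seed_session stays cheap and no sandbox is spun up for sessions
        # that never issue a tool call.
        if session_id not in self.sessions:
            self.sessions[session_id] = Sandbox()
        return self.sessions[session_id]

    def exec_tool(self, session_id: str, cmd: str) -> str:
        return self._sandbox(session_id).run(cmd)
```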
Recommended file layout
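By analogy with the sibling variants listed under "More from" below, each of which emits a runnable `<env_dir>/<framework>/` folder, a plausible layout looks like this; the exact file names are assumptions, not prescribed by the NeMo Gym docs:

```
<env_dir>/nemo_gym/
├── server.py        # SimpleResourcesServer subclass wrapping the shared <domain>.py
├── rollout.py       # smoke-test rollout against the running server
└── pyproject.toml   # pins nemo_gym from the GitHub repo
```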
More from adithya-s-k/rl_envs_101
generate-verifiers-env
Builds a Verifiers (PrimeIntellect) variant of an RL environment. Use whenever someone asks to scaffold a Verifiers env, port to Verifiers, build an in-process toolkit, set up a `vf.ToolEnv` with a Rubric, or wire up a TRL `GRPOTrainer` rollout. Verifiers is the right framework when the user wants in-process tools (no HTTP server), structured tool calling driven by plain Python functions, composable reward rubrics with multiple grader functions, fast iteration with no Docker, or the cleanest path from prototype to TRL training. Output is a runnable `<env_dir>/verifiers/` folder with `env.py` (toolkit + standalone tool functions + `create_verifiers_env`), `rollout.py`, and `pyproject.toml`. Use for prompts like "make a verifiers env for X", "wrap my game in verifiers", or "set up a vf.ToolEnv".
generate-openenv-env
Builds an OpenEnv (Meta) variant of an RL environment. Use whenever someone asks to scaffold an OpenEnv server, port an existing env to OpenEnv, add MCP tools to an env, or deploy an OpenEnv to HF Spaces. OpenEnv is the right framework when the user wants HTTP+MCP, structured tool calls discovered via `list_tools()`, an optional Gradio UI, sandbox-backed sessions, or deployment as a Docker container / HF Space. Output is a runnable `<env_dir>/openenv/` folder with `server/app.py`, `server/<env>_environment.py`, `pyproject.toml`, `Dockerfile`, and `rollout.py`. Use for prompts like "wrap my game in OpenEnv", "make an MCP env for X", or "add the openenv variant".
generate-ors-env
Builds an Open Reward Standard (ORS) variant of an RL environment using the official `openreward` Python package. Use whenever someone asks to scaffold an ORS env, port to OpenReward, add per-tool-call rewards, deploy to OpenReward.ai, or wrap an existing env in the ORS protocol. ORS is the right framework when the user wants HTTP+REST+SSE, rewards arriving inline with each tool call (not post-episode), task-spec-driven sessions, splits (train/val/test), or deployment to OpenReward.ai or HF Spaces. Output is a runnable `<env_dir>/ors/` folder with `server.py`, `tasks.py`, `pyproject.toml`, `Dockerfile.spaces`, and `rollout.py`. Use for prompts like "wrap my env in ORS", "make an OpenReward env for X", or "add per-call reward to my env".
rl-env-from-description
Turns a user's plain-English description of an RL training environment into runnable code across the four target frameworks — OpenEnv, OpenReward (ORS), Verifiers, and NeMo Gym. Use whenever someone describes an environment they want to build ("I want to train an agent that does X", "make an env where the model has to Y"), asks to scaffold a new env, asks to port an existing env to one of these frameworks, or asks how to design tools/rewards/state for a new env. Use even when the user does not explicitly say "RL environment" — descriptions like "agent that browses the web", "tool-calling agent for SQL", or "game-playing agent" all qualify. Drives the full flow — clarifying interview, env-name selection, shared-domain extraction, per-framework implementation, and rollout-based smoke tests.