generate-ors-env
Build the ORS variant of an env using the official openreward >= 0.1.33 package (the ors-sdk name is a common mistake — it does not exist on PyPI).
Concept
ORS is the Open Reward Standard (openrewardstandard.io) — an HTTP REST + Server-Sent Events protocol for agent envs. Reward arrives inline with every ToolOutput, which is the framework's defining feature compared to OpenEnv (external/post-hoc reward) and NeMo Gym (post-episode /verify).
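A minimal sketch of the inline-reward idea. It assumes the openreward package exposes a `@tool` decorator and a `ToolOutput` carrying `reward`/`finished` fields, as this page implies; the exact names and signatures may differ from the released API.

```python
# Sketch only: import path and ToolOutput fields are assumptions based on
# this page, not verified against the openreward package.
from openreward import tool, ToolOutput

SECRET = 42

@tool
def guess(number: int) -> ToolOutput:
    """Guess the secret number; the reward rides along with the tool result."""
    if number == SECRET:
        # Terminal step: full reward and finished=True in the same payload.
        return ToolOutput(content="Correct!", reward=1.0, finished=True)
    hint = "higher" if number < SECRET else "lower"
    # Intermediate step: zero (or shaped) reward travels inline with the hint.
    return ToolOutput(content=f"Try {hint}.", reward=0.0, finished=False)
```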
When the user has a shared domain module (<domain>.py) and wants an ORS variant, never duplicate domain logic into the framework folder — wrap it.
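A sketch of what "wrap it" means in practice. `wordle.py` and its `Game` class stand in for the user's `<domain>.py`; the openreward imports are the same assumption as above.

```python
# env.py inside the ORS folder: a thin adapter, no duplicated game rules.
from openreward import tool, ToolOutput  # assumed API, see note above

from wordle import Game  # hypothetical shared domain module, lives outside this folder

game = Game()

@tool
def play(word: str) -> ToolOutput:
    """Delegate every rule to the domain module; only translate the result."""
    feedback, solved = game.guess(word)
    return ToolOutput(content=feedback, reward=1.0 if solved else 0.0, finished=solved)
```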
Archetypes
| Archetype | Hallmarks |
|---|---|
| Pure-Python game | Single @tool, tasks.py with N task dicts forming the train split, terminal reward via finished=True. |
| Stateful sandbox | setup() allocates resources from task_spec; teardown() frees them; per-tool reward stubs (see the sketch after this table). |
| Vision / computer-use | ImageBlock(data=<base64>, mimeType="image/png") returns; terminate(status) tool emits the terminal reward. |
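A sketch of the stateful-sandbox archetype from the table above: `setup()` allocates per-episode resources from `task_spec`, `teardown()` frees them. The function names and the `task_spec` shape are assumptions drawn from this page, not a verified openreward API.

```python
# Hypothetical lifecycle hooks for a sandbox-style ORS env.
import shutil
import tempfile

WORKDIR = None

def setup(task_spec: dict) -> None:
    """Allocate a scratch directory seeded from the task spec."""
    global WORKDIR
    WORKDIR = tempfile.mkdtemp(prefix=task_spec.get("task_id", "task"))

def teardown() -> None:
    """Free everything setup() allocated."""
    global WORKDIR
    if WORKDIR:
        shutil.rmtree(WORKDIR, ignore_errors=True)
        WORKDIR = None
```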
Imports — exactly these
More from adithya-s-k/rl_envs_101
generate-verifiers-env
Builds a Verifiers (PrimeIntellect) variant of an RL environment. Use whenever someone asks to scaffold a Verifiers env, port to Verifiers, build an in-process toolkit, set up a `vf.ToolEnv` with a Rubric, or wire up a TRL `GRPOTrainer` rollout. Verifiers is the right framework when the user wants in-process tools (no HTTP server), structured tool calling driven by plain Python functions, composable reward rubrics with multiple grader functions, fast iteration with no Docker, or the cleanest path from prototype to TRL training. Output is a runnable `<env_dir>/verifiers/` folder with `env.py` (toolkit + standalone tool functions + `create_verifiers_env`), `rollout.py`, and `pyproject.toml`. Use for prompts like "make a verifiers env for X", "wrap my game in verifiers", or "set up a vf.ToolEnv".
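A minimal sketch of the in-process shape this entry describes: plain Python functions as tools plus a Rubric of grader functions feeding a `vf.ToolEnv`. The parameter names below (`funcs=`, `weights=`, `tools=`, `max_turns=`) are assumptions about the verifiers API and may differ from the installed version; dataset wiring is omitted.

```python
import verifiers as vf  # assumed import alias used throughout the verifiers docs

def add(a: int, b: int) -> int:
    """Tool: add two integers."""
    return a + b

def exact_answer(completion, answer, **kwargs) -> float:
    """Grader: 1.0 when the expected answer appears in the completion."""
    return 1.0 if str(answer) in str(completion) else 0.0

rubric = vf.Rubric(funcs=[exact_answer], weights=[1.0])
env = vf.ToolEnv(tools=[add], rubric=rubric, max_turns=4)  # dataset omitted for brevity
```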
generate-openenv-env
Builds an OpenEnv (Meta) variant of an RL environment. Use whenever someone asks to scaffold an OpenEnv server, port an existing env to OpenEnv, add MCP tools to an env, or deploy an OpenEnv to HF Spaces. OpenEnv is the right framework when the user wants HTTP+MCP, structured tool calls discovered via `list_tools()`, an optional Gradio UI, sandbox-backed sessions, or deployment as a Docker container / HF Space. Output is a runnable `<env_dir>/openenv/` folder with `server/app.py`, `server/<env>_environment.py`, `pyproject.toml`, `Dockerfile`, and `rollout.py`. Use for prompts like "wrap my game in OpenEnv", "make an MCP env for X", or "add the openenv variant".
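A rough client-side sketch of the HTTP flow this entry describes: discover tools, then call one against the running `server/app.py`. The `/list_tools` and `/call_tool` routes and payload fields are illustrative placeholders, not confirmed OpenEnv endpoint names; check the generated server for the real paths.

```python
import requests

BASE = "http://localhost:8000"  # assumed local dev address

# Discover the available tools (hypothetical route).
tools = requests.get(f"{BASE}/list_tools").json()
print("available tools:", [t["name"] for t in tools])

# Invoke the first tool (hypothetical route and payload shape).
result = requests.post(
    f"{BASE}/call_tool",
    json={"name": tools[0]["name"], "arguments": {}},
).json()
print(result)
```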
generate-nemo-gym-env
Builds a NeMo Gym (NVIDIA) variant of an RL environment. Use whenever someone asks to scaffold a NeMo Gym Resources Server, port an existing env to NeMo Gym, expose tools as `app.post()` endpoints with cookie-based sessions, add a post-episode `/verify` reward grader, or deploy a NeMo Gym env to HF Spaces. NeMo Gym is the right framework when the user wants HTTP+REST with cookie session handling, raw `requests`-driven rollouts (no SDK client), Ray-based orchestration, or NVIDIA NeMo / TRL training integration with a `responses_create_params` + `ground_truth` dataset format. Output is a runnable `<env_dir>/nemo_gym/` folder with `server.py`, `pyproject.toml`, `Dockerfile`, `configs/<env>.yaml`, and `rollout.py`. Use for prompts like "wrap my env in NeMo Gym", "make a NeMo resources server for X", or "add a post-episode grader to my env".
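A sketch of the raw-requests rollout style this entry describes: a cookie-carrying `requests.Session` hits the tool endpoints, then `/verify` grades the episode afterwards. Apart from `/verify`, the route names and payload fields are illustrative assumptions, not the generated server's exact API.

```python
import requests

BASE = "http://localhost:9000"   # assumed local dev address
session = requests.Session()     # cookies keep per-episode state on the server

# Call a tool endpoint (hypothetical route and payload).
step = session.post(f"{BASE}/guess", json={"number": 17}).json()
print("tool output:", step)

# Post-episode grading via /verify, as described above (payload is illustrative).
verdict = session.post(f"{BASE}/verify", json={"ground_truth": "42"}).json()
print("reward:", verdict)
```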
rl-env-from-description
Turns a user's plain-English description of an RL training environment into runnable code across the four target frameworks — OpenEnv, OpenReward (ORS), Verifiers, and NeMo Gym. Use whenever someone describes an environment they want to build ("I want to train an agent that does X", "make an env where the model has to Y"), asks to scaffold a new env, asks to port an existing env to one of these frameworks, or asks how to design tools/rewards/state for a new env. Use even when the user does not explicitly say "RL environment" — descriptions like "agent that browses the web", "tool-calling agent for SQL", or "game-playing agent" all qualify. Drives the full flow — clarifying interview, env-name selection, shared-domain extraction, per-framework implementation, and rollout-based smoke tests.