photo-agents-autonomous-llm

Installation
SKILL.md

Photo Agents Autonomous LLM Skill

Skill by ara.so — AI Agent Skills collection.

Overview

Photo Agents is a Python framework for building autonomous, self-evolving AI agents that ground their understanding in visual observations of the screen. Unlike traditional text-only agents, Photo Agents implements a perceive → reason → act cycle with a layered memory architecture inspired by biological cognition: vision input, bounded observations stored in layers (L1-L4), and skills the agent writes from real successes.

Key capabilities:

  • Multi-provider LLM routing (Anthropic Claude, OpenAI GPT, failover sessions)
  • Layered memory system (working/global/SOP/session archive)
  • Physical execution tools (file I/O, sandboxed code, browser automation via Chrome DevTools Protocol)
  • Multiple client interfaces (CLI, Streamlit web app, PyQt desktop, chat platform bots)
  • Self-evolving through reflection and skill generation

Installation

Basic Installation

Installs
95
First Seen
May 17, 2026
photo-agents-autonomous-llm — aradotso/ai-agent-skills