baoyu-danger-gemini-web

Originally from yelban/baoyu-skills.tw
Summary

Reverse-engineered Gemini Web API for text and image generation with multi-turn conversation support.

  • Supports text generation, image generation from prompts, and vision input via reference images for analysis or variation
  • Includes four model options: gemini-3-pro (default), gemini-3-flash, gemini-3-flash-thinking, and gemini-3.1-pro-preview
  • Multi-turn conversation via session IDs with automatic cookie caching and browser-based Google authentication
  • Requires user consent before first use; supports custom configuration via EXTEND.md for model defaults, proxy settings, and data directory paths
SKILL.md

Gemini Web Client

Text/image generation via Gemini Web API. Supports reference images and multi-turn conversations.

User Input Tools

When this skill prompts the user, follow this tool-selection rule (priority order):

  1. Prefer built-in user-input tools exposed by the current agent runtime — e.g., AskUserQuestion, request_user_input, clarify, ask_user, or any equivalent.
  2. Fallback: if no such tool exists, emit a numbered plain-text message and ask the user to reply with the chosen number/answer for each question.
  3. Batching: if the tool supports multiple questions per call, combine all applicable questions into a single call; if it accepts only one question per call, ask the questions one at a time in priority order.

Concrete AskUserQuestion references below are examples — substitute the local equivalent in other runtimes.
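
The tool-selection rule above can be sketched as a small planning function. This is a minimal illustration, not the skill's actual implementation: the `UserInputTool` type, the `plan_prompts` and `format_fallback` names, and the sample questions are all hypothetical, and a real runtime would dispatch the planned calls through its own tool interface.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple


@dataclass
class UserInputTool:
    """Hypothetical description of a runtime's user-input tool."""
    name: str             # e.g. "AskUserQuestion" or "ask_user"
    supports_batch: bool  # True if one call can carry several questions


def format_fallback(questions: Sequence[str]) -> str:
    """Rule 2: build a numbered plain-text message when no tool exists."""
    lines = [f"{i}. {q}" for i, q in enumerate(questions, start=1)]
    lines.append("Reply with the chosen number/answer for each question.")
    return "\n".join(lines)


def plan_prompts(
    questions: Sequence[str], tool: Optional[UserInputTool]
) -> List[Tuple[str, object]]:
    """Return the ordered list of prompts to issue.

    Each entry is ("tool", [questions...]) for a tool call or
    ("text", message) for the plain-text fallback.
    """
    if not questions:
        return []
    if tool is None:
        # Rule 2: no built-in tool, so fall back to plain text.
        return [("text", format_fallback(questions))]
    if tool.supports_batch:
        # Rule 3: combine all applicable questions into a single call.
        return [("tool", list(questions))]
    # Rule 3: single-question tool, so ask one at a time in priority order.
    return [("tool", [q]) for q in questions]


if __name__ == "__main__":
    # Illustrative questions only; the skill's real prompts may differ.
    sample = ["Do you consent to using this skill?", "Which model should be the default?"]
    for kind, payload in plan_prompts(sample, None):
        print(kind, payload, sep="\n")
```

Keeping the decision in a pure function that returns a plan, rather than calling a tool directly, makes the priority order easy to check independently of any particular agent runtime.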

Script Directory

Important: All scripts are located in the scripts/ subdirectory of this skill.

Agent Execution Instructions:

Installs: 18.2K
GitHub Stars: 18.1K
First Seen: Jan 19, 2026