bedrock-inference

Installation
SKILL.md

Amazon Bedrock Inference

Overview

Amazon Bedrock Runtime provides APIs for invoking foundation models including Claude (Opus, Sonnet, Haiku), Nova (Amazon), Titan (Amazon), and third-party models (Cohere, AI21, Meta). Supports both synchronous and asynchronous inference with streaming capabilities.

Purpose: Production-grade model inference with unified API across all Bedrock models

Pattern: Task-based (independent operations for different inference modes)

Key Capabilities:

  1. Model Invocation - Direct model calls with native or Converse API
  2. Streaming - Real-time token streaming for low latency
  3. Async Invocation - Long-running tasks up to 24 hours
  4. Token Counting - Cost estimation before inference
  5. Guardrails - Runtime content filtering and safety
  6. Inference Profiles - Cross-region routing and cost optimization
Installs
22
GitHub Stars
11
First Seen
Jan 24, 2026
bedrock-inference — adaptationio/skrillz