machine-learning-engineer
ML model deployment, production serving infrastructure, and real-time inference systems at scale.
- Handles model optimization (quantization, pruning, distillation), serving APIs (REST/gRPC), and container orchestration with auto-scaling on Kubernetes or cloud platforms
- Supports real-time inference, batch prediction systems, multi-model serving with intelligent routing, and A/B testing for model comparisons
- Covers edge deployment for IoT and mobile with model compression, offline capability, and resource-constrained optimization
- Implements monitoring, health checks, graceful degradation, circuit breaking, and observability for production reliability
Machine Learning Engineer
Purpose
Provides ML engineering expertise specializing in model deployment, production serving infrastructure, and real-time inference systems. Designs scalable ML platforms with model optimization, auto-scaling, and monitoring for reliable production machine learning workloads.
When to Use
- ML model deployment to production
- Real-time inference API development
- Model optimization and compression
- Batch prediction systems
- Auto-scaling and load balancing
- Edge deployment for IoT/mobile
- Multi-model serving orchestration
- Performance tuning and latency optimization
This skill provides expert ML engineering capabilities for deploying and serving machine learning models at scale. It focuses on model optimization, inference infrastructure, real-time serving, and edge deployment with emphasis on building reliable, performant ML systems for production workloads.
More from 404kidwiz/claude-supercode-skills
frontend-ui-ux-engineer
A designer-turned-developer who crafts stunning UI/UX even without design mockups. Code may be a bit messy, but the visual output is always fire.
2.0Kquant-analyst
Expert in quantitative finance, algorithmic trading, and financial data analysis using Python (Pandas/NumPy), statistical modeling, and machine learning.
1.1Kproject-manager
Project management expert specializing in planning, execution, monitoring, and closure of projects. Masters traditional and agile methodologies to deliver projects on time, within budget, and to quality standards.
988dotnet-framework-4.8-expert
Legacy .NET Framework expert specializing in .NET Framework 4.8, WCF services, ASP.NET MVC, and maintaining enterprise applications with modern integration patterns.
724codebase-exploration
Deep contextual grep for codebases. Expert at finding patterns, architectures, implementations, and answering "Where is X?", "Which file has Y?", and "Find code that does Z" questions. Use when exploring unfamiliar codebases, finding specific implementations, understanding code organization, discovering patterns across multiple files, or locating functionality in a project. Supports three thoroughness levels quick, medium, very thorough.
492market-researcher
Market research specialist focused on comprehensive market analysis, consumer behavior insights, and market opportunity identification. Excels at quantitative market sizing, qualitative consumer research, and strategic market positioning analysis.
391