voice-note-to-midi

Installation
SKILL.md

🎡 Voice Note to MIDI

Transform your voice memos, humming, and melodic recordings into clean, quantized MIDI files ready for your DAW.

What It Does

This skill provides a complete audio-to-MIDI conversion pipeline that:

  1. Stem Separation - Uses HPSS (Harmonic-Percussive Source Separation) to isolate melodic content from drums, noise, and background sounds
  2. ML-Powered Pitch Detection - Leverages Spotify's Basic Pitch model for accurate fundamental frequency extraction
  3. Key Detection - Automatically detects the musical key of your recording using Krumhansl-Kessler key profiles
  4. Intelligent Quantization - Snaps notes to a configurable timing grid with optional key-aware pitch correction
  5. Post-Processing - Applies octave pruning, overlap-based harmonic removal, and legato note merging for clean output

Pipeline Architecture

Audio Input (WAV/M4A/MP3)
    ↓
Related skills

More from thinkfleetai/thinkfleet-engine

Installs
4
First Seen
Mar 1, 2026