Machine Learning Wiki

A self-contained markdown knowledge base on transformer architectures, attention cost and efficiency, and long-context scaling. This skill is for querying it: the knowledge is already compiled into articles under wiki/, so read those rather than re-deriving from scratch.

Keep this current: as the wiki grows, update the name and description above so they describe what it actually covers and trigger on the right questions.

(Sample note: this example wiki lives in examples/ within the llm-wiki repo. To load it as a skill, place the directory in your skills path named ml-llm-wiki, so the directory matches the name above.)

Maintenance and deeper analysis - ingesting sources, superseding stale knowledge, linting, auditing, critiquing reasoning - is not done here. Use the llm-wiki skill, which owns the write workflow and the file format. The llm-wiki skill is required to keep this wiki current; without it the wiki is still readable, but do not hand-edit articles outside the conventions in wiki/README.md.

What's inside

One topic so far, machine-learning: how attention works, why its memory cost was once thought to be a hard quadratic limit and why that turned out to be an implementation artefact, and what makes long context practical.

ml-llm-wiki

Machine Learning Wiki

What's inside

How to query