Code LLM Papers Guide
Overview
This curated collection covers LLMs for code, from foundational models (Codex, CodeGen, StarCoder) through code generation, completion, repair, translation, and understanding. It accompanies a TMLR survey paper that provides a systematic categorization, and it tracks 500+ papers across pre-training, fine-tuning, evaluation, and application of code-focused language models.
Taxonomy
Code LLMs
├── Pre-training
│ ├── Encoder-only (CodeBERT, GraphCodeBERT)
│ ├── Decoder-only (Codex, CodeGen, StarCoder, DeepSeek-Coder)
│ └── Encoder-Decoder (CodeT5, PLBART)
├── Fine-tuning & Alignment
│ ├── Instruction tuning (WizardCoder, Magicoder)
│ ├── Reinforcement learning for code (CodeRL)
│ └── Self-play (AlphaCode)
├── Applications
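
To locate a model within the taxonomy programmatically, the tree can be held as a nested mapping. The following is a minimal sketch, not part of the collection itself: the names `TAXONOMY` and `find_category` are hypothetical, and only the Pre-training branch is filled in to keep the example short.

```python
from typing import Optional, Tuple

# Hypothetical encoding of the Pre-training branch of the tree above.
# Keys and model names are copied from the taxonomy; the structure is an assumption.
TAXONOMY = {
    "Pre-training": {
        "Encoder-only": ["CodeBERT", "GraphCodeBERT"],
        "Decoder-only": ["Codex", "CodeGen", "StarCoder", "DeepSeek-Coder"],
        "Encoder-Decoder": ["CodeT5", "PLBART"],
    },
}


def find_category(model: str, taxonomy: dict = TAXONOMY) -> Optional[Tuple[str, str]]:
    """Return (branch, leaf) for the first leaf that lists `model`, else None.

    Matching is case-insensitive so "starcoder" finds "StarCoder".
    """
    target = model.lower()
    for branch, leaves in taxonomy.items():
        for leaf, models in leaves.items():
            if target in (name.lower() for name in models):
                return branch, leaf
    return None


if __name__ == "__main__":
    for name in ("StarCoder", "CodeT5", "CodeBERT"):
        print(name, "->", find_category(name))
```

Other branches could be added as further top-level keys with the same shape, which keeps the lookup function unchanged.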