- Compression for AGI 2023.02
- Language Modeling Is Compression 2023.09
- Training Compute-Optimal Large Language Models 2022.03
- Predicting Emergent Abilities with Infinite Resolution Evaluation 2023.10
- The Platonic Representation Hypothesis 2024.05
- Parables on the Power of Planning in AI: From Poker to Diplomacy 2024.05
- Attention Is All You Need 2017.06
- Improving Language Understanding by Generative Pre-Training 2018.06
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding 2018.10
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models 2024.01
- Scaling Instruction-Finetuned Language Models 2022.10
- Reflexion: Language Agents with Verbal Reinforcement Learning 2023.03
- RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment 2023.04
- WizardLM: Empowering Large Language Models to Follow Complex Instructions 2023.04
- LIMA: Less Is More for Alignment 2023.05
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model 2023.05
- Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models 2023.05
- Preference Ranking Optimization for Human Alignment 2023.06
- Orca: Progressive Learning from Complex Explanation Traces of GPT-4 2023.06
- Self-Alignment with Instruction Backtranslation 2023.08
- Taken out of context: On measuring situational awareness in LLMs 2023.09
- RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback 2023.09
- Self-Rewarding Language Models 2024.01
- A Survey of Monte Carlo Tree Search Methods 2012.03
- From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models 2024.04
- From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification 2024.03
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models 2024.04
- Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models 2024.06
- Inverse Constitutional AI: Compressing Preferences into Principles 2024.06
- Following Length Constraints in Instructions 2024.06
- The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions 2024.04
- Self-critiquing models for assisting human evaluators 2022.06
- Weak-to-strong generalization 2023.12
- Prover-Verifier Games improve legibility of LLM outputs 2024.07
- Larger language models do in-context learning differently 2023.03
- Many-Shot In-Context Learning 2024.04
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models 2022.01
- Let’s Verify Step by Step 2023.05
- Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks 2023.05
- Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations 2023.12
- Solving olympiad geometry without human demonstrations 2024.01
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models 2024.02
- AlphaMath Almost Zero: Process Supervision without Process 2024.05
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models 2023.05
- Large Language Models Can Learn Temporal Reasoning 2024.01
- Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking 2024.03
- GPT-4 Architecture, Infrastructure, Training Dataset, Costs, Vision, MoE 2023.07
- Llama 2: Open Foundation and Fine-Tuned Chat Models 2023.07
- Gemini 1.0 2023.12
- Gemini 1.5 2024.02
- DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence 2024.06
- GPT-4o mini: advancing cost-efficient intelligence 2024.07
- AI achieves silver-medal standard solving International Mathematical Olympiad problems 2024.07
- The Llama 3 Herd of Models 2024.07
- Learning to Reason with LLMs 2024.09
- Introducing ChatGPT Pro 2024.12
- State of GPT 2023.05
- Some intuitions about large language models 2023.11
- MiniCPM: Unveiling the Unlimited Potential of On-Device Large Language Models 2024.04
- Llama 3 Opens the Second Chapter of the Game of Scale 2024.04
- Successful language model evals 2024.05
- Three hypotheses on LLM reasoning 2024.12
- Claude’s Character 2024.06
- Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process 2024.07
- Physics of Language Models: Part 2.1, YouTube 2024.09
- Physics of Language Models: Part 3.1, Knowledge Storage and Extraction 2023.09
- Physics of Language Models: Part 3.2, Knowledge Manipulation 2023.09
- Physics of Language Models: Part 3.1 + 3.2, YouTube 2023.11
- Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws 2024.04
- Challenging BIG-Bench tasks and whether chain-of-thought can solve them 2022.10
- COLLIE: Systematic Construction of Constrained Text Generation Tasks 2023.07
- FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models 2023.10
- Instruction-Following Evaluation for Large Language Models 2023.11
- Beyond Instruction Following: Evaluating Rule Following of Large Language Models 2024.07
- Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning 2024.06
- Introducing SimpleQA 2024.10
- OpenAI Model Spec (2024-05-08) 2024.05