A high-signal figure for understanding how DeepMind turned ambitious research systems into durable products, especially across reinforcement learning, speech, and code generation.
Researcher Profile
Margaret Mitchell
Large-scale open code data (The Stack)
Co-author, The Stack
Co-authored The Stack, a large permissively licensed dataset widely used to train open code models.
Topics
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last updated
March 20, 2026
Best First Clicks
Known For
The ideas, systems, and research directions that make this person worth knowing.
1. Large-scale open code data (The Stack)
2. The Stack: 3 TB of permissively licensed source code
3. Datasets
4. Code
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
Worth keeping: an original RWKV co-author whose public work has since moved into production AI for crisis intelligence, security-aware infrastructure tooling, and open-LLM experimentation.
Worth keeping: it connects an early RWKV byline to a much more visible later research program in agentic AI, biomedical discovery, and code-focused evaluation, which makes the profile far more useful than a one-paper stub.
Co-authored Code Llama: a key open-model reference for code generation and coding assistants.