Researcher Profile

Margaret Mitchell

Large-scale open code data (The Stack)

Co-author, The Stack

Co-authored The Stack, a large dataset of permissively licensed source code used to train open code models.

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Last updated

March 20, 2026

Known For

The ideas, systems, and research directions that make this person worth knowing.

01

Large-scale open code data (The Stack)

02

The Stack: 3 TB of permissively licensed source code

03

Datasets

04

Code

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared topic

Matteo Grella

RWKV and efficient sequence modeling

4 sources

Worth keeping because he is one of the original RWKV coauthors who clearly did not stop there: his public work moves into production AI for crisis intelligence, security-aware infrastructure tooling, and later open-LLM experimentation.

Shared topic

Xiangru Tang

RWKV and efficient sequence modeling

5 sources

Worth keeping because this profile connects an early RWKV byline to a much more visible later research program in agentic AI, biomedical discovery, and code-focused evaluation, which makes the page far more useful than a one-paper ghost profile.
