Researcher Profile

Shawn Presser

Open-source LLMs (EleutherAI)

Open-source builder with public data and tooling contributions

Worth knowing in the open-model ecosystem because he combines authorship on The Pile with a large body of public code and notes, rather than resting on a single flagship paper.

Organizations

EleutherAI

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Known For

The ideas, systems, and research directions that make this person worth knowing.

01. The Pile dataset
02. Open-source tooling and utilities
03. A broad public code footprint around machine learning systems
04. Open-source LLMs (EleutherAI)
05. GPT-NeoX (GitHub)
06. EleutherAI (GitHub)

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

Signature Works

Additional papers, projects, or repositories that help flesh out the profile.

Supporting Sources

Additional links that help verify and flesh out this profile.

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared canonical source

Noa Nabeshima

Open-source LLMs (EleutherAI)

5 sources

A useful long-tail open-model page because it connects one of the lesser-known contributors to The Pile with a newer line of small public datasets and Hugging Face releases instead of leaving the profile as generic EleutherAI boilerplate.

Shared canonical source

Travis Hoppe

Open-source LLMs (EleutherAI)

5 sources

Worth knowing as one of the early open-data contributors around the EleutherAI orbit, with a profile that mixes work on The Pile with a long tail of small, public NLP and machine-learning experiments.
