Researcher Profile

Editor reviewed

Aran Komatsuzaki

Open-source LLMs (EleutherAI)

GPT-J co-lead and long-time open-model builder

An important open-model researcher for understanding how early public LLM efforts, scaling heuristics, and open data work fed into the broader modern model ecosystem.

Organizations

Georgia Institute of Technology

Labs

EleutherAI

Topics

Open Models

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Last reviewed

March 18, 2026

Best First Clicks

About Me – Aran Komatsuzakiprofile GPT-J / mesh-transformer-jaxprofile Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpointspaper

Official And External Links

Website ↗X / Twitter ↗OpenAlex ↗

Known For

The ideas, systems, and research directions that make this person worth knowing.

GPT-J and early open-source LLMs

Scaling-method intuition and sparse upcycling

Public-facing model and dataset building

Open-source LLMs (EleutherAI)

GPT-NeoX (GitHub)

EleutherAI (GitHub)

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

About Me – Aran Komatsuzakiprofile GPT-J / mesh-transformer-jaxprofile Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpointspaper GPT-NeoX (GitHub)project EleutherAI (GitHub)project

Signature Works

Additional papers, projects, or repositories that help flesh out the profile.

EleutherAI: Going Beyond "Open Science" to "Science in the Open"paper OpenAlexprofile

Supporting Sources

Additional links that help verify and flesh out this profile.

EleutherAI: Going Beyond "Open Science" to "Science in the Open"paper OpenAlexprofile

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared canonical source

Ben Wang

Open-source LLMs (EleutherAI)

5 sources

Important for the bridge between early open-model scaling work and later frontier closed-model systems, especially around architecture and training-stack choices that ended up mattering at both ends of the field.

EleutherAI Open Models Systems & Infrastructure

Start HereBen Wang

Shared canonical source

Anish Thite

Open-source LLMs (EleutherAI)

5 sources

Useful to follow if you care about the practical evaluation layer of open models, especially where benchmark tooling and reproducible comparisons actually shape what the ecosystem measures.

EleutherAI Open Models Evaluation & Benchmarks

Start HereAnish Thite

Shared canonical source

Charles Foster

Open-source LLMs (EleutherAI)

5 sources

A useful person to track for the evaluation side of AI risk work, especially where open-model benchmarking meets the question of which measurements are actually trustworthy enough to inform decisions.

EleutherAI Open Models Evaluation & Benchmarks

Start HereCharles Foster

Shared canonical source

Eric Hallahan

Open-source LLMs (EleutherAI)

5 sources

Useful because his footprint runs through the early EleutherAI training stack, GPT-NeoX, and Pythia, which makes the page a better map of open-model infrastructure than a generic one-paper profile.

EleutherAI Open Models Systems & Infrastructure

Start HereAbout Eric Hallahan

Shared canonical source

Horace He

Open-source LLMs (EleutherAI)

5 sources

One of the best people to track if you care about the practical performance layer of modern AI systems, especially where compilers, kernels, and model-serving speed actually move the frontier.

EleutherAI Open Models Systems & Infrastructure

Start HereHorace He

Shared canonical source

Jason Phang

Open-source LLMs (EleutherAI)

5 sources

One of the better people to study for the thread connecting classic transfer learning in NLP to modern large-model evaluation and open-model research practice.

EleutherAI Open Models Evaluation & Benchmarks

Start HereJason Phang at NYU CDS