Home/Researchers/Albert Gu

Researcher Profile

Editor reviewed

Albert Gu

State space models for sequence modeling

Assistant professor at Carnegie Mellon University

A key researcher for understanding why state-space models became a serious alternative to standard transformer stacks rather than a recurring side path.

Organizations

Carnegie Mellon University

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Official And External Links

Known For

The ideas, systems, and research directions that make this person worth knowing.

01

State-space models for sequence modeling

02

Mamba

03

Long-context and efficient sequence architectures

04

State space models for sequence modeling

05

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

06

Mamba (GitHub)

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

Signature Works

Additional papers, projects, or repositories that help flesh out the profile.

Supporting Sources

Additional links that help verify and flesh out this profile.

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared canonical source

Tri Dao

Efficient sequence models + attention kernels

4 sources

One of the clearest researchers to follow for efficient sequence-model systems, especially the line of work that made frontier training and inference materially faster rather than merely cleaner on paper.

Start HereTri Dao

Shared topic

Alan Arazi

Hybrid Transformer–Mamba language models (Jamba)

4 sources

A valuable page in this cluster because his public role description is unusually specific: post-training, steerability, and AI-generated evaluation data are exactly the kinds of practical problems strong researcher pages should make discoverable.

Start HereAlan Arazi

Shared topic

Amir Bergman

Hybrid Transformer–Mamba language models (Jamba)

3 sources

A useful systems-facing page because it ties one of the less-public engineers on the Jamba line to the practical work of turning hybrid-model research into shipped model releases.

Start HereAI21 Labs

Shared topic

Barak Peleg

Hybrid Transformer–Mamba language models (Jamba)

3 sources

Worth tracking on the architecture side of AI21 because his profile sits where infrastructure leadership, hybrid-model design, and the mechanics of shipping long-context systems overlap.

Start HereBarak Peleg