Researcher Profile

Editor reviewed

Albert Gu

State space models for sequence modeling

Assistant professor at Carnegie Mellon University

A key researcher for understanding why state-space models became a serious alternative to standard Transformer stacks instead of remaining a recurring side path.

Organizations

Carnegie Mellon University

Topics

Systems & Infrastructure

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Last reviewed

March 18, 2026

Best First Clicks

Albert Gu (profile)

Mamba: Linear-Time Sequence Modeling with Selective State Spaces (paper)

Efficiently Modeling Long Sequences with Structured State Spaces (paper)

Official And External Links

OpenAlex

Known For

The ideas, systems, and research directions that make this person worth knowing.

State-space models for sequence modeling

Mamba

Long-context and efficient sequence architectures

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Mamba (GitHub)

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

Albert Gu (profile)

Mamba: Linear-Time Sequence Modeling with Selective State Spaces (paper)

Efficiently Modeling Long Sequences with Structured State Spaces (paper)

Mamba (GitHub) (project)

Signature Works

Additional papers, projects, or repositories that help flesh out the profile.

Hungry Hungry Hippos: Towards Language Modeling with State Space Models (paper)

OpenAlex (profile)

Supporting Sources

Additional links that help verify and flesh out this profile.

OpenAlex (profile)

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared canonical source

Tri Dao

Efficient sequence models + attention kernels

4 sources

One of the clearest researchers to follow for efficient sequence-model systems, especially the line of work that made frontier training and inference materially faster rather than merely cleaner on paper.

Systems & Infrastructure

Start Here: Tri Dao

Shared topic

Alan Arazi

Hybrid Transformer–Mamba language models (Jamba)

4 sources

A valuable page in this cluster because his public role description is unusually specific: post-training, steerability, and AI-generated evaluation data are exactly the kinds of practical problems strong researcher pages should make discoverable.

AI21, Post-Training & Alignment, Evaluation & Benchmarks

Start Here: Alan Arazi

Shared topic

Amir Bergman

Hybrid Transformer–Mamba language models (Jamba)

3 sources

A useful systems-facing page because it ties one of the less-public engineers on the Jamba line to the practical work of turning hybrid-model research into shipped model releases.

AI21, Systems & Infrastructure

Start Here: AI21 Labs

Shared topic

Avshalom Manevich

Hybrid Transformer–Mamba language models (Jamba)

4 sources

A useful page because his public trail is broader than the generic Jamba author stub: it runs from earlier language grounding and text-similarity work into Jamba-1.5 and later multimodal hallucination mitigation.

AI21, Multimodal, Systems & Infrastructure

Start Here: Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Shared topic

Barak Lenz

Hybrid Transformer–Mamba language models (Jamba)

4 sources

One of the higher-signal people to know in the hybrid-LLM line because he sits at the point where AI21’s research architecture, long-context systems work, and real product deployment meet.

AI21, Evaluation & Benchmarks, Systems & Infrastructure

Start Here: Why AI Leaderboards Miss the Point

Shared topic

Barak Peleg

Hybrid Transformer–Mamba language models (Jamba)

3 sources

Worth tracking on the architecture side of AI21 because his profile sits where infrastructure leadership, hybrid-model design, and the mechanics of shipping long-context systems overlap.

AI21, Systems & Infrastructure

Start Here: Barak Peleg