Worth knowing because his work links earlier dense-retrieval research to later MRKL and Jamba systems, which makes his page a good bridge between classic NLP retrieval and newer hybrid LLM stacks.
Researcher Profile
Editor reviewedYoav Shoham
Hybrid Transformer–Mamba language models (Jamba)
Professor emeritus at Stanford University and co-founder of AI21 Labs
A field-shaping figure for agentic AI and multi-agent reasoning long before the current LLM cycle, and now one of the clearest bridges between that older intellectual lineage and AI21’s frontier-model work.
Organizations
Labs
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last reviewed
March 18, 2026
Known For
The ideas, systems, and research directions that make this person worth knowing.
01
Game theory and multi-agent systems
02
Institution building around AI21 Labs
03
Tracking the field through AI Index
04
Hybrid Transformer–Mamba language models (Jamba)
05
Jamba: A Hybrid Transformer-Mamba Language Model
06
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
An especially valuable page for understanding how AI systems get judged in practice, because it puts human evaluation and rubric design at the center rather than treating them as an afterthought to model building.
A useful long-tail AI21 page because it ties one of the less-public contributors to the company’s modular reasoning and hybrid-model line instead of leaving the profile as a generic Jamba coauthor page.
A strong researcher to study if you care about the semantic side of modern language systems, especially where evaluation, structured meaning representation, and tool-using LLM architectures meet.
Useful for understanding the AI21 line of work that tries to combine tool use, modular reasoning, and hybrid sequence architectures instead of treating LLMs as pure next-token engines.
Important because his work bridges classical machine-learning theory, autonomous-driving safety, and more recent frontier-model research rather than staying inside a single subfield.