Home/Researchers/Gal Shachaf

Researcher Profile

Editor reviewed

Gal Shachaf

Hybrid Transformer–Mamba language models (Jamba)

Researcher working on retrieval, modular reasoning, and hybrid language-model systems

Worth knowing because his work links earlier dense-retrieval research to later MRKL and Jamba systems, which makes his page a good bridge between classic NLP retrieval and newer hybrid LLM stacks.

Organizations

AI21 Labs

Labs

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Known For

The ideas, systems, and research directions that make this person worth knowing.

01

Dense retrieval without supervision

02

MRKL-style modular language systems

03

Hybrid language models at AI21

04

Hybrid Transformer–Mamba language models (Jamba)

05

Jamba: A Hybrid Transformer-Mamba Language Model

06

Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared canonical source

Clara Fridman

Hybrid Transformer–Mamba language models (Jamba)

4 sources

A distinctive page in this AI21 cluster because she brings a linguistics and human-evaluation angle to model work, especially around user interaction, multilingual language behavior, and how LLM performance gets tested in practice.

Start HereAI21 Labs

Shared canonical source

Noam Rozen

Hybrid Transformer–Mamba language models (Jamba)

4 sources

A useful long-tail AI21 page because it ties one of the less-public contributors to the company’s modular reasoning and hybrid-model line instead of leaving the profile as a generic Jamba coauthor page.

Start HereNoam Rozen

Shared canonical source

Tal Ness

Hybrid Transformer–Mamba language models (Jamba)

4 sources

A worthwhile long-tail researcher page because it makes the data-and-evaluation layer of modern language-model work visible instead of treating frontier systems as if they were only architecture or scaling stories.

Start HereTal Ness