Researcher Profile

Editor reviewed

Rui-Jie Zhu

RWKV and efficient sequence modeling

PhD student at the University of California, Santa Cruz, working on efficient language models and spiking neural networks.

One of the stronger pages in this batch: his work spans the original RWKV paper, Eagle/Finch-adjacent work, and later efficient-language-model papers such as SpikeGPT and Gated Slot Attention, rather than ending at a single coauthor credit.

Organizations

University of California, Santa Cruz

Topics

Open Models, Systems & Infrastructure, Diffusion & Generative Media

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Last reviewed

March 18, 2026

Best First Clicks

Rui-Jie Zhu (profile)

RWKV: Reinventing RNNs for the Transformer Era (paper)

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks (paper)

Known For

The ideas, systems, and research directions that make this person worth knowing.

RWKV and Eagle/Finch sequence-model work

SpikeGPT

Efficient language modeling and spiking neural networks

RWKV and efficient sequence modeling

RWKV: Reinventing RNNs for the Transformer Era

RWKV (project)

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

Rui-Jie Zhu (profile)

RWKV: Reinventing RNNs for the Transformer Era (paper)

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks (paper)

RWKV (project)

Signature Works

Additional papers, projects, or repositories that help flesh out the profile.

Gated Slot Attention for Efficient Linear-Time Sequence Modeling (paper)

Supporting Sources

Additional links that help verify and flesh out this profile.

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence (paper)

Gated Slot Attention for Efficient Linear-Time Sequence Modeling (paper)

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared canonical source

Qihang Zhao

RWKV and efficient sequence modeling

4 sources

Useful because his work connects the main RWKV sequence-model line with the RWKV-inspired SpikeGPT branch, making the page more informative than a single coauthor record.

Open Models, Systems & Infrastructure

Start Here: RWKV: Reinventing RNNs for the Transformer Era

Shared canonical source

Bo Peng

RWKV and efficient sequence modeling

4 sources

Worth tracking if you care about alternatives to the standard transformer playbook, especially the line of work trying to keep strong language-model performance while making inference and memory use much cheaper.

Open Models, Systems & Infrastructure

Start Here: RWKV: Reinventing RNNs for the Transformer Era

Shared canonical source

Eric Alcaide

RWKV and efficient sequence modeling

5 sources

A distinctive page because his work bridges open-sequence-model experimentation with applied machine learning for molecules, proteins, and structural biology, and he shows up on multiple RWKV-family papers including the hybrid GoldFinch branch rather than only the first release.

Open Models, Systems & Infrastructure

Start Here: Eric Alcaide

Shared canonical source

Alon Albalak

RWKV and efficient sequence modeling

5 sources

A strong open-model and data-centric page because his work sits close to the infrastructure that made OLMo and Dolma useful to the broader research community rather than just another benchmark-driven model release.

Open Models, Evaluation & Benchmarks

Start Here: Alon Albalak

Shared canonical source

Samuel Arcadinho

RWKV and efficient sequence modeling

2 sources

Co-authored RWKV: Reinventing RNNs for the Transformer Era.

Open Models, Systems & Infrastructure

Start Here: RWKV: Reinventing RNNs for the Transformer Era

Shared canonical source

Huanqi Cao

RWKV and efficient sequence modeling

4 sources

Useful because his later work turns an otherwise thin RWKV byline into a real systems profile: after the original paper, his public work tracks toward large-scale pretraining infrastructure, pipeline parallelism, and systems support for frontier-scale models.

Open Models, Systems & Infrastructure

Start Here: Huanqi Cao at Tsinghua University