Researcher Profile
Chunyuan Li
Visual instruction tuning (LLaVA)
Researcher at Microsoft
Co-authored Visual Instruction Tuning: a widely-cited recipe for LLaVA-style multimodal assistants.
Organizations
Microsoft
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last updated
March 20, 2026
Best First Clicks
Official And External Links
Known For
The ideas, systems, and research directions that make this person worth knowing.
1. Visual instruction tuning (LLaVA); see the code sketch below this list
2. Visual Instruction Tuning (the paper)
3. LLaVA (GitHub)
4. LLaVA
5. Multimodal
6. Vision-language
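To make the first item concrete: the LLaVA recipe connects a frozen vision encoder to a language model through a small learned projection, then instruction-tunes the combined model on machine-generated multimodal conversations. The Python sketch below is a loose illustration, not the authors' code; the class name, parameter names, and default dimensions are assumptions (roughly matching the published v1 setup of CLIP ViT-L/14 features projected into the LLM's embedding space).

import torch
import torch.nn as nn

class LlavaStyleConnector(nn.Module):
    # Sketch of the visual-instruction-tuning idea: a learned projection
    # maps frozen vision-encoder patch features into the language model's
    # embedding space, so the LLM attends to image "tokens" and text
    # tokens in a single sequence. Names and dimensions are illustrative
    # assumptions, not the authors' code.
    def __init__(self, vision_dim=1024, llm_dim=4096):
        super().__init__()
        # LLaVA v1 used a single linear layer as the connector;
        # later versions replaced it with a small MLP.
        self.proj = nn.Linear(vision_dim, llm_dim)

    def forward(self, image_features, text_embeddings):
        # image_features: (batch, num_patches, vision_dim), e.g. from a
        # frozen CLIP ViT-L/14 encoder.
        # text_embeddings: (batch, seq_len, llm_dim) from the LLM's
        # embedding table for the instruction text.
        visual_tokens = self.proj(image_features)
        # The LLM then processes [visual tokens, text tokens] as one sequence.
        return torch.cat([visual_tokens, text_embeddings], dim=1)

In the paper, training proceeds in two stages: the projection is first pretrained on image-caption pairs, and then the projection plus the LLM are fine-tuned on GPT-4-generated visual instruction data.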
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
A useful anchor for the open-model ecosystem because his path runs from EleutherAI’s training efforts into a more explicit alignment and interpretability agenda at Conjecture.
An important bridge figure between open-weight language-model communities and the modern alignment debate, especially when you want to understand how frontier capability, openness, and control arguments collide in practice.
One of the more useful people to study for the Gemini era because his work spans both the text core of multimodal frontier models and the optimization tricks that make those systems cheaper and more stable to train.