Home/Researchers/Silvio Savarese

Researcher Profile

Silvio Savarese

BLIP-2 and frozen-encoder multimodal LLMs

Co-author, BLIP-2

Co-authored BLIP-2: a key step toward efficient vision-language models built around LLM backbones.

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Known For

The ideas, systems, and research directions that make this person worth knowing.

01

BLIP-2 and frozen-encoder multimodal LLMs

02

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

03

Multimodal

04

Vision-language

05

Vision-Language

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared topics

Radu Soricut

Gemini (multimodal foundation models)

4 sources

Important for understanding how multilingual NLP, translation, and multimodal reasoning meet inside production-scale frontier systems rather than staying separate research tracks.

Start HereRadu Soricut