Co-authored BLIP: a high-impact recipe for unified vision-language understanding and generation.
Researcher Profile
Caiming Xiong
Bootstrapped vision-language pretraining (BLIP)
Researcher at Salesforce
Co-authored BLIP: a high-impact recipe for unified vision-language understanding and generation.
Organizations
Topics
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last updated
March 20, 2026
Official And External Links
Known For
The ideas, systems, and research directions that make this person worth knowing.
01
Bootstrapped vision-language pretraining (BLIP)
02
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
03
Multimodal
04
Vision-language
05
Vision-Language
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
Co-authored BLIP: a high-impact recipe for unified vision-language understanding and generation.
Co-authored BLIP: a high-impact recipe for unified vision-language understanding and generation.
Co-authored CLIP: a core reference for contrastive multimodal pretraining.
Co-authored CLIP: a core reference for contrastive multimodal pretraining.
Co-authored CLIP: a core reference for contrastive multimodal pretraining.