Thao Nguyen (Shibe)

alt text
Photo taken at Googleplex @ Summer 2024 ╮ (. ❛ ᴗ ❛.) ╭
 

Hi, I'm Thao! 👋
I'm a CS PhD student @ UW—Madison 🦡, working with Prof. Yong Jae Lee.
I'm fortunate to also have support from senior students: Dr. Utkarsh Ojha, Dr. Yuheng Li, Dr. Haotian Liu 🌋, and Dr. Mu Cai~

Before that, I was an AI Research Resident @ VinAI Research (acquired by Qualcomm), where I had the privilege of working closely with Dr. Anh Tran, Prof. Minh Hoai Nguyen, Dr. Duc Thanh Nguyen, and many amazing folks there.
These people have motivated me to pursuit higher education.
Thank you! 🙇‍♀️🎓

。𖦹°‧ I'm taking a PhD Minor in Educational Psychology and was lucky enough to learn a lot about stereotypes and STEM from Prof. Christy Starr (thank you!!).
This inspired me to create viet-wisc: a collection of Vietnamese women working in Computer Science (PhD). If you're 🇻🇳🙋‍♀️, you might find it inspiring~ 🌱

Email / GitHub / Google Scholar / Resume | CV

I'm looking for research opportunities/interns around Personalized AI ~~ Please email me if you think I'd be a good fit~ thao.nguyen@wisc.edu 🤗

About Me

4th year PhD Student – Univeristy of Wisconsin - Madison.

Contact: “thao.nguyen at wisc dot edu“ | “thao.ntp0414 at gmail dot com”.

Research interests: Mutimodal Models; Image Understanding/ Generation.

Publications


alt text  

X-Fusion: Introducing New Modality to Frozen Large Language Models
Sicheng Mo, Thao Nguyen, Xun Huang, Siddharth Srinivasan Iyer, Yijun Li, Yuchen Liu, Abhishek Tandon, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, Yuheng Li
International Conference on Computer Vision (ICCV), 2025
Best paper at 🏆 CVPR 2025 Workshop: "Transformers for Vision (T4V)"
[ProjectPage, Code, Paper]

alt text  

Yo'Chameleon: Personalized Vision and Language Generation
Thao Nguyen, Krishna Kumar Singh, Jing Shi, Trung Bui, Yong Jae Lee, Yuheng Li
Conference on Computer Vision and Pattern Recognition (CVPR), 2025
(also accepted at 🧷 CVPR 2025 Workshop: "What is Next in Multimodal Foundation Models?")
[ProjectPage, Poster, Code, Paper]

alt text  

Yo'LLaVA: Your Personalized Language and Vision Assistant
Thao Nguyen, Haotian Liu, Yuheng Li, Mu Cai, Utkarsh Ojha, Yong Jae Lee
Neural Information Processing Systems (NeurIPS), 2024
[ProjectPage, Poster, Code, Paper]

alt text  

Edit One for All: Interactive Batch Image Editing
Thao Nguyen, Utkarsh Ojha, Yuheng Li, Haotian Liu, Yong Jae Lee
Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[ProjectPage, Poster, Code, Paper]

alt text  

Visual Instruction Inversion: Image Editing via Visual Prompting
Thao Nguyen, Yuheng Li, Utkarsh Ojha, Yong Jae Lee
Neural Information Processing Systems (NeurIPS), 2023
[ProjectPage, Poster, Code, Paper]

alt text  

Lipstick ain't enough: Beyond Color Matching for In-the-Wild Makeup Transfer
Thao Nguyen, Anh Tran, Minh Hoai
Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[ProjectPage, Code, Video, Paper]

Not CS publication :D

alt text  

Are Dining Expectations Culturally Conditioned? An analysis of Asian vs. American Restaurants (yes, it's not a CS conference :D)
Thao Nguyen (+Yuheng Li)
International Conference on Quantitative Ethnography (ICQE), 2025
Best poster nomination 🏆 | work done during my PhD minor course with Prof. David Williamson Shaffer 😊
[Poster, Paper]

Misc

alt text  
— I'm proud to be a hooman-assistant of Bo the Shiba and Mam the Cat (.❛ ᴗ ❛.)

"🎉 We are featuring on multiple Thao's papers. Can you spot us? 😝"