Vatsal Agarwal

I am a second year Ph.D student in the department of Computer Science at the University of Maryland (UMD), advised by Professor Abhinav Shrivastava. My research lies at the intersection of computer vision and deep learning.

I completed my undergrad in Computer Science at the University of Maryland in 2022. During my undergrad, I worked as a research assistant at the National Institutes of Health Clinical Center to develop deep learning models for medical imaging. I also interned as a machine learning engineer at DeepHealth.

In the past, I have been fortunate to have worked with Youbao Tang, Bill Lotter, Ronald Summers, Suhas Srinivasan.

Email  /  CV  /  Scholar  /  Github

profile photo

Research

I am interested in developing computer vision models that are able to leverage multimodal knowledge from vision-language models for scene understanding tasks such as open-vocabulary segmentation. I am also broadly interested in developing efficient attention methods that scale across various applications including image classification and human mesh recovery.

maps MAPS: Memory Augmented Panoptic Segmentation
Vatsal Agarwal, Saksham Suri, Max Ehrlich, Abhinav Shrivastava
Under Review
Paper

Explore potential of VLM-generated neural memory for panoptic segmentation.

diff_ssl2 Do Text-free Diffusion Models Learn Discriminative Visual Representations?
Matthew Gwilliam* , Soumik Mukhopadhyay*, Vatsal Agarwal, Yosuke Yamaguchi, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Abhinav Shrivastava
Under Review
Project Page |Paper

Explore diffusion models as unified unsupervised image representation learning models for many recognition tasks. Propose DifFormer and DifFeed, novel mechanisms for fusing diffusion features for image classification.

diff_ssl1 Diffusion Models Beat GANs on Image Classification
Matthew Gwilliam* , Soumik Mukhopadhyay*, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Abhinav Shrivastava
preprint only
Project Page |Paper

Show the possible utility of diffusion models as unified unsupervised image representation learners.

coarsemetro Coarse-to-Fine Human Mesh Recovery with Transformers
Vatsal Agarwal, Mara Levy, Max Ehrlich, Youbao Tang, Ning Zhang, Abhinav Shrivastava
Under Review
Paper

Build efficient Transformer design for non-parametric human mesh recovery with coarse-to-fine pipeline.

coarsemetro Weakly Supervised Lesion Co-segmentation on CT Scans
Vatsal Agarwal*, Youbao Tang*, Jing Xiao, Ronald M. Summers
IEEE International Symposium on Biomedical Imaging (ISBI), 2020
Paper

Conducted thorough investigation on impact of different attention mechanisms for lesion segmentation.

coarsemetro Weakly-Supervised Lesion Segmentation on CT Scans using Co-Segmentation
Vatsal Agarwal*, Youbao Tang*, Jing Xiao, Ronald M. Summers
SPIE Medical Imaging, 2020
Paper

Developed attention-based co-segmentation model and applied it to task of lesion segmentation.


Template credits