Rohit Jena
Iām a Ph.D. student in CIS at University of Pennsylvania advised by Prof. Pratik Chaudhari and Prof. James C. Gee.
My research broadly aims to answer the following questions:
(1) How can we incorporate task-specific invariances for correspondence matching problems? (2) What kind of self-supervised representations help us discover these task-invariant representations? (3) What kind of transferability will these representations have? My research questions stem from my belief in specialist models instead of generalist ones.
I spent Summer 2024 at the NeMo team at NVIDIA where I worked on alignment of text-to-image diffusion models to improve the Pareto front of the alignment-diversity trade-off. Previously, I interned at Amazon Lab126 where I worked on mesh-NeRF hybrids for rigged 3D avatars from 360 degree videos.
I completed my Masters in Robotics at The Robotics Institute, Carnegie Mellon University where I was advised by Prof. Katia Sycara. I also worked with Prof. Kayhan Batmanghelich on segmentation for medical images. I completed my bachelors in Computer Science and Engineering from Indian Institute of Technology, Bombay in 2019. My undergraduate thesis is based on Perfect Sampling and Uncertainty Estimation in Deep Networks where I was advised by Prof. Suyash P. Awate.
Selected Publications
- WACVElucidating optimal reward-diversity tradeoffs in text-to-image diffusion modelsWinter Conference on Applications of Computer Vision 2025Work done at NVIDIA
- CVPRBeyond mAP: Towards better evaluation of instance segmentationConference on Computer Vision and Pattern Recognition 2023Highlight paper
Acceptance rate ā¼2.5% of all papers, 10% of accepted papers
Icons in the CV are adapted from Font Awesome under the Creative Commons BY license.