Publications

Decoding Children's Gait Behavior

ECCV 2026
Yifan Shen, Boyi Li, Meihuan Huang, Yuanzhe Liu, Xu Cao, Jinyang Jin, Zhengyuan Li, Anglin Liu, Junho Kim, Jingyuan Zhu, Lan Fangzhou, Jianguo Cao, Jintai Chen, Ismini Lourentzou, James Matthew Rehg

Kirin: Animal Motion Generation from In-the-Wild Video

ECCV 2026
Brian Nlong Zhao, Zhuoyang Pan, James Matthew Rehg, Jiajun Wu, Shangzhe Wu

Layer-Aware Video Composition via Split-then-Merge

ECCV 2026
Ozgur Kara, Yujia Chen, Ming-Hsuan Yang, James Matthew Rehg, Wen-Sheng Chu, Du Tran

Narrative-Driven Paper-to-Slide Generation via ArcDeck

ECCV 2026
Tarik Can Ozden, Sachidanand VS, Furkan Horoz, Ozgur Kara, Junho Kim, James Matthew Rehg

STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding

ECCV 2026
Junho Kim, Hosu Lee, James Matthew Rehg, Minsu Kim, Yong Man Ro

CoherentHand: Temporally Consistent 3D Hand Trajectory Synthesis with Semantic Motion Priors

CVPR 2026, Findings
Bikram Boote, Junho Kim, Ozgur Kara, Sangmin Lee, James Matthew Rehg

Forecasting 3D Scanpaths in Egocentric Video

CVPR 2026
Fiona Ryan, Ishwarya Ananthabhotla, Yijun Qian, Judy Hoffman, James Matthew Rehg, Vamsi Krishna Ithapu, Calvin Murdock

Gaze Target Estimation Anywhere with Concepts

CVPR 2026
Xu Cao, Houze Yang, Vipin Gunda, Zhongyi Zhou, Tianyu Xu, Adarsh Kowdle, Inki Kim, James Matthew Rehg

How Much 3D Do Video Foundation Models Encode?

CVPR 2026
Zixuan Huang, Xiang Li, Zhaoyang Lv, James Matthew Rehg

Learning Predictive Visuomotor Coordination

CVPR 2026, Findings
Wenqi Jia, Bolin Lai, Miao Liu, Danfei Xu, James Matthew Rehg

Omni-MMSI: Toward Identity-attributed Social Interaction Understanding

CVPR 2026
Xinpeng Li, Bolin Lai, Hardy Chen, Shijian Deng, Cihang Xie, Yuyin Zhou, James Matthew Rehg, Yapeng Tian

Toward Diffusible High-Dimensional Latent Spaces: A Frequency Perspective

CVPR 2026
Bolin Lai, Xudong Wang, Saketh Rambhatla, James Matthew Rehg, Zsolt Kira, Rohit Girdhar, Ishan Misra

Vinedresser3D: Agentic Text-guided 3D Editing

CVPR 2026
Yankuan Chi, Xiang Li, Zixuan Huang, James Matthew Rehg

DiffVax: Optimization-Free Image Immunization Against Diffusion-Based Editing

ICLR 2026
Tarik Can Ozden, Ozgur Kara, Oguzhan Akcin, Kerem Zaman, Shashank Srivastava, Sandeep P. Chinchali, James Matthew Rehg

Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation

NeurIPS 2025 (Spotlight, Acceptance rate 3.2%)
Xiang Li, Zirui Wang, Zixuan Huang, James Matthew Rehg

DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images

NeurIPS 2025
Ozgur Kara, Harris Nisar, James Matthew Rehg

Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs

NeurIPS 2025
Yifan Shen, Yuanzhe Liu, Jingyuan Zhu, Xu Cao, Xiaofeng Zhang, Yixiao He, Wenming Ye, James Matthew Rehg, Ismini Lourentzou

Toward Human Deictic Gesture Target Estimation

NeurIPS 2025
Xu Cao, Pranav Virupaksha, Sangmin Lee, Bolin Lai, Wenqi Jia, Jintai Chen, James Matthew Rehg

Gaze-LLE: Gaze Target Estimation via Large-scale Learned Encoders

CVPR 2025 (Highlight, Acceptance rate 3.0%)
Fiona Ryan, Ajay Bati, Sangmin Lee, Daniel Bolya, Judy Hoffman, James M Rehg

Improving Personalized Search with Regularized Low-Rank Parameter Updates

CVPR 2025 (Highlight, Acceptance rate 3.0%)
Fiona Ryan, Josef Sivic, Fabian Caba Heilbron, Judy Hoffman, James M Rehg, Bryan Russell