Publications | Vision and Autonomy Intelligence Lab

Click the tags below to view our work by category. Please see Google Scholar for more recent works and arXiv papers.

2026

ICRA

Learning Sidewalk Autopilot from Multi-Scale Imitation with Corrective Behavior Expansion

Honglin He, Yukai Ma, Brad Squicciarini, Wayne Wu, and Bolei Zhou

In IEEE International Conference on Robotics and Automation, 2026

PDF Website
CVPR

AURA: Multi-modal Shared Autonomy for Urban Navigation

Yukai Ma, Honglin He, Selina Song, Wayne Wu, and Bolei Zhou

In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026

PDF Website
CVPR

Group Diffusion: Enhancing Image Generation by Unlocking Cross-Sample Collaboration

Sicheng Mo, Thao Nguyen, Richard Zhang, Nicholas Kolkin, Siddharth Srinivasan Iyer, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, and Yuheng Li

In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026

PDF Website
CVPR Highlight

Vista4D: Video Reshooting with 4D Point Clouds

Kuan Heng Lin, Zhizheng Liu, Pablo Salamanca, Yash Kant, Ryan Burgert, Koichi Namekata, Yiwei Zhao, Bolei Zhou, Micah Goldblum, Paul Debevec, and Ning Yu

In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026

PDF Website
CVPR

AnimaMimic: Imitating 3D Animation from Video Priors

Tianyi Xie, Yunuo Chen, Yaowei Guo, Yin Yang, Bolei Zhou, Demetri Terzopoulos, Ying Jiang, and Chenfanfu Jiang

In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026

PDF Website
HRI

DiSCo: Diffusion Sequence Copilots for Shared Autonomy

Andy Wang, Xu Yan, Brandon McMahan, Michael Zhou, Yuyang Yuan, Johannes Y. Lee, Ali Shreif, Matthew Li, Zhenghao Peng, Bolei Zhou, Yuchen Cui, and Jonathan C. Kao

In Proceedings of the 21st ACM/IEEE International Conference on Human-Robot Interaction, 2026

PDF Website
ICLR

UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos

Mingxuan Liu, Honglin He, Elisa Ricci, Wayne Wu, and Bolei Zhou

In International Conference on Learning Representation, 2026

PDF Website
ICLR

From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning

Honglin He, Yukai Ma, Brad Squicciarini, Wayne Wu, and Bolei Zhou

In International Conference on Learning Representation, 2026

PDF Website
ICLR

Joint Optimization for 4D Human-Scene Reconstruction in the Wild

Zhizheng Liu, Joe Lin, Wayne Wu, and Bolei Zhou

In International Conference on Learning Representation, 2026

PDF Website
ICLR

SceneStreamer: Continuous Scenario Generation as Next Token Group Prediction

Zhenghao Peng, Yuxin Liu, and Bolei Zhou

In International Conference on Learning Representation, 2026

PDF Website

2025

NeurIPS Spotlight

Predictive Preference Learning from Human Interventions

Haoyuan Cai, Zhenghao Peng, and Bolei Zhou

In Advances in Neural Information Processing Systems, 2025

PDF Website
NeurIPS

Adv-BMT: Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation

Yuxin Liu, Zhenghao Peng, Xuanhao Cui, and Bolei Zhou

In Advances in Neural Information Processing Systems, 2025

PDF Website
NeurIPS

AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning

Zewei Zhou, Tianhui Cai, Seth Z Zhao, Yun Zhang, Zhiyu Huang, Bolei Zhou, and Jiaqi Ma

In Advances in Neural Information Processing Systems, 2025

PDF Website
ICCV

X-Fusion: Introducing New Modality to Frozen Large Language Models

Sicheng Mo, Thao Nguyen, Xun Huang, Siddharth Srinivasan Iyer, Yijun Li, Yuchen Liu, Abhishek Tandon, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, and Yuheng Li

In International Conference on Computer Vision, 2025

PDF Website
ICCV

Occupancy Learning with Spatiotemporal Memory

Ziyang Leng, Jiawei Yang, Wenlong Yi, and Bolei Zhou

In International Conference on Computer Vision, 2025

PDF Website
IROS

CooPre: Cooperative Pretraining for V2X Cooperative Perception

Seth Z. Zhao, Hao Xiang, Chenfeng Xu, Xin Xia, Bolei Zhou, and Jiaqi Ma

In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025

PDF Website
ICCV

V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction

Zewei Zhou, Hao Xiang, Zhaoliang Zheng, Seth Z Zhao, Mingyue Lei, Yun Zhang, Tianhui Cai, Xinyi Liu, Johnson Liu, Maheswari Bajji, Xin Xia, Zhiyu Huang, Bolei Zhou, and Jiaqi Ma

In International Conference on Computer Vision, 2025

PDF Website
ICCV

TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction

Zewei Zhou^*, Seth Z. Zhao^*, Tianhui Cai, Zhiyu Huang, Bolei Zhou, and Jiaqi Ma

In International Conference on Computer Vision, 2025

PDF Website
ICCV

Verbalized Representation Learning for Interpretable Few-Shot Generalization

Cheng-Fu Yang, Da Yin, Wenbo Hu, Heng Ji, Nanyun Peng, Bolei Zhou, and Kai-Wei Chang

In International Conference on Computer Vision, 2025

PDF
ICML

WOMD-Reasoning: A Large-Scale Dataset for Interaction Reasoning in Driving

Yiheng Li, Cunxin Fan, Chongjian Ge, Zhihao Zhao, Chenran Li, Chenfeng Xu, Huaxiu Yao, Masayoshi Tomizuka, Bolei Zhou, Chen Tang, Mingyu Ding, and Wei Zhan

In International Conference on Machine Learning, 2025

PDF Website
ICML

AIM: Robot-Gated Interactive Imitation Learning with Adaptive Intervention Mechanism

Haoyuan Cai, Zhenghao Peng, and Bolei Zhou

In International Conference on Machine Learning, 2025

PDF Website
CVPR Highlight

Towards Autonomous Micromobility through Scalable Urban Simulation

Wayne Wu^*, Honglin He^*, Chaoyuan Zhang, Jack He, Seth Z. Zhao, Ran Gong, Quanyi Li, and Bolei Zhou

In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

PDF Website
CVPR

Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation

Ziyang Xie, Zhizheng Liu, Zhenghao Peng, Wayne Wu, and Bolei Zhou

In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

PDF Website
CVPR

Embodied Scene Understanding for Vision Language Models via MetaVQA

Weizhen Wang, Chenda Duan, Zhenghao Peng, Yuxin Liu, and Bolei Zhou

In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

PDF Website
ICRA

Data-Efficient Learning from Human Interventions for Mobile Robots

Zhenghao Peng, Zhizheng Liu, and Bolei Zhou

In International Conference on Robotics and Automation, 2025

PDF Website
RA-L

BEVCon: Advancing Bird’s Eye View Perception With Contrastive Learning

Ziyang Leng, Jiawei Yang, Zhicheng Ren, and Bolei Zhou

In IEEE Robotics and Automation Letters, 2025

PDF Code
ICLR Spotlight

MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility

Wayne Wu, Honglin He, Jack He, Yiran Wang, Chenda Duan, Zhizheng Liu, Quanyi Li, and Bolei Zhou

International Conference on Learning Representations, 2025

PDF Website
ICLR

Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels

Zhizheng Liu, Joe Lin, Wayne Wu, and Bolei Zhou

International Conference on Learning Representations, 2025

PDF Website
ICLR

3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Qihang Zhang, Yinghao Xu, Chaoyang Wang, Hsin-Ying Lee, Gordon Wetzstein, Bolei Zhou, and Ceyuan Yang

International Conference on Learning Representations, 2025

PDF Website

2024

arXiv

Urban Scene Diffusion through Semantic Occupancy Map

Junge Zhang, Qihang Zhang, Li Zhang, Ramana Rao Kompella, Gaowen Liu, Jiachen Li, and Bolei Zhou

In Preprint, 2024

PDF Website
RA-L

BEVGen: Street-View Image Generation from a Bird’s-Eye View Layout

Alexander Swerdlow, Runsheng Xu, and Bolei Zhou

In IEEE Robotics and Automation Letters, 2024

PDF Code Website
NeurIPS

SimGen: Simulator-conditioned Driving Scene Generation

Yunsong Zhou, Michael Simon, Zhenghao Peng, Sicheng Mo, Hongzi Zhu, Minyi Guo, and Bolei Zhou

Advances in Neural Information Processing Systems, 2024

PDF Website
NeurIPS

Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance

Kuan Heng Lin, Sicheng Mo, Ben Klingher, Fangzhou Mu, and Bolei Zhou

Advances in Neural Information Processing Systems, 2024

PDF Website
NeurIPS

Shared Autonomy with IDA: Interventional Diffusion Assistance

Brandon J McMahan, Zhenghao Peng, Bolei Zhou, and Jonathan C Kao

Advances in Neural Information Processing Systems, 2024

PDF
Nature Communications

Localization and recognition of human action in 3D using transformers

Jiankai Sun, Linjiang Huang, Hongsong Wang, Chuanyang Zheng, Jianing Qiu, Md Tauhidul Islam, Enze Xie, Bolei Zhou, Lei Xing, Arjun Chandrasekaran, and others

Communications Engineering, 2024
Nature

Experiment-free Exoskeleton Assistance via Learning in Simulation

Shuzhen Luo, Menghan Jiang, Sainan Zhang, Junxi Zhu, Shuangyue Yu, Israel Dominguez Silva, Tian Wang, Elliott Rouse, Bolei Zhou, Hyunwoo Yuk, Xianlian Zhou, and Hao Su

Nature, 2024

PDF Video Website
Nature BME

Accurate prediction of disease-risk factors from volumetric medical scans by a deep vision model pre-trained with 2D scans

Oren Avram, Berkin Durmus, Nadav Rakocz, Giulia Corradetti, Ulzee An, Muneeswar G Nittala, Prerit Terway, Akos Rudas, Zeyuan Johnson Chen, Yu Wakatsuki, and others

Nature Biomedical Engineering, 2024

PDF Website
TPAMI

Spatial Steerability of GANs via Self-Supervision from Discriminator

Jianyuan Wang, Lalit Bhagat, Ceyuan Yang, Yinghao Xu, Yujun Shen, Hongdong Li, and Bolei Zhou

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024

PDF
CVPR

FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition

Sicheng Mo, Fangzhou Mu, Kuan Heng Lin, Yanli Liu, Bochen Guan, Yin Li, and Bolei Zhou

IEEE/CVF Computer Vision and Pattern Recognition Conference, 2024

PDF Website
CVPR

Scenewiz3d: Towards text-guided 3d scene composition

Qihang Zhang, Chaoyang Wang, Aliaksandr Siarohin, Peiye Zhuang, Yinghao Xu, Ceyuan Yang, Dahua Lin, Bolei Zhou, Sergey Tulyakov, and Hsin-Ying Lee

IEEE/CVF Computer Vision and Pattern Recognition Conference, 2024

PDF Website
CVPR

Berfscene: Bev-conditioned equivariant radiance fields for infinite 3d scene generation

Qihang Zhang, Yinghao Xu, Yujun Shen, Bo Dai, Bolei Zhou, and Ceyuan Yang

IEEE/CVF Computer Vision and Pattern Recognition Conference, 2024

PDF Website
TMLR

Unsupervised Discovery of Steerable Factors When Graph Deep Generative Models Are Entangled

Shengchao Liu, Chengpeng Wang, Jiarui Lu, Weili Nie, Hanchen Wang, Zhuoxinran Li, Bolei Zhou, and Jian Tang

Transactions on Machine Learning Research, 2024

PDF
RAL

Street-View Image Generation from a Bird’s-Eye View Layout

Alexander Swerdlow, Runsheng Xu, and Bolei Zhou

IEEE Robotics and Automation Letters, 2024

PDF Website
TPAMI

In-Domain GAN Inversion for Faithful Reconstruction and Editability

Jiapeng Zhu, Yujun Shen, Yinghao Xu, Deli Zhao, Qifeng Chen, and Bolei Zhou

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024

PDF
3DV spotlight

Efficient 3d articulated human generation with layered surface volumes

Yinghao Xu, Wang Yifan, Alexander W Bergman, Menglei Chai, Bolei Zhou, and Gordon Wetzstein

The 11th International Conference on 3D Vision, 2024

PDF Website

2023

NeurIPS Spotlight

Learning from Active Human Involvement through Proxy Value Propagation

Zhenghao Peng, Wenjie Mo, Chenda Duan, Quanyi Li, and Bolei Zhou

In Advances in Neural Information Processing Systems, 2023

Code Website
CoRL

CAT: Closed-loop Adversarial Training for Safe End-to-End Driving

Linrui Zhang, Zhenghao Peng, Quanyi Li, and Bolei Zhou

In 7th Annual Conference on Robot Learning, 2023

PDF Code Website
NeurIPS

ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling

Quanyi Li^*, Zhenghao Peng^*, Lan Feng^*, Zhizheng Liu, Chenda Duan, Wenjie Mo, and Bolei Zhou

In Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2023

PDF Code Website
ICLR

Guarded Policy Optimization with Imperfect Online Demonstrations

Zhenghai Xue, Zhenghao Peng, Quanyi Li, Zhihan Liu, and Bolei Zhou

In International Conference on Learning Representation, 2023

PDF Code Website
ICRA

TrafficGen: Learning to Generate Diverse and Realistic Traffic Scenarios

Lan Feng^*, Quanyi Li^*, Zhenghao Peng^*, Shuhan Tan, and Bolei Zhou

In International Conference on Robotics and Automation, 2023

PDF Video Code Website
ICCV

One-shot generative domain adaptation

Ceyuan Yang, Yujun Shen, Zhiyi Zhang, Yinghao Xu, Jiapeng Zhu, Zhirong Wu, and Bolei Zhou

International Conference on Computer Vision, 2023

PDF Website
TMLR

ChemSpacE: Interpretable and Interactive Chemical Space Exploration

Yuanqi Du, Xian Liu, Nilay Shah, Shengchao Liu, Jieyu Zhang, and Bolei Zhou

Transactions on Machine Learning Research, 2023

PDF Code
CVPR Highlight

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis

Yinghao Xu, Menglei Chai, Zifan Shi, Sida Peng, Ivan Skorokhodov, Aliaksandr Siarohin, Ceyuan Yang, Yujun Shen, Hsin-Ying Lee, Bolei Zhou, and Sergey Tulyakov

IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PDF Website
CVPR Highlight

V2V4Real: A large-scale real-world dataset for Vehicle-to-Vehicle Cooperative Perception

Runsheng Xu, Xin Xia, Jinlong Li, Hanzhao Li, Shuo Zhang, Zhengzhong Tu, Zonglin Meng, Hao Xiang, Xiaoyu Dong, Rui Song, Hongkai Yu, Bolei Zhou, and Jiaqi Ma

IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PDF Website
ICLR

Towards Smooth Video Composition

Qihang Zhang, Ceyuan Yang, Yujun Shen, Yinghao Xu, and Bolei Zhou

International Conference on Learning Representations, 2023

PDF Website
ICRA

V2XP-ASG: Generating Adversarial Scenes for Vehicle-to-Everything Perception

Hao Xiang, Runsheng Xu, Xin Xia, Zhaoliang Zheng, Bolei Zhou, and Jiaqi Ma

IEEE International Conference on Robotics and Automation, 2023

PDF
TPAMI

GH-Feat: Learning Versatile Generative Hierarchical Features From GANs

Yinghao Xu, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, and Bolei Zhou

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023

PDF

2022

NeurIPS

Human-AI Shared Control via Policy Dissection

Quanyi Li, Zhenghao Peng, Haibin Wu, Lan Feng, and Bolei Zhou

In Advances in Neural Information Processing Systems, 2022

PDF Code Website
ICLR

Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization

Quanyi Li^*, Zhenghao Peng^*, and Bolei Zhou

In International Conference on Learning Representations, 2022

PDF Video Code Website
NeurIPS

Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping

Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, and Bolei Zhou

Neural Information Processing Systems, 2022

PDF
NeurIPS

Improving GANs with A Dynamic Discriminator

Ceyuan Yang, Yujun Shen, Yinghao Xu, Deli Zhao, Bo Dai, and Bolei Zhou

Neural Information Processing Systems, 2022

PDF Website
CoRL

CoBEVT: Cooperative Bird’s Eye View Semantic Segmentation with Sparse Transformers

Runsheng Xu, Zhengzhong Tu, Hao Xiang, Wei Shao, Bolei Zhou, and Jiaqi Ma

Conference on Robot Learning, 2022

PDF
ECCV

Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining

Qihang Zhang, Zhenghao Peng, and Bolei Zhou

European Conference on Computer Vision, 2022

PDF Website
ECCV Oral

Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, and Bolei Zhou

European Conference on Computer Vision, 2022

PDF Website
CVPR

Improving GAN Equilibrium by Raising Spatial Awareness

Jianyuan Wang, Ceyuan Yang, Yinghao Xu, Yujun Shen, Hongdong Li, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

PDF Code Website
CVPR

3D-aware Image Synthesis via Learning Structural and Textural Representations

Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

PDF Code Website
CVPR

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

PDF
CVPR Oral

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

Yinghao Xu, Fangyun Wei, Xiao Sun, Ceyuan Yang, Yujun Shen, Bo Dai, Bolei Zhou, and Stephen Lin

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

PDF
ICRA+RAL

PlaTe: Visually-Grounded Planning with Transformers in Procedural Tasks

Jiankai Sun, De-An Huang, Bo Lu, Yunhui Liu, Bolei Zhou, and Animesh Garg

IEEE Robotics and Automation Letters, 2022

PDF Website
AAAI

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing

Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, and Xiaowei Zhou

AAAI Conference on Artificial Intelligence, 2022

PDF
AAAI

SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations

Zhenyu Li, Zehui Chen, Ang Li, Liangji Fang, Qinhong Jiang, Xianming Liu, Junjun Jiang, Bolei Zhou, and Hang Zhao

AAAI Conference on Artificial Intelligence, 2022

PDF
Book Chapter

Interpreting Generative Adversarial Networks for Interactive Image Generation

Bolei Zhou

xxAI - Beyond explainable Artificial Intelligence, 2022

PDF Video Slides
IJCV

Disentangled inference for gans with latently invertible autoencoder

Jiapeng Zhu, Deli Zhao, Bo Zhang, and Bolei Zhou

International Journal on Computer Vision, 2022

PDF
TPAMI

Gan inversion: A survey

Weihao Xia, Yulun Zhang, Yujiu Yang, Jing-Hao Xue, Bolei Zhou, and Ming-Hsuan Yang

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022

PDF

2021

NeurIPS

Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization

Zhenghao Peng, Quanyi Li, Chunxiao Liu, and Bolei Zhou

In Advances in Neural Information Processing Systems, 2021

PDF Code Website
CoRL

Safe Driving via Expert Guided Policy Optimization

Zhenghao Peng^*, Quanyi Li^*, Chunxiao Liu, and Bolei Zhou

In 5th Annual Conference on Robot Learning , 2021

PDF Video Code Website
TPAMI

MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning

Quanyi Li^*, Zhenghao Peng^*, Lan Feng, Qihang Zhang, Zhenghai Xue, and Bolei Zhou

In , 2021

PDF Video Code Website
NeurIPS

Data-Efficient Instance Generation from Instance Discrimination

Ceyuan Yang, Yujun Shen, Yinghao Xu, and Bolei Zhou

Neural Information Processing Systems, 2021

PDF Code Website
CVPR Oral

Closed-form factorization of latent semantics in GANs

Yujun Shen, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021

PDF Website
CVPR Oral

Generative hierarchical features from synthesizing images

Yinghao Xu, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021

PDF Website
CVPR

Instance localization for self-supervised detection pretraining

Ceyuan Yang, Zhirong Wu, Bolei Zhou, and Stephen Lin

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021

PDF
CVPR

Multimodal Motion Prediction with Stacked Transformers

Yicheng Liu, Jinghuai Zhang, Liangji Fang, Qinhong Jiang, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021

PDF Website
CVPR

Positional encoding as spatial inductive bias in gans

Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, and Chen Change Loy

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021

PDF
ICCV

TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization

Wei Gao, Fang Wan, Xingjia Pan, Zhiliang Peng, Qi Tian, Zhenjun Han, Bolei Zhou, and Qixiang Ye

International Conference on Computer Vision, 2021

PDF Code
HDSR

Deepminer: Discovering interpretable representations for mammogram classification and explanation

Jimmy Wu, Bolei Zhou, Diondra Peck, Scott Hsieh, Vandana Dialani, Lester Mackey, and Genevieve Patterson

Harvard Data Science Review, 2021

PDF
IJCV

Semantic hierarchy emerges in deep generative representations for scene synthesis

Ceyuan Yang, Yujun Shen, and Bolei Zhou

International Journal of Computer Vision, 2021

PDF Code Website
TIP

Texture Memory-Augmented Deep Patch-Based Image Inpainting

Rui Xu, Minghao Guo, Jiaqi Wang, Xiaoxiao Li, Bolei Zhou, and Chen Change Loy

IEEE Transactions on Image Processing, 2021

PDF
Deep Learning for Scene Classification: A Survey

Delu Zeng, Minyu Liao, Mohammad Tavakolian, Yulan Guo, Bolei Zhou, Dewen Hu, Matti Pietikäinen, and Li Liu

arXiv preprint arXiv:2101.10531, 2021
IROS+RAL

Adversarial Inverse Reinforcement Learning With Self-Attention Dynamics Model

Jiankai Sun, Lantao Yu, Pinqian Dong, Bo Lu, and Bolei Zhou

IEEE Robotics and Automation Letters, 2021

PDF Code
Unsupervised Image Transformation Learning via Generative Adversarial Networks

Kaiwen Zha, Yujun Shen, and Bolei Zhou

arXiv preprint arXiv:2103.07751, 2021
AAAI

HiABP: Hierarchical Initialized ABP for Unsupervised Representation Learning

Jiankai Sun, Rui Liu, and Bolei Zhou

In Proceedings of the AAAI Conference on Artificial Intelligence, 2021

PDF
Safe Exploration by Solving Early Terminated MDP

Hao Sun, Ziping Xu, Meng Fang, Zhenghao Peng, Jiadong Guo, Bo Dai, and Bolei Zhou

arXiv preprint arXiv:2107.04200, 2021

PDF

2020

IROS+RAL

Cross-view semantic segmentation for sensing surroundings

Bowen Pan, Jiankai Sun, Ho Yin Tiga Leung, Alex Andonian, and Bolei Zhou

IEEE Robotics and Automation Letters, 2020

PDF Website
Semantic photo manipulation with a generative image prior

David Bau, Hendrik Strobelt, William Peebles, Jonas Wulff, Bolei Zhou, Jun-Yan Zhu, and Antonio Torralba

SIGGRAPH, 2020

PDF Website
CVPR

Interpreting the latent space of gans for semantic face editing

Yujun Shen, Jinjin Gu, Xiaoou Tang, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PDF Website
CoRL

Learning Driving Decisions by Imitating Drivers’ Control Behaviors

Junning Huang, Sirui Xie, Jiankai Sun, Qiurui Ma, Chunxiao Liu, Jianping Shi, Dahua Lin, and Bolei Zhou

Conference on Robot Learning, 2020

PDF
AAAI

Every frame counts: joint learning of video segmentation and optical flow

Mingyu Ding, Zhe Wang, Bolei Zhou, Jianping Shi, Zhiwu Lu, and Ping Luo

In Proceedings of the AAAI Conference on Artificial Intelligence, 2020

PDF
CVPR

Image Processing Using Multi-Code GAN Prior

Jinjin Gu, Yujun Shen, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PDF Website
CVPR

Transmomo: Invariance-driven unsupervised video motion retargeting

Zhuoqian Yang, Wentao Zhu, Wayne Wu, Chen Qian, Qiang Zhou, Bolei Zhou, and Chen Change Loy

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PDF Website
ECCV

In-domain gan inversion for real image editing

Jiapeng Zhu, Yujun Shen, Deli Zhao, and Bolei Zhou

In European Conference on Computer Vision, 2020

PDF Website
CVPR

A local-to-global approach to multi-modal movie scene segmentation

Anyi Rao, Linning Xu, Yu Xiong, Guodong Xu, Qingqiu Huang, Bolei Zhou, and Dahua Lin

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PDF Website
CVPR

Temporal pyramid network for action recognition

Ceyuan Yang, Yinghao Xu, Jianping Shi, Bo Dai, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PDF Code
CVPR

Tpnet: Trajectory proposal network for motion prediction

Liangji Fang, Qinhong Jiang, Jianping Shi, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PDF Website
Evolutionary Stochastic Policy Distillation

Hao Sun, Xinyu Pan, Bo Dai, Dahua Lin, and Bolei Zhou

arXiv preprint arXiv:2004.12909, 2020

PDF
Novel Policy Seeking with Constrained Optimization

Hao Sun, Zhenghao Peng, Bo Dai, Jian Guo, Dahua Lin, and Bolei Zhou

arXiv preprint arXiv:2005.10696, 2020

PDF
TPAMI

InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs

Yujun Shen, Ceyuan Yang, Xiaoou Tang, and Bolei Zhou

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020

PDF Website
ECCV

A unified framework for shot type classification based on subject centric lens

Anyi Rao, Jiaze Wang, Linning Xu, Xuekun Jiang, Qingqiu Huang, Bolei Zhou, and Dahua Lin

In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XI 16, 2020
PNAS

Understanding the role of individual units in a deep neural network

David Bau, Jun-Yan Zhu, Hendrik Strobelt, Agata Lapedriza, Bolei Zhou, and Antonio Torralba

Proceedings of the National Academy of Sciences, 2020

PDF Website
Improving the Fairness of Deep Generative Models without Retraining

Shuhan Tan, Yujun Shen, and Bolei Zhou

arXiv preprint arXiv:2012.04842, 2020

PDF
Neuro-symbolic program search for autonomous driving decision module design

Jiankai Sun1 Hao Sun1 Tian Han, and Bolei Zhou

Conference on Robot Learning, 2020

PDF
Improving the Generalization of End-to-End Driving through Procedural Generation

Quanyi Li, Zhenghao Peng, Qihang Zhang, Chunxiao Liu, and Bolei Zhou

arXiv preprint arXiv:2012.13681, 2020

PDF Code

2019

TPAMI

Moments in time dataset: one million videos for event understanding

Mathew Monfort, Alex Andonian, Bolei Zhou, Kandan Ramakrishnan, Sarah Adel Bargal, Tom Yan, Lisa Brown, Quanfu Fan, Dan Gutfreund, Carl Vondrick, and others

IEEE transactions on pattern analysis and machine intelligence, 2019

PDF Code Website
ICLR

Gan dissection: Visualizing and understanding generative adversarial networks

David Bau, Jun-Yan Zhu, Hendrik Strobelt, Bolei Zhou, Joshua B Tenenbaum, William T Freeman, and Antonio Torralba

International Conference on Learning Representations, 2019

PDF Website
Discovering place-informative scenes and objects using social media photos

Fan Zhang, Bolei Zhou, Carlo Ratti, and Yu Liu

Royal Society open science, 2019
CVPR

Deep flow-guided video inpainting

Rui Xu, Xiaoxiao Li, Bolei Zhou, and Chen Change Loy

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019

PDF Code Website
CVPR

Drivingstereo: A large-scale dataset for stereo matching in autonomous driving scenarios

Guorun Yang, Xiao Song, Chaoqin Huang, Zhidong Deng, Jianping Shi, and Bolei Zhou

In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019

PDF Website
Comparing the interpretability of deep networks via network dissection

Bolei Zhou, David Bau, Aude Oliva, and Antonio Torralba

In Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, 2019
CVPR

Reasoning about human-object interactions through dual attention networks

Tete Xiao, Quanfu Fan, Dan Gutfreund, Mathew Monfort, Aude Oliva, and Bolei Zhou

In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019

PDF Website
NeurIPS Spotlight

Policy Continuation with Hindsight Inverse Dynamics

Hao Sun, Zhizhong Li, Xiaotong Liu, Dahua Lin, and Bolei Zhou

In Advances in Neural Information Processing Systems, 2019

PDF
CVPR Oral

A graph-based framework to bridge movies and synopses

Yu Xiong, Qingqiu Huang, Lingfeng Guo, Hang Zhou, Bolei Zhou, and Dahua Lin

In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019

PDF
ICCV Oral

Seeing what a gan cannot generate

David Bau, Jun-Yan Zhu, Jonas Wulff, William Peebles, Hendrik Strobelt, Bolei Zhou, and Antonio Torralba

In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019

PDF Website

2018

IJCV

Semantic understanding of scenes through the ADE20K dataset

Bolei Zhou, Hang Zhao, Xavier Puig, Tete Xiao, Sanja Fidler, Adela Barriuso, and Antonio Torralba

International Journal on Computer Vision, 2018

PDF Code Website
CVPR

Visual question generation as dual task of visual question answering

Yikang Li, Nan Duan, Bolei Zhou, Xiao Chu, Wanli Ouyang, Xiaogang Wang, and Ming Zhou

In Proceedings of the IEEE conference on computer vision and pattern recognition, 2018

PDF Code
TPAMI

Interpreting Deep Visual Representations via Network Dissection

Bolei Zhou, David Bau, Aude Oliva, and Antonio Torralba

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018

PDF Code Website
ECCV

Temporal relational reasoning in videos

Bolei Zhou, Alex Andonian, Aude Oliva, and Antonio Torralba

In European Conference on Computer Vision, 2018

PDF Code Website
Expert identification of visual primitives used by CNNs during mammogram classification

Jimmy Wu, Diondra Peck, Scott Hsieh, Vandana Dialani, Constance D Lehman, Bolei Zhou, Vasilis Syrgkanis, Lester Mackey, and Genevieve Patterson

In Medical Imaging 2018: Computer-Aided Diagnosis, 2018

PDF
CVPR

Recurrent residual module for fast inference in videos

Bowen Pan, Wuwei Lin, Xiaolin Fang, Chaoqin Huang, Bolei Zhou, and Cewu Lu

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018

PDF
Revisiting the importance of individual units in cnns via ablation

Bolei Zhou, Yiyou Sun, David Bau, and Antonio Torralba

arXiv preprint arXiv:1806.02891, 2018

PDF
ECCV

Factorizable net: an efficient subgraph-based framework for scene graph generation

Yikang Li, Wanli Ouyang, Bolei Zhou, Jianping Shi, Chao Zhang, and Xiaogang Wang

In Proceedings of the European Conference on Computer Vision, 2018

PDF
ECCV

Unified perceptual parsing for scene understanding

Tete Xiao, Yingcheng Liu, Bolei Zhou, Yuning Jiang, and Jian Sun

In European Conference on Computer Vision, 2018

PDF Code
IROS

Real-time object pose estimation with pose interpreter networks

Jimmy Wu, Bolei Zhou, Rebecca Russell, Vincent Kee, Syler Wagner, Mitchell Hebert, Antonio Torralba, and David MS Johnson

In 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018

PDF Code
ECCV

Single image intrinsic decomposition without a single intrinsic image

Wei-Chiu Ma, Hang Chu, Bolei Zhou, Raquel Urtasun, and Antonio Torralba

In Proceedings of the European Conference on Computer Vision, 2018

PDF
ECCV

Interpretable basis decomposition for visual explanation

Bolei Zhou, Yiyou Sun, David Bau, and Antonio Torralba

In Proceedings of the European Conference on Computer Vision, 2018

PDF Code
Measuring human perceptions of a large-scale urban region using machine learning

Fan Zhang, Bolei Zhou, Liu Liu, Yu Liu, Helene H Fung, Hui Lin, and Carlo Ratti

Landscape and Urban Planning, 2018
Interpretable representation learning for visual intelligence

Bolei Zhou

PhD Thesis, 2018

Video
Facefeat-gan: a two-stage approach for identity-preserving face synthesis

Yujun Shen, Bolei Zhou, Ping Luo, and Xiaoou Tang

arXiv preprint arXiv:1812.01288, 2018

2017

CVPR

Person search with natural language description

Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017

PDF
IROS Oral

Segicp: Integrated deep semantic segmentation and pose estimation

Jay M Wong, Vincent Kee, Tiffany Le, Syler Wagner, Gian-Luca Mariottini, Abraham Schneider, Lei Hamilton, Rahul Chipalkatty, Mitchell Hebert, David MS Johnson, and others

In 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017

PDF
ICCV

Open vocabulary scene parsing

Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, and Antonio Torralba

In Proceedings of the IEEE International Conference on Computer Vision, 2017

PDF Website
CVPR

Scene parsing through ade20k dataset

Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso, and Antonio Torralba

In Proceedings of the IEEE conference on computer vision and pattern recognition, 2017

PDF Code Website
CVPR Oral

Network dissection: Quantifying interpretability of deep visual representations

David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, and Antonio Torralba

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017

PDF Website
TPAMI

Places: A 10 million image database for scene recognition

Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba

IEEE transactions on pattern analysis and machine intelligence, 2017

PDF Code Website
ICCV

Scene graph generation from objects, phrases and region captions

Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, and Xiaogang Wang

In Proceedings of the IEEE international conference on computer vision, 2017

PDF Code

2016

CVPR

Learning deep features for discriminative localization

Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba

In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016

PDF Video Website
AISTATS Oral

Optimization as estimation with Gaussian processes in bandit settings

Zi Wang, Bolei Zhou, and Stefanie Jegelka

In Artificial Intelligence and Statistics, 2016

PDF
C-IMAGE: city cognitive mapping through geo-tagged photos

Liu Liu, Bolei Zhou, Jinhua Zhao, and Brent D Ryan

GeoJournal, 2016

2015

CVPR

Conceptlearner: Discovering visual concepts from weakly labeled image collections

Bolei Zhou, Vignesh Jagadeesh, and Robinson Piramuthu

In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015

PDF
ICLR Oral

Object detectors emerge in deep scene cnns

Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba

In International Conference on Learning Representations, 2015

PDF
Simple baseline for visual question answering

Bolei Zhou, Yuandong Tian, Sainbayar Sukhbaatar, Arthur Szlam, and Rob Fergus

arXiv preprint arXiv:1512.02167, 2015

PDF

2014

TPAMI

Measuring Crowd Collectiveness

Bolei Zhou, Xiaoou Tang, Hepeng Zhang, and Xiaogang Wang

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014

PDF Website
IJCV

Learning Collective Crowd Behaviors with Dynamic Pedestrian-Agents

Bolei Zhou, Xiaoou Tang, and Xiaogang Wang

International Journal of Computer Vision, 2014

PDF Website
ECCV

Recognizing city identity via attribute analysis of geo-tagged images

Bolei Zhou, Liu Liu, Aude Oliva, and Antonio Torralba

In European conference on computer vision, 2014

PDF
NeurIPS Spotlight

Learning Deep Features for Scene Recognition using Places Database

Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, and Aude Oliva

In NIPS, 2014

PDF Website

2013

CVPR Oral

Measuring Crowd Collectiveness

Bolei Zhou, Xiaoou Tang, and Xiaogang Wang

IEEE Conference on Computer Vision and Pattern Recognition, 2013

PDF Website

2012

CVPR Oral

Understanding collective crowd behaviors: Learning a mixture model of dynamic pedestrian-agents

Bolei Zhou, Xiaogang Wang, and Xiaoou Tang

In IEEE Conference on Computer Vision and Pattern Recognition, 2012

HTML PDF
ECCV

Coherent filtering: Detecting coherent motions from crowd clutters

Bolei Zhou, Xiaoou Tang, and Xiaogang Wang

In European Conference on Computer Vision, 2012

PDF Website

2011

CVPR

Random field topic model for semantic region analysis in crowded scenes from tracklets

Bolei Zhou, Xiaogang Wang, and Xiaoou Tang

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2011

PDF Website