Publications
Complete list of publications from the Vision and Autonomy Intelligence Lab at UCLA.
2025
- [NeurIPS Spotlight] Predictive Representation Learning for Human-Robot Interaction. In Advances in Neural Information Processing Systems, 2025
2024
- NeurIPS
- Localization and recognition of human action in 3D using transformers. Communications Engineering, 2024
- [TPAMI] Spatial Steerability of GANs via Self-Supervision from Discriminator. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
- Unsupervised Discovery of Steerable Factors When Graph Deep Generative Models Are Entangled. Transactions on Machine Learning Research, 2024
- [TPAMI] In-Domain GAN Inversion for Faithful Reconstruction and Editability. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
2023
- [ICRA] V2XP-ASG: Generating Adversarial Scenes for Vehicle-to-Everything Perception. IEEE International Conference on Robotics and Automation, 2023
- [TPAMI] GH-Feat: Learning Versatile Generative Hierarchical Features From GANs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
2022
- [NeurIPS] Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping. Neural Information Processing Systems, 2022
- [CoRL] CoBEVT: Cooperative Bird’s Eye View Semantic Segmentation with Sparse Transformers. Conference on Robot Learning, 2022
- [CVPR] Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
- [CVPR Oral] Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
- [AAAI] Visual Sound Localization in the Wild by Cross-Modal Interference Erasing. AAAI Conference on Artificial Intelligence, 2022
- [AAAI] SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations. AAAI Conference on Artificial Intelligence, 2022
- [IJCV] Disentangled inference for GANs with latently invertible autoencoder. International Journal of Computer Vision, 2022
- TPAMI
2021
- [CVPR] Instance localization for self-supervised detection pretraining. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021
- [CVPR] Positional encoding as spatial inductive bias in GANs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021
- [HDSR] DeepMiner: Discovering interpretable representations for mammogram classification and explanation. Harvard Data Science Review, 2021
- [TIP] Texture Memory-Augmented Deep Patch-Based Image Inpainting. IEEE Transactions on Image Processing, 2021
- Deep Learning for Scene Classification: A Survey. arXiv preprint arXiv:2101.10531, 2021
- Unsupervised Image Transformation Learning via Generative Adversarial Networks. arXiv preprint arXiv:2103.07751, 2021
- [AAAI] HiABP: Hierarchical Initialized ABP for Unsupervised Representation Learning. In Proceedings of the AAAI Conference on Artificial Intelligence, 2021
2020
- [CoRL] Learning Driving Decisions by Imitating Drivers’ Control Behaviors. Conference on Robot Learning, 2020
- [AAAI] Every frame counts: joint learning of video segmentation and optical flow. In Proceedings of the AAAI Conference on Artificial Intelligence, 2020
- [ECCV] A unified framework for shot type classification based on subject centric lens. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XI, 2020
- Improving the Fairness of Deep Generative Models without Retraining. arXiv preprint arXiv:2012.04842, 2020
- Neuro-symbolic program search for autonomous driving decision module design. Conference on Robot Learning, 2020
2019
- Discovering place-informative scenes and objects using social media photos. Royal Society Open Science, 2019
- Comparing the interpretability of deep networks via network dissection. In Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, 2019
- [NeurIPS Spotlight] Policy Continuation with Hindsight Inverse Dynamics. In Advances in Neural Information Processing Systems, 2019
- [ICCV Oral] A graph-based framework to bridge movies and synopses. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019
2018
- Expert identification of visual primitives used by CNNs during mammogram classification. In Medical Imaging 2018: Computer-Aided Diagnosis, 2018
- [CVPR] Recurrent residual module for fast inference in videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018
- Revisiting the importance of individual units in CNNs via ablation. arXiv preprint arXiv:1806.02891, 2018
- [ECCV] Factorizable net: an efficient subgraph-based framework for scene graph generation. In Proceedings of the European Conference on Computer Vision, 2018
- [ECCV] Single image intrinsic decomposition without a single intrinsic image. In Proceedings of the European Conference on Computer Vision, 2018
- Measuring human perceptions of a large-scale urban region using machine learning. Landscape and Urban Planning, 2018
- FaceFeat-GAN: a two-stage approach for identity-preserving face synthesis. arXiv preprint arXiv:1812.01288, 2018
2017
- [CVPR] Person search with natural language description. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017
- [IROS Oral] SegICP: Integrated deep semantic segmentation and pose estimation. In 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017
2016
- [AISTATS Oral] Optimization as estimation with Gaussian processes in bandit settings. In Artificial Intelligence and Statistics, 2016
- C-IMAGE: city cognitive mapping through geo-tagged photos. GeoJournal, 2016
2015
- [CVPR] ConceptLearner: Discovering visual concepts from weakly labeled image collections. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
- [ICLR Oral] Object detectors emerge in deep scene CNNs. In International Conference on Learning Representations, 2015
2014
- [ECCV] Recognizing city identity via attribute analysis of geo-tagged images. In European Conference on Computer Vision, 2014