Research

MEID: Mixture-of-Experts with Internal Distillation for Long-Tailed Video Recognition
Xinjie Li, Huijuan Xu
Thirty-Seven AAAI Conference on Artificial Intelligence (AAAI), 2023
Paper Code

Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions
Zhi Li, Lu He, Huijuan Xu
European Conference on Computer Vision (ECCV), 2022
Paper Code

Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency
Jin Liu, Chongfeng Fan, Fengyu Zhou, Huijuan Xu
North American Chapter of the Association for Computational Linguistics (NAACL Findings), 2022
Paper Code

Disentangled Action Recognition with Knowledge Bases
Zhekun Luo, Shalini Ghosh, Devin Guillory, Keizo Kato, Trevor Darrell, Huijuan Xu
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Paper Code

Temporal Action Detection with Multi-level Supervision
Baifeng Shi, Qi Dai, Judy Hoffman, Kate Saenko, Trevor Darrell, Huijuan Xu
International Conference on Computer Vision (ICCV), 2021
Paper Code

Meta-Baseline: Rethinking the Effectiveness of Simple Meta-Learning for Few-Shot Learning
Yinbo Chen, Zhuang Liu, Huijuan Xu, Trevor Darrell, Xiaolong Wang
International Conference on Computer Vision (ICCV), 2021
Paper Code

LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval
Reuben Tan, Huijuan Xu, Kate Saenko, Bryan A. Plummer
IEEE Winter Conference on Applications of Computer Vision (WACV), 2021
Paper

Auxiliary Task Reweighting for Minimum-data Learning
Baifeng Shi, Judy Hoffman, Kate Saenko, Trevor Darrell, Huijuan Xu
Neural Information Processing Systems (NeurIPS), 2020
Paper Code Project page

Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning
Zhekun Luo, Devin Guillory, Baifeng Shi, Wei Ke, Fang Wan, Trevor Darrell, Huijuan Xu
European Conference on Computer Vision (ECCV), 2020
Paper Code Demo1 Demo2

Learning Canonical Representations for Scene Graph to Image Generation
Roei Herzig*, Amir Bar*, Huijuan Xu, Gal Chechik, Trevor Darrell, Amir Globerson
European Conference on Computer Vision (ECCV), 2020
Paper Code Project page

Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska, Tete Xiao, Roei Herzig, Huijuan Xu†, Xiaolong Wang†, Trevor Darrell†
Computer Vision and Pattern Recognition (CVPR), 2020
Paper Dataset Project page

TwoStreamVAN: Improving Motion Modeling in Video Generation
Ximeng Sun, Huijuan Xu, Kate Saenko
IEEE Winter Conference on Applications of Computer Vision (WACV), 2020
Paper Code Demo

Spatio-Temporal Action Graph Networks
Roei Herzig*, Elad Levi*, Huijuan Xu*, Hang Gao, Eli Brosh, Xiaolong Wang, Amir Globerson, Trevor Darrell
International Conference on Computer Vision Workshop (ICCVW), 2019
Paper Code

Learning Instance Activation Maps for Weakly Supervised Instance Segmentation
Yi Zhu, Yanzhao Zhou, Huijuan Xu, Qixiang Ye, David Doermann, Jianbin Jiao
Computer Vision and Pattern Recognition (CVPR), 2019
Paper Project page

Two-Stream Region Convolutional 3D Network for Temporal Activity Detection
Huijuan Xu, Abir Das, Kate Saenko
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Paper Code

Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Huijuan Xu, Kun He, Bryan A. Plummer, Leonid Sigal, Stan Sclaroff, Kate Saenko
Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), 2019
Paper Code Demo1 Demo2

Joint Event Detection and Description in Continuous Video Streams
Huijuan Xu, Boyang Li, Vasili Ramanishka, Leonid Sigal, Kate Saenko
IEEE Winter Conference on Applications of Computer Vision (WACV), 2019
Paper Code Demo1 Demo2

R-C3D: Region Convolutional 3D Network for Temporal Activity Detection
Huijuan Xu, Abir Das, Kate Saenko
International Conference on Computer Vision (ICCV), 2017
Paper Code Demo Project page

Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
Huijuan Xu, Kate Saenko
European Conference on Computer Vision (ECCV), 2016
Paper Code Video spotlight

Translating Videos to Natural Language Using Deep Recurrent Neural Networks
Subhashini Venugopalan, Huijuan Xu, Jeff Donahue, Marcus Rohrbach, Raymond Mooney, Kate Saenko
North American Chapter of the Association for Computational Linguistics (NAACL), 2015
Paper Project page