Baidu at CVPR 2020

2020-06-09


The Conference on Computer Vision and Pattern Recognition (CVPR), the premier annual computer vision event, will run June 14 – 19 as a virtual event due to the COVID-19 pandemic. This year, the conference accepted a record-breaking 1,470 papers out of 6,656 valid submissions, a 22 percent acceptance rate, down from 25 percent in 2019 and 29.6 percent in 2018.


Baidu has 22 research papers accepted at CVPR 2020, covering a wide range of topics such as video understanding, image super-resolution, and instance segmentation and object detection for autonomous driving. The accepted papers, followed by our workshop papers, are listed below.


FaceScape: a Large-scale High Quality 3D Face Dataset and Detailed Riggable 3D Face Prediction

Haotian Yang, Hao Zhu, Yanru Wang, Mingkai Huang, Qiu Shen, Ruigang Yang, Xun Cao

Joint 3D Instance Segmentation and Object Detection for Autonomous Driving

Dingfu Zhou, Jin Fang, Xibin Song, Liu Liu, Junbo Yin, Yuchao Dai, Hongdong Li, Ruigang Yang

Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration

Yang He, Yuhang Ding, Ping Liu, Linchao Zhu, Hanwang Zhang, Yi Yang

Unsupervised Person Re-identification via Softened Similarity Learning

Yutian Lin, Lingxi Xie, Yu Wu, Chenggang Yan, Qi Tian

Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition

Linchao Zhu, Yi Yang

Channel Attention based Iterative Residual Learning for Depth Map Super-Resolution

Xibin Song, Yuchao Dai, Dingfu Zhou, Liu Liu, Wei Li, Hongdong Li, Ruigang Yang

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

Junbo Yin, Wenguan Wang, Qinghao Meng, Ruigang Yang, Jianbing Shen

Cogradient Descent for Bilinear Optimization

Li'an Zhuo, Baochang Zhang, Linlin Yang, Hanlin Chen, Qixiang Ye, David Doermann, Rongrong Ji, Guodong Guo

Hierarchical Pyramid Diverse Attention Networks for Face Recognition

Qiangchang Wang, Tianyi Wu, He Zheng, Guodong Guo

ActBERT: Learning Global-Local Video-Text Representations

Linchao Zhu, Yi Yang

Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation

Min-Hung Chen, Baopu Li, Yingze Bao, Ghassan AlRegib, Zsolt Kira

Memory Aggregation Networks for Efficient Interactive Video Object Segmentation

Jiaxu Miao, Yunchao Wei, Yi Yang

3D Part Guided Image Editing for Fine-grained Object Understanding

Zongdai Liu, Feixiang Lu, Peng Wang, Hui Miao, Liangjun Zhang, Ruigang Yang, Bin Zhou

LiDAR-Based Online 3D Video Object Detection With Graph-Based Message Passing and Spatiotemporal Transformer Attention

Junbo Yin, Jianbing Shen, Chenye Guan, Dingfu Zhou, Ruigang Yang

Gated Channel Transformation for Visual Recognition

Zongxin Yang, Linchao Zhu, Yu Wu, Yi Yang

GP-NAS: Gaussian Process based Neural Architecture Search

Zhihang Li, Teng Xi, Jiankang Deng, Gang Zhang, Shengzhao Wen, Ran He

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

Deli Yu, Xuan Li, Chengquan Zhang, Tao Liu, Junyu Han, Jingtuo Liu, Errui Ding

Imitative Non-Autoregressive Modeling for Trajectory Forecasting and Imputation

Mengshi Qi, Jie Qin, Yu Wu, Yi Yang

HAMBox: Delving Into Mining High-Quality Anchors on Face Detection

Yang Liu, Xu Tang, Junyu Han, Jingtuo Liu, Dinger Rui, Xiang Wu

Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection

Liang Du, Xiaoqing Ye, Xiao Tan, Jianfeng Feng, Zhenbo Xu, Errui Ding, Shilei Wen

Cross-modality Person re-identification with Shared-Specific Feature Transfer

Yan Lu, Yue Wu, Bin Liu, Tianzhu Zhang, Baopu Li, Qi Chu, Nenghai Yu

BFBox: Searching Face-Appropriate Backbone and Feature Pyramid Network for Face Detector

Yang Liu, Xu Tang

 

Workshop Papers:

Co-Learn Sounding Object Visual Grounding and Visually Indicated Sound Separation in A Cycle

Yapeng Tian, Di Hu, Chenliang Xu

A Two-Stage Framework for Multiple Sound-Source Localization

Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin

Does Ambient Sound Help? - Audiovisual Crowd Counting

Di Hu, Lichao Mou, Qingzhong Wang, Junyu Gao, Yuansheng Hua, Dejing Dou, Xiaoxiang Zhu

Heterogeneous Scene Analysis via Self-supervised Audiovisual Learning

Di Hu, Zheng Wang, Haoyi Xiong, Dong Wang, Feiping Nie, Dejing Dou