共计 93634 个字符,预计需要花费 235 分钟才能阅读完成。
CVPR 2021 全副论文颁布,曾经凋谢下载,下边是所有 1600 多篇录用论文
CVPR 2021 全副论文已凋谢,pdf 下载链接:
链接: https://pan.baidu.com/s/1GWkq… 提取码: vwkx(4.3G)
今年 CVPR 论文更新:https://github.com/Sophia-11/…
Invertible Denoising Network: A Light Solution for Real Noise Removal Yang Liu, Zhenyue Qin, Saeed Anwar, Pan Ji, Dongwoo Kim, Sabrina Caldwell, Tom Gedeon [pdf] [arXiv] [bibtex]
Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction Bohan Wu, Suraj Nair, Roberto Martin-Martin, Li Fei-Fei, Chelsea Finn [pdf] [supp] [bibtex]
Over-the-Air Adversarial Flickering Attacks Against Video Recognition Networks Roi Pony, Itay Naeh, Shie Mannor [pdf] [supp] [arXiv] [bibtex]
Encoder Fusion Network With Co-Attention Embedding for Referring Image Segmentation Guang Feng, Zhiwei Hu, Lihe Zhang, Huchuan Lu [pdf] [arXiv] [bibtex]
Polka Lines: Learning Structured Illumination and Reconstruction for Active Stereo Seung-Hwan Baek, Felix Heide [pdf] [supp] [arXiv] [bibtex]
Image Inpainting With External-Internal Learning and Monochromic Bottleneck Tengfei Wang, Hao Ouyang, Qifeng Chen [pdf] [supp] [arXiv] [bibtex]
Patch2Pix: Epipolar-Guided Pixel-Level Correspondences Qunjie Zhou, Torsten Sattler, Laura Leal-Taixe [pdf] [supp] [bibtex]
Diverse Part Discovery: Occluded Person Re-Identification With Part-Aware Transformer Yulin Li, Jianfeng He, Tianzhu Zhang, Xiang Liu, Yongdong Zhang, Feng Wu [pdf] [supp] [bibtex]
Counterfactual Zero-Shot and Open-Set Visual Recognition Zhongqi Yue, Tan Wang, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang [pdf] [supp] [arXiv] [bibtex]
Person30K: A Dual-Meta Generalization Network for Person Re-Identification Yan Bai, Jile Jiao, Wang Ce, Jun Liu, Yihang Lou, Xuetao Feng, Ling-Yu Duan [pdf] [bibtex]
Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition Stephen Hausler, Sourav Garg, Ming Xu, Michael Milford, Tobias Fischer [pdf] [supp] [bibtex]
Visually Informed Binaural Audio Generation without Binaural Audios Xudong Xu, Hang Zhou, Ziwei Liu, Bo Dai, Xiaogang Wang, Dahua Lin [pdf] [supp] [arXiv] [bibtex]
Dual Attention Guided Gaze Target Detection in the Wild Yi Fang, Jiapeng Tang, Wang Shen, Wei Shen, Xiao Gu, Li Song, Guangtao Zhai [pdf] [bibtex]
Privacy Preserving Localization and Mapping From Uncalibrated Cameras Marcel Geppert, Viktor Larsson, Pablo Speciale, Johannes L. Schonberger, Marc Pollefeys [pdf] [supp] [bibtex]
Learning Calibrated Medical Image Segmentation via Multi-Rater Agreement Modeling Wei Ji, Shuang Yu, Junde Wu, Kai Ma, Cheng Bian, Qi Bi, Jingjing Li, Hanruo Liu, Li Cheng, Yefeng Zheng [pdf] [bibtex]
Points As Queries: Weakly Semi-Supervised Object Detection by Points Liangyu Chen, Tong Yang, Xiangyu Zhang, Wei Zhang, Jian Sun [pdf] [arXiv] [bibtex]
Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network Ruicheng Feng, Chongyi Li, Huaijin Chen, Shuai Li, Chen Change Loy, Jinwei Gu [pdf] [supp] [arXiv] [bibtex]
iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression Shifeng Zhang, Chen Zhang, Ning Kang, Zhenguo Li [pdf] [supp] [arXiv] [bibtex]
Pose Recognition With Cascade Transformers Ke Li, Shijie Wang, Xiang Zhang, Yifan Xu, Weijian Xu, Zhuowen Tu [pdf] [arXiv] [bibtex]
Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection Zhenyu Wang, Yali Li, Ye Guo, Lu Fang, Shengjin Wang [pdf] [supp] [arXiv] [bibtex]
Prototype-Guided Saliency Feature Learning for Person Search Hanjae Kim, Sunghun Joung, Ig-Jae Kim, Kwanghoon Sohn [pdf] [bibtex]
Contrastive Learning for Compact Single Image Dehazing Haiyan Wu, Yanyun Qu, Shaohui Lin, Jian Zhou, Ruizhi Qiao, Zhizhong Zhang, Yuan Xie, Lizhuang Ma [pdf] [supp] [arXiv] [bibtex]
I3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors Chaoqi Chen, Zebiao Zheng, Yue Huang, Xinghao Ding, Yizhou Yu [pdf] [arXiv] [bibtex]
Body Meshes as Points Jianfeng Zhang, Dongdong Yu, Jun Hao Liew, Xuecheng Nie, Jiashi Feng [pdf] [supp] [arXiv] [bibtex]
Pixel-Aligned Volumetric Avatars Amit Raj, Michael Zollhofer, Tomas Simon, Jason Saragih, Shunsuke Saito, James Hays, Stephen Lombardi [pdf] [supp] [bibtex]
UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training Mingyang Zhou, Luowei Zhou, Shuohang Wang, Yu Cheng, Linjie Li, Zhou Yu, Jingjing Liu [pdf] [supp] [arXiv] [bibtex]
Generative PointNet: Deep Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification Jianwen Xie, Yifei Xu, Zilong Zheng, Song-Chun Zhu, Ying Nian Wu [pdf] [arXiv] [bibtex]
Blur, Noise, and Compression Robust Generative Adversarial Networks Takuhiro Kaneko, Tatsuya Harada [pdf] [arXiv] [bibtex]
Invisible Perturbations: Physical Adversarial Examples Exploiting the Rolling Shutter Effect Athena Sayles, Ashish Hooda, Mohit Gupta, Rahul Chatterjee, Earlence Fernandes [pdf] [supp] [arXiv] [bibtex]
Introvert: Human Trajectory Prediction via Conditional 3D Attention Nasim Shafiee, Taskin Padir, Ehsan Elhamifar [pdf] [supp] [bibtex]
Camouflaged Object Segmentation With Distraction Mining Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, Deng-Ping Fan [pdf] [supp] [arXiv] [bibtex]
RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction Yinyu Nie, Ji Hou, Xiaoguang Han, Matthias Niessner [pdf] [supp] [bibtex]
In the Light of Feature Distributions: Moment Matching for Neural Style Transfer Nikolai Kalischek, Jan D. Wegner, Konrad Schindler [pdf] [supp] [arXiv] [bibtex]
DOTS: Decoupling Operation and Topology in Differentiable Architecture Search Yu-Chao Gu, Li-Juan Wang, Yun Liu, Yi Yang, Yu-Huan Wu, Shao-Ping Lu, Ming-Ming Cheng [pdf] [supp] [arXiv] [bibtex]
DriveGAN: Towards a Controllable High-Quality Neural Simulation Seung Wook Kim, Jonah Philion, Antonio Torralba, Sanja Fidler [pdf] [supp] [arXiv] [bibtex]
Style-Aware Normalized Loss for Improving Arbitrary Style Transfer Jiaxin Cheng, Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Prem Natarajan [pdf] [supp] [arXiv] [bibtex]
Wide-Depth-Range 6D Object Pose Estimation in Space Yinlin Hu, Sebastien Speierer, Wenzel Jakob, Pascal Fua, Mathieu Salzmann [pdf] [arXiv] [bibtex]
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization Chuming Lin, Chengming Xu, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu [pdf] [supp] [arXiv] [bibtex]
Monocular Depth Estimation via Listwise Ranking Using the Plackett-Luce Model Julian Lienen, Eyke Hullermeier, Ralph Ewerth, Nils Nommensen [pdf] [supp] [bibtex]
Holistic 3D Scene Understanding From a Single Image With Implicit Representation Cheng Zhang, Zhaopeng Cui, Yinda Zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu [pdf] [supp] [arXiv] [bibtex]
MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization Jiahui Huang, He Wang, Tolga Birdal, Minhyuk Sung, Federica Arrigoni, Shi-Min Hu, Leonidas J. Guibas [pdf] [supp] [arXiv] [bibtex]
Learning Optical Flow From a Few Matches Shihao Jiang, Yao Lu, Hongdong Li, Richard Hartley [pdf] [arXiv] [bibtex]
Learnable Motion Coherence for Correspondence Pruning Yuan Liu, Lingjie Liu, Cheng Lin, Zhen Dong, Wenping Wang [pdf] [supp] [arXiv] [bibtex]
ManipulaTHOR: A Framework for Visual Object Manipulation Kiana Ehsani, Winson Han, Alvaro Herrasti, Eli VanderBilt, Luca Weihs, Eric Kolve, Aniruddha Kembhavi, Roozbeh Mottaghi [pdf] [supp] [arXiv] [bibtex]
DeepI2P: Image-to-Point Cloud Registration via Deep Classification Jiaxin Li, Gim Hee Lee [pdf] [supp] [arXiv] [bibtex]
Scene-Intuitive Agent for Remote Embodied Visual Grounding Xiangru Lin, Guanbin Li, Yizhou Yu [pdf] [supp] [arXiv] [bibtex]
Human-Like Controllable Image Captioning With Verb-Specific Semantic Roles Long Chen, Zhihong Jiang, Jun Xiao, Wei Liu [pdf] [supp] [arXiv] [bibtex]
Enhancing the Transferability of Adversarial Attacks Through Variance Tuning Xiaosen Wang, Kun He [pdf] [supp] [arXiv] [bibtex]
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown [pdf] [supp] [arXiv] [bibtex]
BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan [pdf] [bibtex]
Probabilistic Model Distillation for Semantic Correspondence Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu [pdf] [bibtex]
OpenRooms: An Open Framework for Photorealistic Indoor Scene Datasets Zhengqin Li, Ting-Wei Yu, Shen Sang, Sarah Wang, Meng Song, Yuhan Liu, Yu-Ying Yeh, Rui Zhu, Nitesh Gundavarapu, Jia Shi, Sai Bi, Hong-Xing Yu, Zexiang Xu, Kalyan Sunkavalli, Milos Hasan, Ravi Ramamoorthi, Manmohan Chandraker [pdf] [supp] [bibtex]
SSAN: Separable Self-Attention Network for Video Representation Learning Xudong Guo, Xun Guo, Yan Lu [pdf] [arXiv] [bibtex]
4D Panoptic LiDAR Segmentation Mehmet Aygun, Aljosa Osep, Mark Weber, Maxim Maximov, Cyrill Stachniss, Jens Behley, Laura Leal-Taixe [pdf] [supp] [arXiv] [bibtex]
SceneGen: Learning To Generate Realistic Traffic Scenes Shuhan Tan, Kelvin Wong, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun [pdf] [supp] [arXiv] [bibtex]
Natural Adversarial Examples Dan Hendrycks, Kevin Zhao, Steven Basart, Jacob Steinhardt, Dawn Song [pdf] [supp] [arXiv] [bibtex]
CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models Mengyue Yang, Furui Liu, Zhitang Chen, Xinwei Shen, Jianye Hao, Jun Wang [pdf] [supp] [arXiv] [bibtex]
VideoMoCo: Contrastive Video Representation Learning With Temporally Adversarial Examples Tian Pan, Yibing Song, Tianyu Yang, Wenhao Jiang, Wei Liu [pdf] [arXiv] [bibtex]
Zero-Shot Instance Segmentation Ye Zheng, Jiahong Wu, Yongqiang Qin, Faen Zhang, Li Cui [pdf] [supp] [arXiv] [bibtex]
Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes Julian Chibane, Aayush Bansal, Verica Lazova, Gerard Pons-Moll [pdf] [supp] [arXiv] [bibtex]
Global Transport for Fluid Reconstruction With Learned Self-Supervision Erik Franz, Barbara Solenthaler, Nils Thuerey [pdf] [supp] [arXiv] [bibtex]
SliceNet: Deep Dense Depth Estimation From a Single Indoor Panorama Using a Slice-Based Representation Giovanni Pintore, Marco Agus, Eva Almansa, Jens Schneider, Enrico Gobbetti [pdf] [supp] [bibtex]
Offboard 3D Object Detection From Point Cloud Sequences Charles R. Qi, Yin Zhou, Mahyar Najibi, Pei Sun, Khoa Vo, Boyang Deng, Dragomir Anguelov [pdf] [supp] [arXiv] [bibtex]
STaR: Self-Supervised Tracking and Reconstruction of Rigid Objects in Motion With Neural Rendering Wentao Yuan, Zhaoyang Lv, Tanner Schmidt, Steven Lovegrove [pdf] [supp] [arXiv] [bibtex]
Generalization on Unseen Domains via Inference-Time Label-Preserving Target Projections Prashant Pandey, Mrigank Raman, Sumanth Varambally, Prathosh AP [pdf] [bibtex]
Monocular 3D Object Detection: An Extrinsic Parameter Free Approach Yunsong Zhou, Yuan He, Hongzi Zhu, Cheng Wang, Hongyang Li, Qinhong Jiang [pdf] [bibtex]
Communication Efficient SGD via Gradient Sampling With Bayes Prior Liuyihan Song, Kang Zhao, Pan Pan, Yu Liu, Yingya Zhang, Yinghui Xu, Rong Jin [pdf] [bibtex]
AdaBins: Depth Estimation Using Adaptive Bins Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka [pdf] [supp] [arXiv] [bibtex]
VirFace: Enhancing Face Recognition via Unlabeled Shallow Data Wenyu Li, Tianchu Guo, Pengyu Li, Binghui Chen, Biao Wang, Wangmeng Zuo, Lei Zhang [pdf] [supp] [bibtex]
Pulsar: Efficient Sphere-Based Neural Rendering Christoph Lassner, Michael Zollhofer [pdf] [supp] [arXiv] [bibtex]
Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification Peng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang [pdf] [arXiv] [bibtex]
Visualizing Adapted Knowledge in Domain Transfer Yunzhong Hou, Liang Zheng [pdf] [arXiv] [bibtex]
Delving into Data: Effectively Substitute Training for Black-box Attack Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, Xiangyang Xue [pdf] [arXiv] [bibtex]
How To Exploit the Transferability of Learned Image Compression to Conventional Codecs Jan P. Klopp, Keng-Chi Liu, Liang-Gee Chen, Shao-Yi Chien [pdf] [supp] [arXiv] [bibtex]
CorrNet3D: Unsupervised End-to-End Learning of Dense Correspondence for 3D Point Clouds Yiming Zeng, Yue Qian, Zhiyu Zhu, Junhui Hou, Hui Yuan, Ying He [pdf] [arXiv] [bibtex]
Single-View Robot Pose and Joint Angle Estimation via Render & Compare Yann Labbe, Justin Carpentier, Mathieu Aubry, Josef Sivic [pdf] [arXiv] [bibtex]
Harmonious Semantic Line Detection via Maximal Weight Clique Selection Dongkwon Jin, Wonhui Park, Seong-Gyun Jeong, Chang-Su Kim [pdf] [supp] [arXiv] [bibtex]
Learning the Non-Differentiable Optimization for Blind Super-Resolution Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao [pdf] [supp] [bibtex]
Progressive Temporal Feature Alignment Network for Video Inpainting Xueyan Zou, Linjie Yang, Ding Liu, Yong Jae Lee [pdf] [supp] [arXiv] [bibtex]
Bottleneck Transformers for Visual Recognition Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani [pdf] [supp] [arXiv] [bibtex]
Calibrated RGB-D Salient Object Detection Wei Ji, Jingjing Li, Shuang Yu, Miao Zhang, Yongri Piao, Shunyu Yao, Qi Bi, Kai Ma, Yefeng Zheng, Huchuan Lu, Li Cheng [pdf] [bibtex]
S3: Neural Shape, Skeleton, and Skinning Fields for 3D Human Modeling Ze Yang, Shenlong Wang, Sivabalan Manivasagam, Zeng Huang, Wei-Chiu Ma, Xinchen Yan, Ersin Yumer, Raquel Urtasun [pdf] [supp] [arXiv] [bibtex]
OSTeC: One-Shot Texture Completion Baris Gecer, Jiankang Deng, Stefanos Zafeiriou [pdf] [supp] [arXiv] [bibtex]
Learning To Count Everything Viresh Ranjan, Udbhav Sharma, Thu Nguyen, Minh Hoai [pdf] [supp] [arXiv] [bibtex]
Robust Representation Learning With Feedback for Single Image Deraining Chenghao Chen, Hao Li [pdf] [arXiv] [bibtex]
Fully Understanding Generic Objects: Modeling, Segmentation, and Reconstruction Feng Liu, Luan Tran, Xiaoming Liu [pdf] [supp] [arXiv] [bibtex]
SSN: Soft Shadow Network for Image Compositing Yichen Sheng, Jianming Zhang, Bedrich Benes [pdf] [supp] [arXiv] [bibtex]
MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection Jia-Chang Feng, Fa-Ting Hong, Wei-Shi Zheng [pdf] [supp] [arXiv] [bibtex]
VinVL: Revisiting Visual Representations in Vision-Language Models Pengchuan Zhang, Xiujun Li, Xiaowei Hu, Jianwei Yang, Lei Zhang, Lijuan Wang, Yejin Choi, Jianfeng Gao [pdf] [supp] [arXiv] [bibtex]
Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression Zigang Geng, Ke Sun, Bin Xiao, Zhaoxiang Zhang, Jingdong Wang [pdf] [arXiv] [bibtex]
CoMoGAN: Continuous Model-Guided Image-to-Image Translation Fabio Pizzati, Pietro Cerri, Raoul de Charette [pdf] [supp] [arXiv] [bibtex]
Self-Supervised Video Hashing via Bidirectional Transformers Shuyan Li, Xiu Li, Jiwen Lu, Jie Zhou [pdf] [bibtex]
From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation Chen Li, Gim Hee Lee [pdf] [arXiv] [bibtex]
Safe Local Motion Planning With Self-Supervised Freespace Forecasting Peiyun Hu, Aaron Huang, John Dolan, David Held, Deva Ramanan [pdf] [supp] [bibtex]
Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng [pdf] [supp] [arXiv] [bibtex]
CondenseNet V2: Sparse Feature Reactivation for Deep Networks Le Yang, Haojun Jiang, Ruojin Cai, Yulin Wang, Shiji Song, Gao Huang, Qi Tian [pdf] [supp] [arXiv] [bibtex]
Learning Graphs for Knowledge Transfer With Limited Labels Pallabi Ghosh, Nirat Saini, Larry S. Davis, Abhinav Shrivastava [pdf] [supp] [bibtex]
DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation Seunghun Lee, Sunghyun Cho, Sunghoon Im [pdf] [supp] [arXiv] [bibtex]
Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding Binbin Huang, Dongze Lian, Weixin Luo, Shenghua Gao [pdf] [arXiv] [bibtex]
Information Bottleneck Disentanglement for Identity Swapping Gege Gao, Huaibo Huang, Chaoyou Fu, Zhaoyang Li, Ran He [pdf] [supp] [bibtex]
DualGraph: A Graph-Based Method for Reasoning About Label Noise HaiYang Zhang, XiMing Xing, Liang Liu [pdf] [bibtex]
Automatic Correction of Internal Units in Generative Neural Networks Ali Tousi, Haedong Jeong, Jiyeon Han, Hwanil Choi, Jaesik Choi [pdf] [arXiv] [bibtex]
Generating Manga From Illustrations via Mimicking Manga Creation Workflow Lvmin Zhang, Xinrui Wang, Qingnan Fan, Yi Ji, Chunping Liu [pdf] [bibtex]
Multi-Decoding Deraining Network and Quasi-Sparsity Based Training Yinglong Wang, Chao Ma, Bing Zeng [pdf] [bibtex]
Open-Vocabulary Object Detection Using Captions Alireza Zareian, Kevin Dela Rosa, Derek Hao Hu, Shih-Fu Chang [pdf] [supp] [arXiv] [bibtex]
Unveiling the Potential of Structure Preserving for Weakly Supervised Object Localization Xingjia Pan, Yingguo Gao, Zhiwen Lin, Fan Tang, Weiming Dong, Haolei Yuan, Feiyue Huang, Changsheng Xu [pdf] [supp] [arXiv] [bibtex]
From Points to Multi-Object 3D Reconstruction Francis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari [pdf] [arXiv] [bibtex]
Dual-Stream Multiple Instance Learning Network for Whole Slide Image Classification With Self-Supervised Contrastive Learning Bin Li, Yin Li, Kevin W. Eliceiri [pdf] [supp] [arXiv] [bibtex]
Regressive Domain Adaptation for Unsupervised Keypoint Detection Junguang Jiang, Yifei Ji, Ximei Wang, Yufeng Liu, Jianmin Wang, Mingsheng Long [pdf] [arXiv] [bibtex]
Mask Guided Matting via Progressive Refinement Network Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille [pdf] [arXiv] [bibtex]
Monocular Reconstruction of Neural Face Reflectance Fields Mallikarjun B R, Ayush Tewari, Tae-Hyun Oh, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt [pdf] [supp] [arXiv] [bibtex]
SelfSAGCN: Self-Supervised Semantic Alignment for Graph Convolution Network Xu Yang, Cheng Deng, Zhiyuan Dang, Kun Wei, Junchi Yan [pdf] [bibtex]
ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-Shot Learning Chaofan Chen, Xiaoshan Yang, Changsheng Xu, Xuhui Huang, Zhe Ma [pdf] [bibtex]
Coarse-Fine Networks for Temporal Activity Detection in Videos Kumara Kahatapitiya, Michael S. Ryoo [pdf] [arXiv] [bibtex]
Can Audio-Visual Integration Strengthen Robustness Under Multimodal Attacks? Yapeng Tian, Chenliang Xu [pdf] [supp] [arXiv] [bibtex]
Deep Gradient Projection Networks for Pan-sharpening Shuang Xu, Jiangshe Zhang, Zixiang Zhao, Kai Sun, Junmin Liu, Chunxia Zhang [pdf] [arXiv] [bibtex]
ReNAS: Relativistic Evaluation of Neural Architecture Search Yixing Xu, Yunhe Wang, Kai Han, Yehui Tang, Shangling Jui, Chunjing Xu, Chang Xu [pdf] [supp] [arXiv] [bibtex]
When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks Jiahang Wang, Sheng Jin, Wentao Liu, Weizhong Liu, Chen Qian, Ping Luo [pdf] [supp] [arXiv] [bibtex]
ReMix: Towards Image-to-Image Translation With Limited Data Jie Cao, Luanxuan Hou, Ming-Hsuan Yang, Ran He, Zhenan Sun [pdf] [supp] [arXiv] [bibtex]
Adaptive Rank Estimate in Robust Principal Component Analysis Zhengqin Xu, Rui He, Shoulie Xie, Shiqian Wu [pdf] [supp] [bibtex]
Continual Adaptation of Visual Representations via Domain Randomization and Meta-Learning Riccardo Volpi, Diane Larlus, Gregory Rogez [pdf] [supp] [arXiv] [bibtex]
DeepACG: Co-Saliency Detection via Semantic-Aware Contrast Gromov-Wasserstein Distance Kaihua Zhang, Mingliang Dong, Bo Liu, Xiao-Tong Yuan, Qingshan Liu [pdf] [bibtex]
SurFree: A Fast Surrogate-Free Black-Box Attack Thibault Maho, Teddy Furon, Erwan Le Merrer [pdf] [arXiv] [bibtex]
Beyond Image to Depth: Improving Depth Prediction Using Echoes Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma [pdf] [supp] [arXiv] [bibtex]
Rich Features for Perceptual Quality Assessment of UGC Videos Yilin Wang, Junjie Ke, Hossein Talebi, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli, Peyman Milanfar, Feng Yang [pdf] [supp] [bibtex]
Sequential Graph Convolutional Network for Active Learning Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim [pdf] [arXiv] [bibtex]
Generative Classifiers as a Basis for Trustworthy Image Classification Radek Mackowiak, Lynton Ardizzone, Ullrich Kothe, Carsten Rother [pdf] [supp] [arXiv] [bibtex]
EffiScene: Efficient Per-Pixel Rigidity Inference for Unsupervised Joint Learning of Optical Flow, Depth, Camera Pose and Motion Segmentation Yang Jiao, Trac D. Tran, Guangming Shi [pdf] [arXiv] [bibtex]
Localizing Visual Sounds the Hard Way Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman [pdf] [arXiv] [bibtex]
Synthesize-It-Classifier: Learning a Generative Classifier Through Recurrent Self-Analysis Arghya Pal, Raphael C.-W. Phan, KokSheik Wong [pdf] [supp] [bibtex]
Self-Point-Flow: Self-Supervised Scene Flow Estimation From Point Clouds With Optimal Transport and Random Walk Ruibo Li, Guosheng Lin, Lihua Xie [pdf] [supp] [bibtex]
Toward Joint Thing-and-Stuff Mining for Weakly Supervised Panoptic Segmentation Yunhang Shen, Liujuan Cao, Zhiwei Chen, Feihong Lian, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji [pdf] [bibtex]
Intelligent Carpet: Inferring 3D Human Pose From Tactile Signals Yiyue Luo, Yunzhu Li, Michael Foshey, Wan Shou, Pratyusha Sharma, Tomas Palacios, Antonio Torralba, Wojciech Matusik [pdf] [supp] [bibtex]
Railroad Is Not a Train: Saliency As Pseudo-Pixel Supervision for Weakly Supervised Semantic Segmentation Seungho Lee, Minhyun Lee, Jongwuk Lee, Hyunjung Shim [pdf] [supp] [arXiv] [bibtex]
Stable View Synthesis Gernot Riegler, Vladlen Koltun [pdf] [arXiv] [bibtex]
Deep Two-View Structure-From-Motion Revisited Jianyuan Wang, Yiran Zhong, Yuchao Dai, Stan Birchfield, Kaihao Zhang, Nikolai Smolyanskiy, Hongdong Li [pdf] [supp] [arXiv] [bibtex]
Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes Dmytro Kotovenko, Matthias Wright, Arthur Heimbrecht, Bjorn Ommer [pdf] [supp] [arXiv] [bibtex]
Cluster, Split, Fuse, and Update: Meta-Learning for Open Compound Domain Adaptive Semantic Segmentation Rui Gong, Yuhua Chen, Danda Pani Paudel, Yawei Li, Ajad Chhatkuli, Wen Li, Dengxin Dai, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]
Beyond Short Clips: End-to-End Video-Level Learning With Collaborative Memories Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry S. Davis, Heng Wang [pdf] [arXiv] [bibtex]
PointDSC: Robust Point Cloud Registration Using Deep Spatial Consistency Xuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei Li, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai [pdf] [supp] [arXiv] [bibtex]
Task Programming: Learning Data Efficient Behavior Representations Jennifer J. Sun, Ann Kennedy, Eric Zhan, David J. Anderson, Yisong Yue, Pietro Perona [pdf] [supp] [arXiv] [bibtex]
ACRE: Abstract Causal REasoning Beyond Covariation Chi Zhang, Baoxiong Jia, Mark Edmonds, Song-Chun Zhu, Yixin Zhu [pdf] [supp] [arXiv] [bibtex]
DeepLM: Large-Scale Nonlinear Least Squares on Deep Learning Frameworks Using Stochastic Domain Decomposition Jingwei Huang, Shan Huang, Mingwei Sun [pdf] [supp] [bibtex]
TDN: Temporal Difference Networks for Efficient Action Recognition Limin Wang, Zhan Tong, Bin Ji, Gangshan Wu [pdf] [supp] [arXiv] [bibtex]
LiBRe: A Practical Bayesian Approach to Adversarial Detection Zhijie Deng, Xiao Yang, Shizhen Xu, Hang Su, Jun Zhu [pdf] [supp] [arXiv] [bibtex]
ArtCoder: An End-to-End Method for Generating Scanning-Robust Stylized QR Codes Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu, Tao Ren [pdf] [bibtex]
Self-Supervised Pillar Motion Learning for Autonomous Driving Chenxu Luo, Xiaodong Yang, Alan Yuille [pdf] [supp] [arXiv] [bibtex]
Quantum Permutation Synchronization Tolga Birdal, Vladislav Golyanik, Christian Theobalt, Leonidas J. Guibas [pdf] [supp] [arXiv] [bibtex]
QAIR: Practical Query-Efficient Black-Box Attacks for Image Retrieval Xiaodan Li, Jinfeng Li, Yuefeng Chen, Shaokai Ye, Yuan He, Shuhui Wang, Hang Su, Hui Xue [pdf] [supp] [arXiv] [bibtex]
MagFace: A Universal Representation for Face Recognition and Quality Assessment Qiang Meng, Shichao Zhao, Zhida Huang, Feng Zhou [pdf] [supp] [arXiv] [bibtex]
Wasserstein Barycenter for Multi-Source Domain Adaptation Eduardo Fernandes Montesuma, Fred Maurice Ngole Mboula [pdf] [supp] [bibtex]
Unsupervised Hyperbolic Metric Learning Jiexi Yan, Lei Luo, Cheng Deng, Heng Huang [pdf] [bibtex]
Improving Sign Language Translation With Monolingual Data by Sign Back-Translation Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, Houqiang Li [pdf] [arXiv] [bibtex]
Background Splitting: Finding Rare Classes in a Sea of Background Ravi Teja Mullapudi, Fait Poms, William R. Mark, Deva Ramanan, Kayvon Fatahalian [pdf] [supp] [arXiv] [bibtex]
Adaptive Convolutions for Structure-Aware Style Transfer Prashanth Chandran, Gaspard Zoss, Paulo Gotardo, Markus Gross, Derek Bradley [pdf] [supp] [bibtex]
Few-Shot Incremental Learning With Continually Evolved Classifiers Chi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu [pdf] [supp] [arXiv] [bibtex]
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions Junbin Xiao, Xindi Shang, Angela Yao, Tat-Seng Chua [pdf] [supp] [bibtex]
LayoutGMN: Neural Graph Matching for Structural Layout Similarity Akshay Gadi Patil, Manyi Li, Matthew Fisher, Manolis Savva, Hao Zhang [pdf] [supp] [arXiv] [bibtex]
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li [pdf] [supp] [bibtex]
ArtEmis: Affective Language for Visual Art Panos Achlioptas, Maks Ovsjanikov, Kilichbek Haydarov, Mohamed Elhoseiny, Leonidas J. Guibas [pdf] [arXiv] [bibtex]
Sketch, Ground, and Refine: Top-Down Dense Video Captioning Chaorui Deng, Shizhe Chen, Da Chen, Yuan He, Qi Wu [pdf] [bibtex]
Learning Normal Dynamics in Videos With Meta Prototype Network Hui Lv, Chen Chen, Zhen Cui, Chunyan Xu, Yong Li, Jian Yang [pdf] [supp] [arXiv] [bibtex]
Graph-Based High-Order Relation Discovery for Fine-Grained Recognition Yifan Zhao, Ke Yan, Feiyue Huang, Jia Li [pdf] [bibtex]
Normal Integration via Inverse Plane Fitting With Minimum Point-to-Plane Distance Xu Cao, Boxin Shi, Fumio Okura, Yasuyuki Matsushita [pdf] [supp] [bibtex]
NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration Zhengang Li, Geng Yuan, Wei Niu, Pu Zhao, Yanyu Li, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin [pdf] [arXiv] [bibtex]
Spatial Feature Calibration and Temporal Fusion for Effective One-Stage Video Instance Segmentation Minghan Li, Shuai Li, Lida Li, Lei Zhang [pdf] [supp] [arXiv] [bibtex]
Learning Asynchronous and Sparse Human-Object Interaction in Videos Romero Morais, Vuong Le, Svetha Venkatesh, Truyen Tran [pdf] [supp] [arXiv] [bibtex]
Single Image Reflection Removal With Absorption Effect Qian Zheng, Boxin Shi, Jinnan Chen, Xudong Jiang, Ling-Yu Duan, Alex C. Kot [pdf] [supp] [bibtex]
One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking Minghao Chen, Jianlong Fu, Haibin Ling [pdf] [supp] [arXiv] [bibtex]
Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, Ping Luo [pdf] [supp] [arXiv] [bibtex]
M3DSSD: Monocular 3D Single Stage Object Detector Shujie Luo, Hang Dai, Ling Shao, Yong Ding [pdf] [arXiv] [bibtex]
Structure-Aware Face Clustering on a Large-Scale Graph With 107 Nodes Shuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie Zhou [pdf] [supp] [bibtex]
Objects Are Different: Flexible Monocular 3D Object Detection Yunpeng Zhang, Jiwen Lu, Jie Zhou [pdf] [arXiv] [bibtex]
Permuted AdaIN: Reducing the Bias Towards Global Statistics in Image Classification Oren Nuriel, Sagie Benaim, Lior Wolf [pdf] [arXiv] [bibtex]
Pixel Codec Avatars Shugao Ma, Tomas Simon, Jason Saragih, Dawei Wang, Yuecheng Li, Fernando De la Torre, Yaser Sheikh [pdf] [supp] [arXiv] [bibtex]
SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification Zijian Hu, Zhengyu Yang, Xuefeng Hu, Ram Nevatia [pdf] [supp] [arXiv] [bibtex]
Context-Aware Layout to Image Generation With Enhanced Object Appearance Sen He, Wentong Liao, Michael Ying Yang, Yongxin Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang [pdf] [arXiv] [bibtex]
Mask-Embedded Discriminator With Region-Based Semantic Regularization for Semi-Supervised Class-Conditional Image Synthesis Yi Liu, Xiaoyang Huo, Tianyi Chen, Xiangping Zeng, Si Wu, Zhiwen Yu, Hau-San Wong [pdf] [bibtex]
LEAP: Learning Articulated Occupancy of People Marko Mihajlovic, Yan Zhang, Michael J. Black, Siyu Tang [pdf] [supp] [arXiv] [bibtex]
ANR: Articulated Neural Rendering for Virtual Avatars Amit Raj, Julian Tanke, James Hays, Minh Vo, Carsten Stoll, Christoph Lassner [pdf] [supp] [arXiv] [bibtex]
Flow-Based Kernel Prior With Application to Blind Super-Resolution Jingyun Liang, Kai Zhang, Shuhang Gu, Luc Van Gool, Radu Timofte [pdf] [supp] [arXiv] [bibtex]
Probabilistic Selective Encryption of Convolutional Neural Networks for Hierarchical Services Jinyu Tian, Jiantao Zhou, Jia Duan [pdf] [supp] [arXiv] [bibtex]
Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images Florian Kluger, Hanno Ackermann, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn [pdf] [supp] [arXiv] [bibtex]
Dive Into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition Jiahui She, Yibo Hu, Hailin Shi, Jun Wang, Qiu Shen, Tao Mei [pdf] [supp] [arXiv] [bibtex]
Attention-Guided Image Compression by Deep Reconstruction of Compressive Sensed Saliency Skeleton Xi Zhang, Xiaolin Wu [pdf] [supp] [arXiv] [bibtex]
Cluster-Wise Hierarchical Generative Model for Deep Amortized Clustering Huafeng Liu, Jiaqi Wang, Liping Jing [pdf] [supp] [bibtex]
Mirror3D: Depth Refinement for Mirror Surfaces Jiaqi Tan, Weijie Lin, Angel X. Chang, Manolis Savva [pdf] [supp] [bibtex]
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning Zhenda Xie, Yutong Lin, Zheng Zhang, Yue Cao, Stephen Lin, Han Hu [pdf] [arXiv] [bibtex]
Reciprocal Transformations for Unsupervised Video Object Segmentation Sucheng Ren, Wenxi Liu, Yongtuo Liu, Haoxin Chen, Guoqiang Han, Shengfeng He [pdf] [supp] [bibtex]
Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark Longyin Wen, Dawei Du, Pengfei Zhu, Qinghua Hu, Qilong Wang, Liefeng Bo, Siwei Lyu [pdf] [arXiv] [bibtex]
Learning Complete 3D Morphable Face Models From Images and Videos Mallikarjun B R, Ayush Tewari, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt [pdf] [supp] [arXiv] [bibtex]
Bottom-Up Shift and Reasoning for Referring Image Segmentation Sibei Yang, Meng Xia, Guanbin Li, Hong-Yu Zhou, Yizhou Yu [pdf] [bibtex]
Sparse Auxiliary Networks for Unified Monocular Depth Prediction and Completion Vitor Guizilini, Rares Ambrus, Wolfram Burgard, Adrien Gaidon [pdf] [arXiv] [bibtex]
DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes With Biharmonic Coordinates Minghua Liu, Minhyuk Sung, Radomir Mech, Hao Su [pdf] [supp] [arXiv] [bibtex]
Panoptic Segmentation Forecasting Colin Graber, Grace Tsai, Michael Firman, Gabriel Brostow, Alexander G. Schwing [pdf] [supp] [arXiv] [bibtex]
SRDAN: Scale-Aware and Range-Aware Domain Adaptation Network for Cross-Dataset 3D Object Detection Weichen Zhang, Wen Li, Dong Xu [pdf] [bibtex]
Pedestrian and Ego-Vehicle Trajectory Prediction From Monocular Camera Lukas Neumann, Andrea Vedaldi [pdf] [bibtex]
Globally Optimal Relative Pose Estimation With Gravity Prior Yaqing Ding, Daniel Barath, Jian Yang, Hui Kong, Zuzana Kukelova [pdf] [supp] [arXiv] [bibtex]
Mutual CRF-GNN for Few-Shot Learning Shixiang Tang, Dapeng Chen, Lei Bai, Kaijian Liu, Yixiao Ge, Wanli Ouyang [pdf] [supp] [bibtex]
Weakly Supervised Action Selection Learning in Video Junwei Ma, Satya Krishna Gorti, Maksims Volkovs, Guangwei Yu [pdf] [arXiv] [bibtex]
Learning Student Networks in the Wild Hanting Chen, Tianyu Guo, Chang Xu, Wenshuo Li, Chunjing Xu, Chao Xu, Yunhe Wang [pdf] [bibtex]
Distilling Knowledge via Knowledge Review Pengguang Chen, Shu Liu, Hengshuang Zhao, Jiaya Jia [pdf] [supp] [arXiv] [bibtex]
DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets Jianpeng Zhang, Yutong Xie, Yong Xia, Chunhua Shen [pdf] [arXiv] [bibtex]
Lips Don’t Lie: A Generalisable and Robust Approach To Face Forgery Detection Alexandros Haliassos, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic [pdf] [supp] [bibtex]
Exploring Simple Siamese Representation Learning Xinlei Chen, Kaiming He [pdf] [supp] [arXiv] [bibtex]
CAMERAS: Enhanced Resolution and Sanity Preserving Class Activation Mapping for Image Saliency Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian [pdf] [supp] [bibtex]
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding Shengheng Deng, Xun Xu, Chaozheng Wu, Ke Chen, Kui Jia [pdf] [supp] [arXiv] [bibtex]
Learning To Segment Actions From Visual and Language Instructions via Differentiable Weak Sequence Alignment Yuhan Shen, Lu Wang, Ehsan Elhamifar [pdf] [supp] [bibtex]
Deep Implicit Templates for 3D Shape Representation Zerong Zheng, Tao Yu, Qionghai Dai, Yebin Liu [pdf] [supp] [arXiv] [bibtex]
Semantic Image Matting Yanan Sun, Chi-Keung Tang, Yu-Wing Tai [pdf] [supp] [arXiv] [bibtex]
Semi-Supervised Semantic Segmentation With Cross Pseudo Supervision Xiaokang Chen, Yuhui Yuan, Gang Zeng, Jingdong Wang [pdf] [supp] [arXiv] [bibtex]
Ranking Neural Checkpoints Yandong Li, Xuhui Jia, Ruoxin Sang, Yukun Zhu, Bradley Green, Liqiang Wang, Boqing Gong [pdf] [supp] [arXiv] [bibtex]
SuperMix: Supervising the Mixing Data Augmentation Ali Dabouei, Sobhan Soleymani, Fariborz Taherkhani, Nasser M. Nasrabadi [pdf] [supp] [arXiv] [bibtex]
Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection Luwei Hou, Yu Zhang, Kui Fu, Jia Li [pdf] [bibtex]
Inception Convolution With Efficient Dilation Search Jie Liu, Chuming Li, Feng Liang, Chen Lin, Ming Sun, Junjie Yan, Wanli Ouyang, Dong Xu [pdf] [arXiv] [bibtex]
Back to Event Basics: Self-Supervised Learning of Image Reconstruction for Event Cameras via Photometric Constancy Federico Paredes-Valles, Guido C. H. E. de Croon [pdf] [supp] [bibtex]
AdderSR: Towards Energy Efficient Image Super-Resolution Dehua Song, Yunhe Wang, Hanting Chen, Chang Xu, Chunjing Xu, Dacheng Tao [pdf] [supp] [arXiv] [bibtex]
Semi-Supervised Domain Adaptation Based on Dual-Level Domain Mixing for Semantic Segmentation Shuaijun Chen, Xu Jia, Jianzhong He, Yongjie Shi, Jianzhuang Liu [pdf] [supp] [arXiv] [bibtex]
Connecting What To Say With Where To Look by Modeling Human Attention Traces Zihang Meng, Licheng Yu, Ning Zhang, Tamara L. Berg, Babak Damavandi, Vikas Singh, Amy Bearman [pdf] [supp] [arXiv] [bibtex]
Shelf-Supervised Mesh Prediction in the Wild Yufei Ye, Shubham Tulsiani, Abhinav Gupta [pdf] [supp] [arXiv] [bibtex]
Learning To Filter: Siamese Relation Network for Robust Tracking Siyuan Cheng, Bineng Zhong, Guorong Li, Xin Liu, Zhenjun Tang, Xianxian Li, Jing Wang [pdf] [arXiv] [bibtex]
Ensembling With Deep Generative Views Lucy Chai, Jun-Yan Zhu, Eli Shechtman, Phillip Isola, Richard Zhang [pdf] [supp] [arXiv] [bibtex]
Accurate Few-Shot Object Detection With Support-Query Mutual Guidance and Hybrid Loss Lu Zhang, Shuigeng Zhou, Jihong Guan, Ji Zhang [pdf] [supp] [bibtex]
Cascaded Prediction Network via Segment Tree for Temporal Video Grounding Yang Zhao, Zhou Zhao, Zhu Zhang, Zhijie Lin [pdf] [supp] [bibtex]
Posterior Promoted GAN With Distribution Discriminator for Unsupervised Image Synthesis Xianchao Zhang, Ziyang Cheng, Xiaotong Zhang, Han Liu [pdf] [bibtex]
Toward Accurate and Realistic Outfits Visualization With Attention to Details Kedan Li, Min Jin Chong, Jeffrey Zhang, Jingen Liu [pdf] [supp] [bibtex]
Delving Deep Into Many-to-Many Attention for Few-Shot Video Object Segmentation Haoxin Chen, Hanjie Wu, Nanxuan Zhao, Sucheng Ren, Shengfeng He [pdf] [supp] [bibtex]
MongeNet: Efficient Sampler for Geometric Deep Learning Leo Lebrat, Rodrigo Santa Cruz, Clinton Fookes, Olivier Salvado [pdf] [arXiv] [bibtex]
Gated Spatio-Temporal Attention-Guided Video Deblurring Maitreya Suin, A. N. Rajagopalan [pdf] [bibtex]
Learning Multi-Scale Photo Exposure Correction Mahmoud Afifi, Konstantinos G. Derpanis, Bjorn Ommer, Michael S. Brown [pdf] [supp] [arXiv] [bibtex]
Learning Semantic Person Image Generation by Region-Adaptive Normalization Zhengyao Lv, Xiaoming Li, Xin Li, Fu Li, Tianwei Lin, Dongliang He, Wangmeng Zuo [pdf] [arXiv] [bibtex]
Rethinking Class Relations: Absolute-Relative Supervised and Unsupervised Few-Shot Learning Hongguang Zhang, Piotr Koniusz, Songlei Jian, Hongdong Li, Philip H. S. Torr [pdf] [supp] [arXiv] [bibtex]
Divergence Optimization for Noisy Universal Domain Adaptation Qing Yu, Atsushi Hashimoto, Yoshitaka Ushiku [pdf] [supp] [arXiv] [bibtex]
Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning Chengming Xu, Yanwei Fu, Chen Liu, Chengjie Wang, Jilin Li, Feiyue Huang, Li Zhang, Xiangyang Xue [pdf] [supp] [arXiv] [bibtex]
Unsupervised Learning of 3D Object Categories From Videos in the Wild Philipp Henzler, Jeremy Reizenstein, Patrick Labatut, Roman Shapovalov, Tobias Ritschel, Andrea Vedaldi, David Novotny [pdf] [supp] [arXiv] [bibtex]
Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing Yu Wu, Yi Yang [pdf] [bibtex]
Dogfight: Detecting Drones From Drones Videos Muhammad Waseem Ashraf, Waqas Sultani, Mubarak Shah [pdf] [arXiv] [bibtex]
PAUL: Procrustean Autoencoder for Unsupervised Lifting Chaoyang Wang, Simon Lucey [pdf] [arXiv] [bibtex]
Group Collaborative Learning for Co-Salient Object Detection Qi Fan, Deng-Ping Fan, Huazhu Fu, Chi-Keung Tang, Ling Shao, Yu-Wing Tai [pdf] [arXiv] [bibtex]
RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening Sungha Choi, Sanghun Jung, Huiwon Yun, Joanne T. Kim, Seungryong Kim, Jaegul Choo [pdf] [supp] [arXiv] [bibtex]
Monocular Real-Time Full Body Capture With Inter-Part Correlations Yuxiao Zhou, Marc Habermann, Ikhsanul Habibie, Ayush Tewari, Christian Theobalt, Feng Xu [pdf] [supp] [arXiv] [bibtex]
Pre-Trained Image Processing Transformer Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, Wen Gao [pdf] [supp] [arXiv] [bibtex]
Robust and Accurate Object Detection via Adversarial Learning Xiangning Chen, Cihang Xie, Mingxing Tan, Li Zhang, Cho-Jui Hsieh, Boqing Gong [pdf] [supp] [arXiv] [bibtex]
Faster Meta Update Strategy for Noise-Robust Deep Learning Youjiang Xu, Linchao Zhu, Lu Jiang, Yi Yang [pdf] [supp] [arXiv] [bibtex]
ContactOpt: Optimizing Contact To Improve Grasps Patrick Grady, Chengcheng Tang, Christopher D. Twigg, Minh Vo, Samarth Brahmbhatt, Charles C. Kemp [pdf] [supp] [arXiv] [bibtex]
Panoptic-PolarNet: Proposal-Free LiDAR Point Cloud Panoptic Segmentation Zixiang Zhou, Yang Zhang, Hassan Foroosh [pdf] [supp] [bibtex]
Source-Free Domain Adaptation for Semantic Segmentation Yuang Liu, Wei Zhang, Jun Wang [pdf] [supp] [arXiv] [bibtex]
Adaptive Weighted Discriminator for Training Generative Adversarial Networks Vasily Zadorozhnyy, Qiang Cheng, Qiang Ye [pdf] [supp] [arXiv] [bibtex]
Depth From Camera Motion and Object Detection Brent A. Griffin, Jason J. Corso [pdf] [supp] [arXiv] [bibtex]
PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency Jie Liang, Hui Zeng, Miaomiao Cui, Xuansong Xie, Lei Zhang [pdf] [supp] [arXiv] [bibtex]
Transformation Driven Visual Reasoning Xin Hong, Yanyan Lan, Liang Pang, Jiafeng Guo, Xueqi Cheng [pdf] [supp] [arXiv] [bibtex]
Sparse R-CNN: End-to-End Object Detection With Learnable Proposals Peize Sun, Rufeng Zhang, Yi Jiang, Tao Kong, Chenfeng Xu, Wei Zhan, Masayoshi Tomizuka, Lei Li, Zehuan Yuan, Changhu Wang, Ping Luo [pdf] [bibtex]
Plan2Scene: Converting Floorplans to 3D Scenes Madhawa Vidanapathirana, Qirui Wu, Yasutaka Furukawa, Angel X. Chang, Manolis Savva [pdf] [supp] [bibtex]
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges Qingyong Hu, Bo Yang, Sheikh Khalid, Wen Xiao, Niki Trigoni, Andrew Markham [pdf] [supp] [arXiv] [bibtex]
Towards Open World Object Detection K J Joseph, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian [pdf] [supp] [arXiv] [bibtex]
Conditional Bures Metric for Domain Adaptation You-Wei Luo, Chuan-Xian Ren [pdf] [supp] [bibtex]
DatasetGAN: Efficient Labeled Data Factory With Minimal Human Effort Yuxuan Zhang, Huan Ling, Jun Gao, Kangxue Yin, Jean-Francois Lafleche, Adela Barriuso, Antonio Torralba, Sanja Fidler [pdf] [arXiv] [bibtex]
Repurposing GANs for One-Shot Semantic Part Segmentation Nontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn [pdf] [supp] [arXiv] [bibtex]
Semi-Supervised 3D Hand-Object Poses Estimation With Interactions in Time Shaowei Liu, Hanwen Jiang, Jiarui Xu, Sifei Liu, Xiaolong Wang [pdf] [supp] [bibtex]
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation Yapeng Tian, Di Hu, Chenliang Xu [pdf] [supp] [arXiv] [bibtex]
Digital Gimbal: End-to-End Deep Image Stabilization With Learnable Exposure Times Omer Dahary, Matan Jacoby, Alex M. Bronstein [pdf] [supp] [arXiv] [bibtex]
Rethinking Text Segmentation: A Novel Dataset and a Text-Specific Refinement Approach Xingqian Xu, Zhifei Zhang, Zhaowen Wang, Brian Price, Zhonghao Wang, Humphrey Shi [pdf] [supp] [arXiv] [bibtex]
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning Over Traffic Events Li Xu, He Huang, Jun Liu [pdf] [supp] [bibtex]
T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval Xiaohan Wang, Linchao Zhu, Yi Yang [pdf] [arXiv] [bibtex]
Privacy-Preserving Image Features via Adversarial Affine Subspace Embeddings Mihai Dusmanu, Johannes L. Schonberger, Sudipta N. Sinha, Marc Pollefeys [pdf] [supp] [arXiv] [bibtex]
StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song [pdf] [supp] [arXiv] [bibtex]
Embedding Transfer With Label Relaxation for Improved Metric Learning Sungyeon Kim, Dongwon Kim, Minsu Cho, Suha Kwak [pdf] [supp] [arXiv] [bibtex]
Beyond Static Features for Temporally Consistent 3D Human Pose and Shape From a Video Hongsuk Choi, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee [pdf] [supp] [arXiv] [bibtex]
Layout-Guided Novel View Synthesis From a Single Indoor Panorama Jiale Xu, Jia Zheng, Yanyu Xu, Rui Tang, Shenghua Gao [pdf] [supp] [arXiv] [bibtex]
STMTrack: Template-Free Visual Tracking With Space-Time Memory Networks Zhihong Fu, Qingjie Liu, Zehua Fu, Yunhong Wang [pdf] [arXiv] [bibtex]
Reformulating HOI Detection As Adaptive Set Prediction Mingfei Chen, Yue Liao, Si Liu, Zhiyuan Chen, Fei Wang, Chen Qian [pdf] [arXiv] [bibtex]
Strengthen Learning Tolerance for Weakly Supervised Object Localization Guangyu Guo, Junwei Han, Fang Wan, Dingwen Zhang [pdf] [bibtex]
Mesh Saliency: An Independent Perceptual Measure or a Derivative of Image Saliency? Ran Song, Wei Zhang, Yitian Zhao, Yonghuai Liu, Paul L. Rosin [pdf] [supp] [bibtex]
Passive Inter-Photon Imaging Atul Ingle, Trevor Seets, Mauro Buttafava, Shantanu Gupta, Alberto Tosi, Mohit Gupta, Andreas Velten [pdf] [supp] [arXiv] [bibtex]
Domain Consensus Clustering for Universal Domain Adaptation Guangrui Li, Guoliang Kang, Yi Zhu, Yunchao Wei, Yi Yang [pdf] [supp] [bibtex]
Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations Umberto Michieli, Pietro Zanuttigh [pdf] [supp] [arXiv] [bibtex]
Audio-Driven Emotional Video Portraits Xinya Ji, Hang Zhou, Kaisiyuan Wang, Wayne Wu, Chen Change Loy, Xun Cao, Feng Xu [pdf] [supp] [arXiv] [bibtex]
Pareto Self-Supervised Training for Few-Shot Learning Zhengyu Chen, Jixie Ge, Heshen Zhan, Siteng Huang, Donglin Wang [pdf] [supp] [arXiv] [bibtex]
EnD: Entangling and Disentangling Deep Representations for Bias Correction Enzo Tartaglione, Carlo Alberto Barbano, Marco Grangetto [pdf] [supp] [arXiv] [bibtex]
Recorrupted-to-Recorrupted: Unsupervised Deep Learning for Image Denoising Tongyao Pang, Huan Zheng, Yuhui Quan, Hui Ji [pdf] [supp] [bibtex]
Reconsidering Representation Alignment for Multi-View Clustering Daniel J. Trosten, Sigurd Lokse, Robert Jenssen, Michael Kampffmeyer [pdf] [supp] [arXiv] [bibtex]
Probabilistic Embeddings for Cross-Modal Retrieval Sanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, Diane Larlus [pdf] [supp] [arXiv] [bibtex]
Cloud2Curve: Generation and Vectorization of Parametric Sketches Ayan Das, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song [pdf] [arXiv] [bibtex]
TransFill: Reference-Guided Image Inpainting by Merging Multiple Color and Spatial Transformations Yuqian Zhou, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi [pdf] [supp] [arXiv] [bibtex]
On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective Nontawat Charoenphakdee, Jayakorn Vongkulbhisal, Nuttapong Chairatanakul, Masashi Sugiyama [pdf] [supp] [arXiv] [bibtex]
VIP-DeepLab: Learning Visual Perception With Depth-Aware Video Panoptic Segmentation Siyuan Qiao, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen [pdf] [supp] [bibtex]
Sequence-to-Sequence Contrastive Learning for Text Recognition Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona [pdf] [supp] [arXiv] [bibtex]
Prototype-Supervised Adversarial Network for Targeted Attack of Deep Hashing Xunguang Wang, Zheng Zhang, Baoyuan Wu, Fumin Shen, Guangming Lu [pdf] [arXiv] [bibtex]
PD-GAN: Probabilistic Diverse GAN for Image Inpainting Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao [pdf] [supp] [bibtex]
Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph [pdf] [supp] [arXiv] [bibtex]
Learning Deep Latent Variable Models by Short-Run MCMC Inference With Optimal Transport Correction Dongsheng An, Jianwen Xie, Ping Li [pdf] [supp] [bibtex]
MobileDets: Searching for Object Detection Architectures for Mobile Accelerators Yunyang Xiong, Hanxiao Liu, Suyog Gupta, Berkin Akin, Gabriel Bender, Yongzhe Wang, Pieter-Jan Kindermans, Mingxing Tan, Vikas Singh, Bo Chen [pdf] [supp] [arXiv] [bibtex]
Self-Supervised Geometric Perception Heng Yang, Wei Dong, Luca Carlone, Vladlen Koltun [pdf] [supp] [arXiv] [bibtex]
CutPaste: Self-Supervised Learning for Anomaly Detection and Localization Chun-Liang Li, Kihyuk Sohn, Jinsung Yoon, Tomas Pfister [pdf] [supp] [arXiv] [bibtex]
Open World Compositional Zero-Shot Learning Massimiliano Mancini, Muhammad Ferjad Naeem, Yongqin Xian, Zeynep Akata [pdf] [supp] [bibtex]
Bi-GCN: Binary Graph Convolutional Network Junfu Wang, Yunhong Wang, Zhen Yang, Liang Yang, Yuanfang Guo [pdf] [supp] [bibtex]
Complementary Relation Contrastive Distillation Jinguo Zhu, Shixiang Tang, Dapeng Chen, Shijie Yu, Yakun Liu, Mingzhe Rong, Aijun Yang, Xiaohua Wang [pdf] [arXiv] [bibtex]
UnrealPerson: An Adaptive Pipeline Towards Costless Person Re-Identification Tianyu Zhang, Lingxi Xie, Longhui Wei, Zijie Zhuang, Yongfei Zhang, Bo Li, Qi Tian [pdf] [supp] [arXiv] [bibtex]
Iterative Filter Adaptive Network for Single Image Defocus Deblurring Junyong Lee, Hyeongseok Son, Jaesung Rim, Sunghyun Cho, Seungyong Lee [pdf] [supp] [bibtex]
UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning Kunming Luo, Chuan Wang, Shuaicheng Liu, Haoqiang Fan, Jue Wang, Jian Sun [pdf] [arXiv] [bibtex]
House-GAN++: Generative Adversarial Layout Refinement Network towards Intelligent Computational Agent for Professional Architects Nelson Nauata, Sepidehsadat Hosseini, Kai-Hung Chang, Hang Chu, Chin-Yi Cheng, Yasutaka Furukawa [pdf] [supp] [bibtex]
HDR Environment Map Estimation for Real-Time Augmented Reality Gowri Somanath, Daniel Kurz [pdf] [supp] [arXiv] [bibtex]
OTA: Optimal Transport Assignment for Object Detection Zheng Ge, Songtao Liu, Zeming Li, Osamu Yoshie, Jian Sun [pdf] [supp] [arXiv] [bibtex]
Progressive Semantic Segmentation Chuong Huynh, Anh Tuan Tran, Khoa Luu, Minh Hoai [pdf] [supp] [arXiv] [bibtex]
BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond Kelvin C.K. Chan, Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy [pdf] [supp] [arXiv] [bibtex]
Efficient Multi-Stage Video Denoising With Recurrent Spatio-Temporal Fusion Matteo Maggioni, Yibin Huang, Cheng Li, Shuai Xiao, Zhongqian Fu, Fenglong Song [pdf] [supp] [arXiv] [bibtex]
Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map Elmira Amirloo, Mohsen Rohani, Ershad Banijamali, Jun Luo, Pascal Poupart [pdf] [arXiv] [bibtex]
Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking Fatemeh Saleh, Sadegh Aliakbarian, Hamid Rezatofighi, Mathieu Salzmann, Stephen Gould [pdf] [supp] [arXiv] [bibtex]
Stay Positive: Non-Negative Image Synthesis for Augmented Reality Katie Luo, Guandao Yang, Wenqi Xian, Harald Haraldsson, Bharath Hariharan, Serge Belongie [pdf] [supp] [bibtex]
3D-to-2D Distillation for Indoor Scene Parsing Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu [pdf] [arXiv] [bibtex]
Learning the Best Pooling Strategy for Visual Semantic Embedding Jiacheng Chen, Hexiang Hu, Hao Wu, Yuning Jiang, Changhu Wang [pdf] [supp] [arXiv] [bibtex]
GLAVNet: Global-Local Audio-Visual Cues for Fine-Grained Material Recognition Fengmin Shi, Jie Guo, Haonan Zhang, Shan Yang, Xiying Wang, Yanwen Guo [pdf] [supp] [bibtex]
Refining Pseudo Labels With Clustering Consensus Over Generations for Unsupervised Object Re-Identification Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li [pdf] [bibtex]
Regularizing Generative Adversarial Networks Under Limited Data Hung-Yu Tseng, Lu Jiang, Ce Liu, Ming-Hsuan Yang, Weilong Yang [pdf] [supp] [arXiv] [bibtex]
Skeleton Merger: An Unsupervised Aligned Keypoint Detector Ruoxi Shi, Zhengrong Xue, Yang You, Cewu Lu [pdf] [arXiv] [bibtex]
Regularizing Neural Networks via Adversarial Model Perturbation Yaowei Zheng, Richong Zhang, Yongyi Mao [pdf] [supp] [arXiv] [bibtex]
Learning by Aligning Videos in Time Sanjay Haresh, Sateesh Kumar, Huseyin Coskun, Shahram N. Syed, Andrey Konin, Zeeshan Zia, Quoc-Huy Tran [pdf] [supp] [arXiv] [bibtex]
Contrastive Neural Architecture Search With Neural Architecture Comparators Yaofo Chen, Yong Guo, Qi Chen, Minli Li, Wei Zeng, Yaowei Wang, Mingkui Tan [pdf] [supp] [arXiv] [bibtex]
Implicit Feature Alignment: Learn To Convert Text Recognizer to Text Spotter Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo [pdf] [bibtex]
Populating 3D Scenes by Learning Human-Scene Interaction Mohamed Hassan, Partha Ghosh, Joachim Tesch, Dimitrios Tzionas, Michael J. Black [pdf] [supp] [arXiv] [bibtex]
Variational Pedestrian Detection Yuang Zhang, Huanyu He, Jianguo Li, Yuxi Li, John See, Weiyao Lin [pdf] [supp] [arXiv] [bibtex]
SIPSA-Net: Shift-Invariant Pan Sharpening With Moving Object Alignment for Satellite Imagery Jaehyup Lee, Soomin Seo, Munchurl Kim [pdf] [supp] [bibtex]
Large-Scale Localization Datasets in Crowded Indoor Spaces Donghwan Lee, Soohyun Ryu, Suyong Yeon, Yonghan Lee, Deokhwa Kim, Cheolho Han, Yohann Cabon, Philippe Weinzaepfel, Nicolas Guerin, Gabriela Csurka, Martin Humenberger [pdf] [supp] [arXiv] [bibtex]
Distilling Causal Effect of Data in Class-Incremental Learning Xinting Hu, Kaihua Tang, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang [pdf] [supp] [arXiv] [bibtex]
Backdoor Attacks Against Deep Learning Systems in the Physical World Emily Wenger, Josephine Passananti, Arjun Nitin Bhagoji, Yuanshun Yao, Haitao Zheng, Ben Y. Zhao [pdf] [supp] [arXiv] [bibtex]
A Multiplexed Network for End-to-End, Multilingual OCR Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J Liang, Praveen Krishnan, Xi Yin, Tal Hassner [pdf] [arXiv] [bibtex]
Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency Xin Lai, Zhuotao Tian, Li Jiang, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia [pdf] [supp] [bibtex]
Causal Hidden Markov Model for Time Series Disease Forecasting Jing Li, Botong Wu, Xinwei Sun, Yizhou Wang [pdf] [supp] [arXiv] [bibtex]
Generalizable Pedestrian Detection: The Elephant in the Room Irtiza Hasan, Shengcai Liao, Jinpeng Li, Saad Ullah Akram, Ling Shao [pdf] [arXiv] [bibtex]
Focus on Local: Detecting Lane Marker From Bottom Up via Key Point Zhan Qu, Huan Jin, Yang Zhou, Zhen Yang, Wei Zhang [pdf] [arXiv] [bibtex]
Memory-Guided Unsupervised Image-to-Image Translation Somi Jeong, Youngjung Kim, Eungbean Lee, Kwanghoon Sohn [pdf] [arXiv] [bibtex]
Incremental Few-Shot Instance Segmentation Dan Andrei Ganea, Bas Boom, Ronald Poppe [pdf] [supp] [arXiv] [bibtex]
Mining Better Samples for Contrastive Learning of Temporal Correspondence Sangryul Jeon, Dongbo Min, Seungryong Kim, Kwanghoon Sohn [pdf] [supp] [bibtex]
Scene-Aware Generative Network for Human Motion Synthesis Jingbo Wang, Sijie Yan, Bo Dai, Dahua Lin [pdf] [supp] [arXiv] [bibtex]
Learning Neural Representation of Camera Pose with Matrix Representation of Pose Shift via View Synthesis Yaxuan Zhu, Ruiqi Gao, Siyuan Huang, Song-Chun Zhu, Ying Nian Wu [pdf] [supp] [arXiv] [bibtex]
PML: Progressive Margin Loss for Long-Tailed Age Classification Zongyong Deng, Hao Liu, Yaoxing Wang, Chenyang Wang, Zekuan Yu, Xuehong Sun [pdf] [arXiv] [bibtex]
Single Image Depth Prediction With Wavelet Decomposition Michael Ramamonjisoa, Michael Firman, Jamie Watson, Vincent Lepetit, Daniyar Turmukhambetov [pdf] [supp] [bibtex]
PVGNet: A Bottom-Up One-Stage 3D Object Detector With Integrated Multi-Level Features Zhenwei Miao, Jikai Chen, Hongyu Pan, Ruiwen Zhang, Kaixuan Liu, Peihan Hao, Jun Zhu, Yang Wang, Xin Zhan [pdf] [bibtex]
Exemplar-Based Open-Set Panoptic Segmentation Network Jaedong Hwang, Seoung Wug Oh, Joon-Young Lee, Bohyung Han [pdf] [supp] [arXiv] [bibtex]
KOALAnet: Blind Super-Resolution Using Kernel-Oriented Adaptive Local Adjustment Soo Ye Kim, Hyeonjun Sim, Munchurl Kim [pdf] [supp] [arXiv] [bibtex]
Learning Deep Classifiers Consistent With Fine-Grained Novelty Detection Jiacheng Cheng, Nuno Vasconcelos [pdf] [supp] [bibtex]
Multiple Object Tracking With Correlation Learning Qiang Wang, Yun Zheng, Pan Pan, Yinghui Xu [pdf] [arXiv] [bibtex]
SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction From Video Data Yuan-Ting Hu, Jiahong Wang, Raymond A. Yeh, Alexander G. Schwing [pdf] [arXiv] [bibtex]
PixMatch: Unsupervised Domain Adaptation via Pixelwise Consistency Training Luke Melas-Kyriazi, Arjun K. Manrai [pdf] [supp] [bibtex]
Deep RGB-D Saliency Detection With Depth-Sensitive Attention and Automatic Multi-Modal Fusion Peng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, Xi Li [pdf] [supp] [bibtex]
Exploring Sparsity in Image Super-Resolution for Efficient Inference Longguang Wang, Xiaoyu Dong, Yingqian Wang, Xinyi Ying, Zaiping Lin, Wei An, Yulan Guo [pdf] [supp] [arXiv] [bibtex]
Positive Sample Propagation Along the Audio-Visual Event Line Jinxing Zhou, Liang Zheng, Yiran Zhong, Shijie Hao, Meng Wang [pdf] [arXiv] [bibtex]
Understanding the Behaviour of Contrastive Loss Feng Wang, Huaping Liu [pdf] [supp] [arXiv] [bibtex]
Variational Prototype Learning for Deep Face Recognition Jiankang Deng, Jia Guo, Jing Yang, Alexandros Lattas, Stefanos Zafeiriou [pdf] [bibtex]
StylePeople: A Generative Model of Fullbody Human Avatars Artur Grigorev, Karim Iskakov, Anastasia Ianina, Renat Bashirov, Ilya Zakharkin, Alexander Vakhitov, Victor Lempitsky [pdf] [supp] [arXiv] [bibtex]
Optimal Quantization Using Scaled Codebook Yerlan Idelbayev, Pavlo Molchanov, Maying Shen, Hongxu Yin, Miguel A. Carreira-Perpinan, Jose M. Alvarez [pdf] [bibtex]
RPN Prototype Alignment for Domain Adaptive Object Detector Yixin Zhang, Zilei Wang, Yushi Mao [pdf] [bibtex]
Dual Contradistinctive Generative Autoencoder Gaurav Parmar, Dacheng Li, Kwonjoon Lee, Zhuowen Tu [pdf] [supp] [arXiv] [bibtex]
Binary TTC: A Temporal Geofence for Autonomous Navigation Abhishek Badki, Orazio Gallo, Jan Kautz, Pradeep Sen [pdf] [supp] [arXiv] [bibtex]
Semantic-Aware Video Text Detection Wei Feng, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu [pdf] [bibtex]
Real-Time High-Resolution Background Matting Shanchuan Lin, Andrey Ryabtsev, Soumyadip Sengupta, Brian L. Curless, Steven M. Seitz, Ira Kemelmacher-Shlizerman [pdf] [supp] [arXiv] [bibtex]
Interpretable Social Anchors for Human Trajectory Forecasting in Crowds Parth Kothari, Brian Sifringer, Alexandre Alahi [pdf] [arXiv] [bibtex]
Trajectory Prediction With Latent Belief Energy-Based Model Bo Pang, Tianyang Zhao, Xu Xie, Ying Nian Wu [pdf] [supp] [arXiv] [bibtex]
Metadata Normalization Mandy Lu, Qingyu Zhao, Jiequan Zhang, Kilian M. Pohl, Li Fei-Fei, Juan Carlos Niebles, Ehsan Adeli [pdf] [arXiv] [bibtex]
Multi-Objective Interpolation Training for Robustness To Label Noise Diego Ortego, Eric Arazo, Paul Albert, Noel E. O’Connor, Kevin McGuinness [pdf] [arXiv] [bibtex]
PhySG: Inverse Rendering With Spherical Gaussians for Physics-Based Material Editing and Relighting Kai Zhang, Fujun Luan, Qianqian Wang, Kavita Bala, Noah Snavely [pdf] [arXiv] [bibtex]
Predator: Registration of 3D Point Clouds With Low Overlap Shengyu Huang, Zan Gojcic, Mikhail Usvyatsov, Andreas Wieser, Konrad Schindler [pdf] [supp] [arXiv] [bibtex]
Hierarchical Motion Understanding via Motion Programs Sumith Kulal, Jiayuan Mao, Alex Aiken, Jiajun Wu [pdf] [arXiv] [bibtex]
Neural Side-by-Side: Predicting Human Preferences for No-Reference Super-Resolution Evaluation Valentin Khrulkov, Artem Babenko [pdf] [bibtex]
Coordinate Attention for Efficient Mobile Network Design Qibin Hou, Daquan Zhou, Jiashi Feng [pdf] [arXiv] [bibtex]
Stylized Neural Painting Zhengxia Zou, Tianyang Shi, Shuang Qiu, Yi Yuan, Zhenwei Shi [pdf] [supp] [arXiv] [bibtex]
Image Change Captioning by Learning From an Auxiliary Task Mehrdad Hosseinzadeh, Yang Wang [pdf] [bibtex]
Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification Yuyang Zhao, Zhun Zhong, Fengxiang Yang, Zhiming Luo, Yaojin Lin, Shaozi Li, Nicu Sebe [pdf] [supp] [arXiv] [bibtex]
Discriminative Appearance Modeling With Multi-Track Pooling for Real-Time Multi-Object Tracking Chanho Kim, Li Fuxin, Mazen Alotaibi, James M. Rehg [pdf] [supp] [arXiv] [bibtex]
LASR: Learning Articulated Shape Reconstruction From a Monocular Video Gengshan Yang, Deqing Sun, Varun Jampani, Daniel Vlasic, Forrester Cole, Huiwen Chang, Deva Ramanan, William T. Freeman, Ce Liu [pdf] [supp] [arXiv] [bibtex]
FVC: A New Framework Towards Deep Video Compression in Feature Space Zhihao Hu, Guo Lu, Dong Xu [pdf] [arXiv] [bibtex]
Exponential Moving Average Normalization for Self-Supervised and Semi-Supervised Learning Zhaowei Cai, Avinash Ravichandran, Subhransu Maji, Charless Fowlkes, Zhuowen Tu, Stefano Soatto [pdf] [supp] [arXiv] [bibtex]
Confluent Vessel Trees With Accurate Bifurcations Zhongwen Zhang, Dmitrii Marin, Maria Drangova, Yuri Boykov [pdf] [supp] [arXiv] [bibtex]
Intentonomy: A Dataset and Study Towards Human Intent Understanding Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim [pdf] [supp] [arXiv] [bibtex]
End-to-End Rotation Averaging With Multi-Source Propagation Luwei Yang, Heng Li, Jamal Ahmed Rahim, Zhaopeng Cui, Ping Tan [pdf] [supp] [bibtex]
Controllable Image Restoration for Under-Display Camera in Smartphones Kinam Kwon, Eunhee Kang, Sangwon Lee, Su-Jin Lee, Hyong-Euk Lee, ByungIn Yoo, Jae-Joon Han [pdf] [supp] [bibtex]
Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification Xudong Tian, Zhizhong Zhang, Shaohui Lin, Yanyun Qu, Yuan Xie, Lizhuang Ma [pdf] [supp] [arXiv] [bibtex]
Context-Aware Biaffine Localizing Network for Temporal Sentence Grounding Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Yu Cheng, Wei Wei, Zichuan Xu, Yulai Xie [pdf] [arXiv] [bibtex]
NewtonianVAE: Proportional Control and Goal Identification From Pixels via Physical Latent Spaces Miguel Jaques, Michael Burke, Timothy M. Hospedales [pdf] [arXiv] [bibtex]
Auto-Exposure Fusion for Single-Image Shadow Removal Lan Fu, Changqing Zhou, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, Song Wang [pdf] [arXiv] [bibtex]
Anticipating Human Actions by Correlating Past With the Future With Jaccard Similarity Measures Basura Fernando, Samitha Herath [pdf] [supp] [arXiv] [bibtex]
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces From Video Using Pose and Lighting Normalization Avisek Lahiri, Vivek Kwatra, Christian Frueh, John Lewis, Chris Bregler [pdf] [supp] [bibtex]
Simpler Certified Radius Maximization by Propagating Covariances Xingjian Zhen, Rudrasis Chakraborty, Vikas Singh [pdf] [supp] [arXiv] [bibtex]
A 3D GAN for Improved Large-Pose Facial Recognition Richard T. Marriott, Sami Romdhani, Liming Chen [pdf] [arXiv] [bibtex]
Repopulating Street Scenes Yifan Wang, Andrew Liu, Richard Tucker, Jiajun Wu, Brian L. Curless, Steven M. Seitz, Noah Snavely [pdf] [supp] [arXiv] [bibtex]
ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring Dongxu Li, Chenchen Xu, Kaihao Zhang, Xin Yu, Yiran Zhong, Wenqi Ren, Hanna Suominen, Hongdong Li [pdf] [arXiv] [bibtex]
Unsupervised Object Detection With LIDAR Clues Hao Tian, Yuntao Chen, Jifeng Dai, Zhaoxiang Zhang, Xizhou Zhu [pdf] [supp] [arXiv] [bibtex]
TesseTrack: End-to-End Learnable Multi-Person Articulated 3D Pose Tracking N Dinesh Reddy, Laurent Guigues, Leonid Pishchulin, Jayan Eledath, Srinivasa G. Narasimhan [pdf] [supp] [bibtex]
HVPR: Hybrid Voxel-Point Representation for Single-Stage 3D Object Detection Jongyoun Noh, Sanghoon Lee, Bumsub Ham [pdf] [supp] [arXiv] [bibtex]
SOE-Net: A Self-Attention and Orientation Encoding Network for Point Cloud Based Place Recognition Yan Xia, Yusheng Xu, Shuang Li, Rui Wang, Juan Du, Daniel Cremers, Uwe Stilla [pdf] [bibtex]
Controlling the Rain: From Removal to Rendering Siqi Ni, Xueyun Cao, Tao Yue, Xuemei Hu [pdf] [supp] [bibtex]
KeypointDeformer: Unsupervised 3D Keypoint Discovery for Shape Control Tomas Jakab, Richard Tucker, Ameesh Makadia, Jiajun Wu, Noah Snavely, Angjoo Kanazawa [pdf] [supp] [arXiv] [bibtex]
A2-FPN: Attention Aggregation Based Feature Pyramid Network for Instance Segmentation Miao Hu, Yali Li, Lu Fang, Shengjin Wang [pdf] [supp] [bibtex]
Quasi-Dense Similarity Learning for Multiple Object Tracking Jiangmiao Pang, Linlu Qiu, Xia Li, Haofeng Chen, Qi Li, Trevor Darrell, Fisher Yu [pdf] [supp] [arXiv] [bibtex]
Simultaneously Localize, Segment and Rank the Camouflaged Objects Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan [pdf] [supp] [arXiv] [bibtex]
Hybrid Message Passing With Performance-Driven Structures for Facial Action Unit Detection Tengfei Song, Zijun Cui, Wenming Zheng, Qiang Ji [pdf] [supp] [bibtex]
Distilling Object Detectors via Decoupled Features Jianyuan Guo, Kai Han, Yunhe Wang, Han Wu, Xinghao Chen, Chunjing Xu, Chang Xu [pdf] [supp] [arXiv] [bibtex]
Roof-GAN: Learning To Generate Roof Geometry and Relations for Residential Houses Yiming Qian, Hao Zhang, Yasutaka Furukawa [pdf] [supp] [bibtex]
No Shadow Left Behind: Removing Objects and Their Shadows Using Approximate Lighting and Geometry Edward Zhang, Ricardo Martin-Brualla, Janne Kontkanen, Brian L. Curless [pdf] [supp] [bibtex]
NetAdaptV2: Efficient Neural Architecture Search With Fast Super-Network Training and Architecture Optimization Tien-Ju Yang, Yi-Lun Liao, Vivienne Sze [pdf] [supp] [arXiv] [bibtex]
PhD Learning: Learning With Pompeiu-Hausdorff Distances for Video-Based Vehicle Re-Identification Jianan Zhao, Fengliang Qi, Guangyu Ren, Lin Xu [pdf] [supp] [bibtex]
DeepVideoMVS: Multi-View Stereo on Video With Recurrent Spatio-Temporal Fusion Arda Duzceker, Silvano Galliani, Christoph Vogel, Pablo Speciale, Mihai Dusmanu, Marc Pollefeys [pdf] [supp] [arXiv] [bibtex]
Saliency-Guided Image Translation Lai Jiang, Mai Xu, Xiaofei Wang, Leonid Sigal [pdf] [supp] [bibtex]
Weakly Supervised Learning of Rigid 3D Scene Flow Zan Gojcic, Or Litany, Andreas Wieser, Leonidas J. Guibas, Tolga Birdal [pdf] [supp] [arXiv] [bibtex]
InverseForm: A Loss Function for Structured Boundary-Aware Segmentation Shubhankar Borse, Ying Wang, Yizhe Zhang, Fatih Porikli [pdf] [supp] [arXiv] [bibtex]
Towards Accurate Text-Based Image Captioning With Content Diversity Exploration Guanghui Xu, Shuaicheng Niu, Mingkui Tan, Yucheng Luo, Qing Du, Qi Wu [pdf] [supp] [arXiv] [bibtex]
Learning Placeholders for Open-Set Recognition Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan [pdf] [supp] [arXiv] [bibtex]
CodedStereo: Learned Phase Masks for Large Depth-of-Field Stereo Shiyu Tan, Yicheng Wu, Shoou-I Yu, Ashok Veeraraghavan [pdf] [supp] [arXiv] [bibtex]
More Photos Are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song [pdf] [supp] [arXiv] [bibtex]
Unsupervised Hyperbolic Representation Learning via Message Passing Auto-Encoders Jiwoong Park, Junho Cho, Hyung Jin Chang, Jin Young Choi [pdf] [supp] [arXiv] [bibtex]
Retinex-Inspired Unrolling With Cooperative Prior Architecture Search for Low-Light Image Enhancement Risheng Liu, Long Ma, Jiaao Zhang, Xin Fan, Zhongxuan Luo [pdf] [supp] [arXiv] [bibtex]
Relevance-CAM: Your Model Already Knows Where To Look Jeong Ryong Lee, Sewon Kim, Inyong Park, Taejoon Eo, Dosik Hwang [pdf] [supp] [bibtex]
Boundary IoU: Improving Object-Centric Image Segmentation Evaluation Bowen Cheng, Ross Girshick, Piotr Dollar, Alexander C. Berg, Alexander Kirillov [pdf] [supp] [arXiv] [bibtex]
KeepAugment: A Simple Information-Preserving Data Augmentation Approach Chengyue Gong, Dilin Wang, Meng Li, Vikas Chandra, Qiang Liu [pdf] [arXiv] [bibtex]
On Robustness and Transferability of Convolutional Neural Networks Josip Djolonga, Jessica Yung, Michael Tschannen, Rob Romijnders, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Matthias Minderer, Alexander D’Amour, Dan Moldovan, Sylvain Gelly, Neil Houlsby, Xiaohua Zhai, Mario Lucic [pdf] [supp] [arXiv] [bibtex]
POSEFusion: Pose-Guided Selective Fusion for Single-View Human Volumetric Capture Zhe Li, Tao Yu, Zerong Zheng, Kaiwen Guo, Yebin Liu [pdf] [supp] [arXiv] [bibtex]
Exploring Adversarial Fake Images on Face Manifold Dongze Li, Wei Wang, Hongxing Fan, Jing Dong [pdf] [arXiv] [bibtex]
Reinforced Attention for Few-Shot Learning and Beyond Jie Hong, Pengfei Fang, Weihao Li, Tong Zhang, Christian Simon, Mehrtash Harandi, Lars Petersson [pdf] [supp] [arXiv] [bibtex]
HOTR: End-to-End Human-Object Interaction Detection With Transformers Bumsoo Kim, Junhyun Lee, Jaewoo Kang, Eun-Sol Kim, Hyunwoo J. Kim [pdf] [supp] [arXiv] [bibtex]
Deep Video Matting via Spatio-Temporal Alignment and Aggregation Yanan Sun, Guanzhi Wang, Qiao Gu, Chi-Keung Tang, Yu-Wing Tai [pdf] [supp] [arXiv] [bibtex]
Triple-Cooperative Video Shadow Detection Zhihao Chen, Liang Wan, Lei Zhu, Jia Shen, Huazhu Fu, Wennan Liu, Jing Qin [pdf] [arXiv] [bibtex]
Scale-Aware Graph Neural Network for Few-Shot Semantic Segmentation Guo-Sen Xie, Jie Liu, Huan Xiong, Ling Shao [pdf] [bibtex]
Continuous Face Aging via Self-Estimated Residual Age Embedding Zeqi Li, Ruowei Jiang, Parham Aarabi [pdf] [supp] [arXiv] [bibtex]
Towards Fast and Accurate Real-World Depth Super-Resolution: Benchmark Dataset and Baseline Lingzhi He, Hongguang Zhu, Feng Li, Huihui Bai, Runmin Cong, Chunjie Zhang, Chunyu Lin, Meiqin Liu, Yao Zhao [pdf] [supp] [bibtex]
Jigsaw Clustering for Unsupervised Visual Representation Learning Pengguang Chen, Shu Liu, Jiaya Jia [pdf] [supp] [arXiv] [bibtex]
DI-Fusion: Online Implicit 3D Reconstruction With Deep Priors Jiahui Huang, Shi-Sheng Huang, Haoxuan Song, Shi-Min Hu [pdf] [supp] [bibtex]
Square Root Bundle Adjustment for Large-Scale Reconstruction Nikolaus Demmel, Christiane Sommer, Daniel Cremers, Vladyslav Usenko [pdf] [supp] [arXiv] [bibtex]
PatchMatch-Based Neighborhood Consensus for Semantic Correspondence Jae Yong Lee, Joseph DeGol, Victor Fragoso, Sudipta N. Sinha [pdf] [supp] [bibtex]
Representative Forgery Mining for Fake Face Detection Chengrui Wang, Weihong Deng [pdf] [arXiv] [bibtex]
Look Closer To Segment Better: Boundary Patch Refinement for Instance Segmentation Chufeng Tang, Hang Chen, Xiao Li, Jianmin Li, Zhaoxiang Zhang, Xiaolin Hu [pdf] [supp] [arXiv] [bibtex]
Adaptive Class Suppression Loss for Long-Tail Object Detection Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Jinqiao Wang, Ming Tang [pdf] [arXiv] [bibtex]
ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References Yannan He, Anqi Pang, Xin Chen, Han Liang, Minye Wu, Yuexin Ma, Lan Xu [pdf] [arXiv] [bibtex]
Automated Log-Scale Quantization for Low-Cost Deep Neural Networks Sangyun Oh, Hyeonuk Sim, Sugil Lee, Jongeun Lee [pdf] [supp] [bibtex]
Hallucination Improves Few-Shot Object Detection Weilin Zhang, Yu-Xiong Wang [pdf] [supp] [arXiv] [bibtex]
Efficient Conditional GAN Transfer With Knowledge Propagation Across Classes Mohamad Shahbazi, Zhiwu Huang, Danda Pani Paudel, Ajad Chhatkuli, Luc Van Gool [pdf] [supp] [arXiv] [bibtex]
Fully Convolutional Scene Graph Generation Hengyue Liu, Ning Yan, Masood Mortazavi, Bir Bhanu [pdf] [supp] [arXiv] [bibtex]
Crossing Cuts Polygonal Puzzles: Models and Solvers Peleg Harel, Ohad Ben-Shahar [pdf] [supp] [bibtex]
Graph-Based High-Order Relation Modeling for Long-Term Action Recognition Jiaming Zhou, Kun-Yu Lin, Haoxin Li, Wei-Shi Zheng [pdf] [supp] [bibtex]
Positive-Unlabeled Data Purification in the Wild for Object Detection Jianyuan Guo, Kai Han, Han Wu, Chao Zhang, Xinghao Chen, Chunjing Xu, Chang Xu, Yunhe Wang [pdf] [bibtex]
ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo [pdf] [supp] [arXiv] [bibtex]
Network Quantization With Element-Wise Gradient Scaling Junghyup Lee, Dohyung Kim, Bumsub Ham [pdf] [arXiv] [bibtex]
img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation Vitor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner [pdf] [arXiv] [bibtex]
Sparse Multi-Path Corrections in Fringe Projection Profilometry Yu Zhang, Daniel Lau, David Wipf [pdf] [bibtex]
NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi [pdf] [supp] [bibtex]
Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder Tal Daniel, Aviv Tamar [pdf] [supp] [bibtex]
Energy-Based Learning for Scene Graph Generation Mohammed Suhail, Abhay Mittal, Behjat Siddiquie, Chris Broaddus, Jayan Eledath, Gerard Medioni, Leonid Sigal [pdf] [supp] [arXiv] [bibtex]
Zillow Indoor Dataset: Annotated Floor Plans With 360deg Panoramas and 3D Room Layouts Steve Cruz, Will Hutchcroft, Yuguang Li, Naji Khosravan, Ivaylo Boyadzhiev, Sing Bing Kang [pdf] [supp] [bibtex]
Progressive Contour Regression for Arbitrary-Shape Scene Text Detection Pengwen Dai, Sanyi Zhang, Hua Zhang, Xiaochun Cao [pdf] [bibtex]
UV-Net: Learning From Boundary Representations Pradeep Kumar Jayaraman, Aditya Sanghi, Joseph G. Lambourne, Karl D.D. Willis, Thomas Davies, Hooman Shayani, Nigel Morris [pdf] [supp] [bibtex]
MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation Sanjay Kariyappa, Atul Prakash, Moinuddin K Qureshi [pdf] [supp] [arXiv] [bibtex]
Universal Spectral Adversarial Attacks for Deformable Shapes Arianna Rampini, Franco Pestarini, Luca Cosmo, Simone Melzi, Emanuele Rodola [pdf] [supp] [arXiv] [bibtex]
Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation Xiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto Sangiovanni Vincentelli [pdf] [supp] [arXiv] [bibtex]
HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation Jiefeng Li, Chao Xu, Zhicun Chen, Siyuan Bian, Lixin Yang, Cewu Lu [pdf] [supp] [arXiv] [bibtex]
Human De-Occlusion: Invisible Perception and Recovery for Humans Qiang Zhou, Shiyin Wang, Yitong Wang, Zilong Huang, Xinggang Wang [pdf] [supp] [bibtex]
The Neural Tangent Link Between CNN Denoisers and Non-Local Filters Julian Tachella, Junqi Tang, Mike Davies [pdf] [arXiv] [bibtex]
Achieving Robustness in Classification Using Optimal Transport With Hinge Regularization Mathieu Serrurier, Franck Mamalet, Alberto Gonzalez-Sanz, Thibaut Boissin, Jean-Michel Loubes, Eustasio del Barrio [pdf] [bibtex]
Stochastic Image-to-Video Synthesis Using cINNs Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Bjorn Ommer [pdf] [supp] [arXiv] [bibtex]
Ego-Exo: Transferring Visual Representations From Third-Person to First-Person Videos Yanghao Li, Tushar Nagarajan, Bo Xiong, Kristen Grauman [pdf] [supp] [bibtex]
Dynamic Slimmable Network Changlin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang [pdf] [supp] [arXiv] [bibtex]
Jo-SRC: A Contrastive Approach for Combating Noisy Labels Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang [pdf] [bibtex]
Deep Lucas-Kanade Homography for Multimodal Image Alignment Yiming Zhao, Xinming Huang, Ziming Zhang [pdf] [arXiv] [bibtex]
clDice – A Novel Topology-Preserving Loss Function for Tubular Structure Segmentation Suprosanna Shit, Johannes C. Paetzold, Anjany Sekuboyina, Ivan Ezhov, Alexander Unger, Andrey Zhylka, Josien P. W. Pluim, Ulrich Bauer, Bjoern H. Menze [pdf] [supp] [bibtex]
Hyper-LifelongGAN: Scalable Lifelong Learning for Image Conditioned Generation Mengyao Zhai, Lei Chen, Greg Mori [pdf] [bibtex]
Semi-Supervised Synthesis of High-Resolution Editable Textures for 3D Humans Bindita Chaudhuri, Nikolaos Sarafianos, Linda Shapiro, Tony Tung [pdf] [supp] [arXiv] [bibtex]
CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback Seungmin Lee, Dongwan Kim, Bohyung Han [pdf] [supp] [bibtex]
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers Antoine Miech, Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, Andrew Zisserman [pdf] [arXiv] [bibtex]
RGB-D Local Implicit Function for Depth Completion of Transparent Objects Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox [pdf] [supp] [bibtex]
Fingerspelling Detection in American Sign Language Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu [pdf] [supp] [arXiv] [bibtex]
Uncertainty Reduction for Model Adaptation in Semantic Segmentation Prabhu Teja S, Francois Fleuret [pdf] [supp] [bibtex]
Learning Triadic Belief Dynamics in Nonverbal Communication From Videos Lifeng Fan, Shuwen Qiu, Zilong Zheng, Tao Gao, Song-Chun Zhu, Yixin Zhu [pdf] [supp] [arXiv] [bibtex]
Temporal Modulation Network for Controllable Space-Time Video Super-Resolution Gang Xu, Jun Xu, Zhen Li, Liang Wang, Xing Sun, Ming-Ming Cheng [pdf] [supp] [arXiv] [bibtex]
Zero-Shot Single Image Restoration Through Controlled Perturbation of Koschmieder’s Model Aupendu Kar, Sobhan Kanti Dhara, Debashis Sen, Prabir Kumar Biswas [pdf] [supp] [bibtex]
Uncertainty-Aware Camera Pose Estimation From Points and Lines Alexander Vakhitov, Luis Ferraz, Antonio Agudo, Francesc Moreno-Noguer [pdf] [supp] [bibtex]
Temporal Context Aggregation Network for Temporal Action Proposal Refinement Zhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang [pdf] [arXiv] [bibtex]
Information-Theoretic Segmentation by Inpainting Error Maximization Pedro Savarese, Sunnie S. Y. Kim, Michael Maire, Greg Shakhnarovich, David McAllester [pdf] [supp] [arXiv] [bibtex]
Adaptive Prototype Learning and Allocation for Few-Shot Segmentation Gen Li, Varun Jampani, Laura Sevilla-Lara, Deqing Sun, Jonghyun Kim, Joongkyu Kim [pdf] [supp] [bibtex]
RefineMask: Towards High-Quality Instance Segmentation With Fine-Grained Features Gang Zhang, Xin Lu, Jingru Tan, Jianmin Li, Zhaoxiang Zhang, Quanquan Li, Xiaolin Hu [pdf] [supp] [arXiv] [bibtex]
DCNAS: Densely Connected Neural Architecture Search for Semantic Image Segmentation Xiong Zhang, Hongmin Xu, Hong Mo, Jianchao Tan, Cheng Yang, Lei Wang, Wenqi Ren [pdf] [supp] [arXiv] [bibtex]
Tackling the Ill-Posedness of Super-Resolution Through Adaptive Target Generation Younghyun Jo, Seoung Wug Oh, Peter Vajda, Seon Joo Kim [pdf] [supp] [bibtex]
DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation Yufan He, Dong Yang, Holger Roth, Can Zhao, Daguang Xu [pdf] [arXiv] [bibtex]
Im2Vec: Synthesizing Vector Graphics Without Vector Supervision Pradyumna Reddy, Michael Gharbi, Michal Lukac, Niloy J. Mitra [pdf] [supp] [arXiv] [bibtex]
Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing Yuanyuan Yuan, Shuai Wang, Mingyue Jiang, Tsong Yueh Chen [pdf] [supp] [bibtex]
Unsupervised Part Segmentation Through Disentangling Appearance and Shape Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu [pdf] [supp] [arXiv] [bibtex]
Adversarial Imaging Pipelines Buu Phan, Fahim Mannan, Felix Heide [pdf] [supp] [arXiv] [bibtex]
Adaptive Consistency Regularization for Semi-Supervised Transfer Learning Abulikemu Abuduweili, Xingjian Li, Humphrey Shi, Cheng-Zhong Xu, Dejing Dou [pdf] [supp] [arXiv] [bibtex]
GANmut: Learning Interpretable Conditional Space for Gamut of Emotions Stefano d’Apolito, Danda Pani Paudel, Zhiwu Huang, Andres Romero, Luc Van Gool [pdf] [supp] [bibtex]
StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation Zongze Wu, Dani Lischinski, Eli Shechtman [pdf] [supp] [arXiv] [bibtex]
Rethinking the Heatmap Regression for Bottom-Up Human Pose Estimation Zhengxiong Luo, Zhicheng Wang, Yan Huang, Liang Wang, Tieniu Tan, Erjin Zhou [pdf] [arXiv] [bibtex]
From Semantic Categories to Fixations: A Novel Weakly-Supervised Visual-Auditory Saliency Detection Approach Guotao Wang, Chenglizhao Chen, Deng-Ping Fan, Aimin Hao, Hong Qin [pdf] [bibtex]
High-Fidelity Face Tracking for AR/VR via Deep Lighting Adaptation Lele Chen, Chen Cao, Fernando De la Torre, Jason Saragih, Chenliang Xu, Yaser Sheikh [pdf] [supp] [arXiv] [bibtex]
Mixed-Privacy Forgetting in Deep Networks Aditya Golatkar, Alessandro Achille, Avinash Ravichandran, Marzia Polito, Stefano Soatto [pdf] [supp] [arXiv] [bibtex]
TediGAN: Text-Guided Diverse Face Image Generation and Manipulation Weihao Xia, Yujiu Yang, Jing-Hao Xue, Baoyuan Wu [pdf] [arXiv] [bibtex]
Affective Processes: Stochastic Modelling of Temporal Context for Emotion and Facial Expression Recognition Enrique Sanchez, Mani Kumar Tellamekala, Michel Valstar, Georgios Tzimiropoulos [pdf] [supp] [arXiv] [bibtex]
ID-Unet: Iterative Soft and Hard Deformation for View Synthesis Mingyu Yin, Li Sun, Qingli Li [pdf] [bibtex]
Positional Encoding As Spatial Inductive Bias in GANs Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, Chen Change Loy [pdf] [supp] [arXiv] [bibtex]
Mask-ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging Ilya Chugunov, Seung-Hwan Baek, Qiang Fu, Wolfgang Heidrich, Felix Heide [pdf] [supp] [bibtex]
QPP: Real-Time Quantization Parameter Prediction for Deep Neural Networks Vladimir Kryzhanovskiy, Gleb Balitskiy, Nikolay Kozyrskiy, Aleksandr Zuruev [pdf] [supp] [bibtex]
Nighttime Visibility Enhancement by Increasing the Dynamic Range and Suppression of Light Effects Aashish Sharma, Robby T. Tan [pdf] [bibtex]
Self-Supervised Augmentation Consistency for Adapting Semantic Segmentation Nikita Araslanov, Stefan Roth [pdf] [supp] [arXiv] [bibtex]
Patch-VQ: ‘Patching Up’ the Video Quality Problem Zhenqiang Ying, Maniratnam Mandal, Deepti Ghadiyaram, Alan Bovik [pdf] [supp] [bibtex]
Double Low-Rank Representation With Projection Distance Penalty for Clustering Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang [pdf] [supp] [bibtex]
Towards High Fidelity Face Relighting With Realistic Shadows Andrew Hou, Ze Zhang, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu [pdf] [supp] [arXiv] [bibtex]
Multi-View Multi-Person 3D Pose Estimation With Plane Sweep Stereo Jiahao Lin, Gim Hee Lee [pdf] [arXiv] [bibtex]
Fusing the Old with the New: Learning Relative Camera Pose with Geometry-Guided Uncertainty Bingbing Zhuang, Manmohan Chandraker [pdf] [supp] [arXiv] [bibtex]
CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning Chen Wei, Kihyuk Sohn, Clayton Mellina, Alan Yuille, Fan Yang [pdf] [supp] [arXiv] [bibtex]
Towards Diverse Paragraph Captioning for Untrimmed Videos Yuqing Song, Shizhe Chen, Qin Jin [pdf] [supp] [arXiv] [bibtex]
FlowStep3D: Model Unrolling for Self-Supervised Scene Flow Estimation Yair Kittenplon, Yonina C. Eldar, Dan Raviv [pdf] [arXiv] [bibtex]
Adversarial Robustness Across Representation Spaces Pranjal Awasthi, George Yu, Chun-Sung Ferng, Andrew Tomkins, Da-Cheng Juan [pdf] [supp] [arXiv] [bibtex]
MagDR: Mask-Guided Detection and Reconstruction for Defending Deepfakes Zhikai Chen, Lingxi Xie, Shanmin Pang, Yong He, Bo Zhang [pdf] [arXiv] [bibtex]
Neural Deformation Graphs for Globally-Consistent Non-Rigid Reconstruction Aljaz Bozic, Pablo Palafox, Michael Zollhofer, Justus Thies, Angela Dai, Matthias Niessner [pdf] [supp] [arXiv] [bibtex]
Fostering Generalization in Single-View 3D Reconstruction by Learning a Hierarchy of Local and Global Shape Priors Jan Bechtold, Maxim Tatarchenko, Volker Fischer, Thomas Brox [pdf] [supp] [arXiv] [bibtex]
Progressive Semantic-Aware Style Transformation for Blind Face Restoration Chaofeng Chen, Xiaoming Li, Lingbo Yang, Xianhui Lin, Lei Zhang, Kwan-Yee K. Wong [pdf] [supp] [arXiv] [bibtex]
Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association Peisong Wen, Qianqian Xu, Yangbangyan Jiang, Zhiyong Yang, Yuan He, Qingming Huang [pdf] [supp] [arXiv] [bibtex]
Invertible Image Signal Processing Yazhou Xing, Zian Qian, Qifeng Chen [pdf] [supp] [arXiv] [bibtex]
Lighting, Reflectance and Geometry Estimation From 360deg Panoramic Stereo Junxuan Li, Hongdong Li, Yasuyuki Matsushita [pdf] [bibtex]
Building Reliable Explanations of Unreliable Neural Networks: Locally Smoothing Perspective of Model Interpretation Dohun Lim, Hyeonseok Lee, Sungchan Kim [pdf] [supp] [arXiv] [bibtex]
NeX: Real-Time View Synthesis With Neural Basis Expansion Suttisak Wizadwongsa, Pakkapon Phongthawee, Jiraphon Yenphraphai, Supasorn Suwajanakorn [pdf] [supp] [arXiv] [bibtex]
DAT: Training Deep Networks Robust To Label-Noise by Matching the Feature Distributions Yuntao Qu, Shasha Mo, Jianwei Niu [pdf] [supp] [bibtex]
Repetitive Activity Counting by Sight and Sound Yunhua Zhang, Ling Shao, Cees G. M. Snoek [pdf] [supp] [arXiv] [bibtex]
PointGuard: Provably Robust 3D Point Cloud Classification Hongbin Liu, Jinyuan Jia, Neil Zhenqiang Gong [pdf] [supp] [arXiv] [bibtex]
Unsupervised Multi-Source Domain Adaptation for Person Re-Identification Zechen Bai, Zhigang Wang, Jian Wang, Di Hu, Errui Ding [pdf] [supp] [arXiv] [bibtex]
BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation Jungbeom Lee, Jihun Yi, Chaehun Shin, Sungroh Yoon [pdf] [supp] [arXiv] [bibtex]
Boosting Video Representation Learning With Multi-Faceted Integration Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Xiao-Ping Zhang, Dong Wu, Tao Mei [pdf] [bibtex]
Beyond Bounding-Box: Convex-Hull Feature Adaptation for Oriented and Densely Packed Object Detection Zonghao Guo, Chang Liu, Xiaosong Zhang, Jianbin Jiao, Xiangyang Ji, Qixiang Ye [pdf] [bibtex]
3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management Tianyi Zhao, Kai Cao, Jiawen Yao, Isabella Nogues, Le Lu, Lingyun Huang, Jing Xiao, Zhaozheng Yin, Ling Zhang [pdf] [arXiv] [bibtex]
Protecting Intellectual Property of Generative Adversarial Networks From Ambiguity Attacks Ding Sheng Ong, Chee Seng Chan, Kam Woh Ng, Lixin Fan, Qiang Yang [pdf] [supp] [arXiv] [bibtex]
End-to-End High Dynamic Range Camera Pipeline Optimization Nicolas Robidoux, Luis E. Garcia Capel, Dong-eun Seo, Avinash Sharma, Federico Ariza, Felix Heide [pdf] [supp] [bibtex]
Parser-Free Virtual Try-On via Distilling Appearance Flows Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo [pdf] [supp] [arXiv] [bibtex]
GIRAFFE: Representing Scenes As Compositional Generative Neural Feature Fields Michael Niemeyer, Andreas Geiger [pdf] [arXiv] [bibtex]
Single-Stage Instance Shadow Detection With Bidirectional Relation Learning Tianyu Wang, Xiaowei Hu, Chi-Wing Fu, Pheng-Ann Heng [pdf] [supp] [bibtex]
High-Speed Image Reconstruction Through Short-Term Plasticity for Spiking Cameras Yajing Zheng, Lingxiao Zheng, Zhaofei Yu, Boxin Shi, Yonghong Tian, Tiejun Huang [pdf] [supp] [bibtex]
Self-Supervised 3D Mesh Reconstruction From Single Images Tao Hu, Liwei Wang, Xiaogang Xu, Shu Liu, Jiaya Jia [pdf] [supp] [bibtex]
Dual-GAN: Joint BVP and Noise Modeling for Remote Physiological Measurement Hao Lu, Hu Han, S. Kevin Zhou [pdf] [bibtex]
Audio-Visual Instance Discrimination with Cross-Modal Agreement Pedro Morgado, Nuno Vasconcelos, Ishan Misra [pdf] [supp] [arXiv] [bibtex]
Combined Depth Space Based Architecture Search for Person Re-Identification Hanjun Li, Gaojie Wu, Wei-Shi Zheng [pdf] [supp] [arXiv] [bibtex]
Rethinking BiSeNet for Real-Time Semantic Segmentation Mingyuan Fan, Shenqi Lai, Junshi Huang, Xiaoming Wei, Zhenhua Chai, Junfeng Luo, Xiaolin Wei [pdf] [arXiv] [bibtex]
The Spatially-Correlative Loss for Various Image Translation Tasks Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai [pdf] [arXiv] [bibtex]
Learning To Restore Hazy Video: A New Real-World Dataset and a New Method Xinyi Zhang, Hang Dong, Jinshan Pan, Chao Zhu, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Fei Wang [pdf] [supp] [bibtex]
DyGLIP: A Dynamic Graph Model With Link Prediction for Accurate Multi-Camera Multiple Object Tracking Kha Gia Quach, Pha Nguyen, Huu Le, Thanh-Dat Truong, Chi Nhan Duong, Minh-Triet Tran, Khoa Luu [pdf] [supp] [arXiv] [bibtex]
Towards Efficient Tensor Decomposition-Based DNN Model Compression With Optimization Framework Miao Yin, Yang Sui, Siyu Liao, Bo Yuan [pdf] [supp] [bibtex]
User-Guided Line Art Flat Filling With Split Filling Mechanism Lvmin Zhang, Chengze Li, Edgar Simo-Serra, Yi Ji, Tien-Tsin Wong, Chunping Liu [pdf] [bibtex]
Restore From Restored: Video Restoration With Pseudo Clean Video Seunghwan Lee, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim [pdf] [arXiv] [bibtex]
Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion Shi Qiu, Saeed Anwar, Nick Barnes [pdf] [supp] [arXiv] [bibtex]
Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection Qize Yang, Xihan Wei, Biao Wang, Xian-Sheng Hua, Lei Zhang [pdf] [bibtex]
DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls Hongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bin Jiang, Wei Liu [pdf] [supp] [arXiv] [bibtex]
Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani [pdf] [supp] [arXiv] [bibtex]
KSM: Fast Multiple Task Adaption via Kernel-Wise Soft Mask Learning Li Yang, Zhezhi He, Junshan Zhang, Deliang Fan [pdf] [arXiv] [bibtex]
Rich Context Aggregation With Reflection Prior for Glass Surface Detection Jiaying Lin, Zebang He, Rynson W.H. Lau [pdf] [bibtex]
Coming Down to Earth: Satellite-to-Street View Synthesis for Geo-Localization Aysim Toker, Qunjie Zhou, Maxim Maximov, Laura Leal-Taixe [pdf] [supp] [arXiv] [bibtex]
AutoInt: Automatic Integration for Fast Neural Volume Rendering David B. Lindell, Julien N. P. Martel, Gordon Wetzstein [pdf] [supp] [arXiv] [bibtex]
Pose-Guided Human Animation From a Single Image in the Wild Jae Shin Yoon, Lingjie Liu, Vladislav Golyanik, Kripasindhu Sarkar, Hyun Soo Park, Christian Theobalt [pdf] [supp] [arXiv] [bibtex]
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression Chen Gao, Jinyu Chen, Si Liu, Luting Wang, Qiong Zhang, Qi Wu [pdf] [supp] [bibtex]
Equivariant Point Network for 3D Point Cloud Analysis Haiwei Chen, Shichen Liu, Weikai Chen, Hao Li, Randall Hill [pdf] [supp] [arXiv] [bibtex]
Learning Graph Embeddings for Compositional Zero-Shot Learning Muhammad Ferjad Naeem, Yongqin Xian, Federico Tombari, Zeynep Akata [pdf] [supp] [arXiv] [bibtex]
NeRD: Neural 3D Reflection Symmetry Detector Yichao Zhou, Shichen Liu, Yi Ma [pdf] [supp] [arXiv] [bibtex]
Checkerboard Context Model for Efficient Learned Image Compression Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin [pdf] [supp] [arXiv] [bibtex]
Zero-Shot Adversarial Quantization Yuang Liu, Wei Zhang, Jun Wang [pdf] [arXiv] [bibtex]
Group Whitening: Balancing Learning Efficiency and Representational Capacity Lei Huang, Yi Zhou, Li Liu, Fan Zhu, Ling Shao [pdf] [supp] [arXiv] [bibtex]
Adversarial Robustness Under Long-Tailed Distribution Tong Wu, Ziwei Liu, Qingqiu Huang, Yu Wang, Dahua Lin [pdf] [supp] [arXiv] [bibtex]
HyperSeg: Patch-Wise Hypernetwork for Real-Time Semantic Segmentation Yuval Nirkin, Lior Wolf, Tal Hassner [pdf] [supp] [arXiv] [bibtex]
Augmentation Strategies for Learning With Noisy Labels Kento Nishi, Yi Ding, Alex Rich, Tobias Hollerer [pdf] [supp] [arXiv] [bibtex]
AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching Xiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Zhe Wang, Jianping Shi [pdf] [supp] [arXiv] [bibtex]
ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic Xiangtao Kong, Hengyuan Zhao, Yu Qiao, Chao Dong [pdf] [arXiv] [bibtex]
Partition-Guided GANs Mohammadreza Armandpour, Ali Sadeghian, Chunyuan Li, Mingyuan Zhou [pdf] [supp] [arXiv] [bibtex]
GATSBI: Generative Agent-Centric Spatio-Temporal Object Interaction Cheol-Hui Min, Jinseok Bae, Junho Lee, Young Min Kim [pdf] [supp] [arXiv] [bibtex]
Privacy-Preserving Collaborative Learning With Automatic Transformation Search Wei Gao, Shangwei Guo, Tianwei Zhang, Han Qiu, Yonggang Wen, Yang Liu [pdf] [arXiv] [bibtex]
Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval Yawen Zeng, Da Cao, Xiaochi Wei, Meng Liu, Zhou Zhao, Zheng Qin [pdf] [bibtex]
Point Cloud Instance Segmentation Using Probabilistic Embeddings Biao Zhang, Peter Wonka [pdf] [supp] [arXiv] [bibtex]
pixelNeRF: Neural Radiance Fields From One or Few Images Alex Yu, Vickie Ye, Matthew Tancik, Angjoo Kanazawa [pdf] [supp] [arXiv] [bibtex]
Navigating the GAN Parameter Space for Semantic Image Editing Anton Cherepkov, Andrey Voynov, Artem Babenko [pdf] [supp] [arXiv] [bibtex]
Large-Capacity Image Steganography Based on Invertible Neural Networks Shao-Ping Lu, Rong Wang, Tao Zhong, Paul L. Rosin [pdf] [supp] [bibtex]
Exploiting Edge-Oriented Reasoning for 3D Point-Based Scene Graph Analysis Chaoyi Zhang, Jianhui Yu, Yang Song, Weidong Cai [pdf] [supp] [arXiv] [bibtex]
CoLA: Weakly-Supervised Temporal Action Localization With Snippet Contrastive Learning Can Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou [pdf] [supp] [arXiv] [bibtex]
MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition Shuang Li, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Feng Qiao, Xinjing Cheng [pdf] [arXiv] [bibtex]
Limitations of Post-Hoc Feature Alignment for Robustness Collin Burns, Jacob Steinhardt [pdf] [supp] [arXiv] [bibtex]
Every Annotation Counts: Multi-Label Deep Supervision for Medical Image Segmentation Simon Reiss, Constantin Seibold, Alexander Freytag, Erik Rodner, Rainer Stiefelhagen [pdf] [supp] [arXiv] [bibtex]
Roses Are Red, Violets Are Blue… but Should VQA Expect Them To? Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf [pdf] [supp] [arXiv] [bibtex]
FAPIS: A Few-Shot Anchor-Free Part-Based Instance Segmenter Khoi Nguyen, Sinisa Todorovic [pdf] [supp] [arXiv] [bibtex]
Disentangling Label Distribution for Long-Tailed Visual Recognition Youngkyu Hong, Seungju Han, Kwanghee Choi, Seokjun Seo, Beomsu Kim, Buru Chang [pdf] [supp] [arXiv] [bibtex]
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling Mateusz Malinowski, Dimitrios Vytiniotis, Grzegorz Swirszcz, Viorica Patraucean, Joao Carreira [pdf] [supp] [bibtex]
Learning a Non-Blind Deblurring Network for Night Blurry Images Liang Chen, Jiawei Zhang, Jinshan Pan, Songnan Lin, Faming Fang, Jimmy S. Ren [pdf] [supp] [bibtex]
Differentiable Diffusion for Dense Depth Estimation From Multi-View Images Numair Khan, Min H. Kim, James Tompkin [pdf] [supp] [bibtex]
Deep Compositional Metric Learning Wenzhao Zheng, Chengkun Wang, Jiwen Lu, Jie Zhou [pdf] [bibtex]
Representing Videos As Discriminative Sub-Graphs for Action Recognition Dong Li, Zhaofan Qiu, Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei [pdf] [bibtex]
AIFit: Automatic 3D Human-Interpretable Feedback Models for Fitness Training Mihai Fieraru, Mihai Zanfir, Silviu Cristian Pirlea, Vlad Olaru, Cristian Sminchisescu [pdf] [supp] [bibtex]
Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes Jiashun Wang, Huazhe Xu, Jingwei Xu, Sifei Liu, Xiaolong Wang [pdf] [supp] [arXiv] [bibtex]
How Well Do Self-Supervised Models Transfer? Linus Ericsson, Henry Gouk, Timothy M. Hospedales [pdf] [supp] [arXiv] [bibtex]
Understanding Object Dynamics for Interactive Image-to-Video Synthesis Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Bjorn Ommer [pdf] [supp] [bibtex]
Pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis Eric R. Chan, Marco Monteiro, Petr Kellnhofer, Jiajun Wu, Gordon Wetzstein [pdf] [supp] [bibtex]
Diverse Branch Block: Building a Convolution as an Inception-Like Unit Xiaohan Ding, Xiangyu Zhang, Jungong Han, Guiguang Ding [pdf] [arXiv] [bibtex]
Post-Hoc Uncertainty Calibration for Domain Drift Scenarios Christian Tomani, Sebastian Gruber, Muhammed Ebrar Erdem, Daniel Cremers, Florian Buettner [pdf] [supp] [arXiv] [bibtex]
Slimmable Compressive Autoencoders for Practical Neural Image Compression Fei Yang, Luis Herranz, Yongmei Cheng, Mikhail G. Mozerov [pdf] [supp] [arXiv] [bibtex]
Function4D: Real-Time Human Volumetric Capture From Very Sparse Consumer RGBD Sensors Tao Yu, Zerong Zheng, Kaiwen Guo, Pengpeng Liu, Qionghai Dai, Yebin Liu [pdf] [supp] [arXiv] [bibtex]
LAU-Net: Latitude Adaptive Upscaling Network for Omnidirectional Image Super-Resolution Xin Deng, Hao Wang, Mai Xu, Yichen Guo, Yuhang Song, Li Yang [pdf] [bibtex]
UP-DETR: Unsupervised Pre-Training for Object Detection With Transformers Zhigang Dai, Bolun Cai, Yugeng Lin, Junying Chen [pdf] [supp] [bibtex]
Self-Attention Based Text Knowledge Mining for Text Detection Qi Wan, Haoqin Ji, Linlin Shen [pdf] [supp] [bibtex]
Image De-Raining via Continual Learning Man Zhou, Jie Xiao, Yifan Chang, Xueyang Fu, Aiping Liu, Jinshan Pan, Zheng-Jun Zha [pdf] [bibtex]
Layer-Wise Searching for 1-Bit Detectors Sheng Xu, Junhe Zhao, Jinhu Lu, Baochang Zhang, Shumin Han, David Doermann [pdf] [bibtex]
Distilling Audio-Visual Knowledge by Compositional Contrastive Learning Yanbei Chen, Yongqin Xian, A. Sophia Koepke, Ying Shan, Zeynep Akata [pdf] [supp] [arXiv] [bibtex]
Unsupervised Visual Attention and Invariance for Reinforcement Learning Xudong Wang, Long Lian, Stella X. Yu [pdf] [supp] [arXiv] [bibtex]
CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement Noranart Vesdapunt, Baoyuan Wang [pdf] [supp] [arXiv] [bibtex]
Semantic Audio-Visual Navigation Changan Chen, Ziad Al-Halah, Kristen Grauman [pdf] [supp] [bibtex]
Humble Teachers Teach Better Students for Semi-Supervised Object Detection Yihe Tang, Weifeng Chen, Yijun Luo, Yuting Zhang [pdf] [supp] [bibtex]
One Shot Face Swapping on Megapixels Yuhao Zhu, Qi Li, Jian Wang, Cheng-Zhong Xu, Zhenan Sun [pdf] [supp] [arXiv] [bibtex]
CDFI: Compression-Driven Network Design for Frame Interpolation Tianyu Ding, Luming Liang, Zhihui Zhu, Ilya Zharkov [pdf] [arXiv] [bibtex]
PAConv: Position Adaptive Convolution With Dynamic Kernel Assembling on Point Clouds Mutian Xu, Runyu Ding, Hengshuang Zhao, Xiaojuan Qi [pdf] [supp] [arXiv] [bibtex]
End-to-End Object Detection With Fully Convolutional Network Jianfeng Wang, Lin Song, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng [pdf] [supp] [arXiv] [bibtex]
Efficient Initial Pose-Graph Generation for Global SfM Daniel Barath, Dmytro Mishkin, Ivan Eichhardt, Ilia Shipachev, Jiri Matas [pdf] [supp] [arXiv] [bibtex]
Representative Batch Normalization With Feature Calibration Shang-Hua Gao, Qi Han, Duo Li, Ming-Ming Cheng, Pai Peng [pdf] [bibtex]
VarifocalNet: An IoU-Aware Dense Object Detector Haoyang Zhang, Ying Wang, Feras Dayoub, Niko Sunderhauf [pdf] [arXiv] [bibtex]
Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation Youngmin Oh, Beomjun Kim, Bumsub Ham [pdf] [supp] [arXiv] [bibtex]
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution Chi Zhang, Baoxiong Jia, Song-Chun Zhu, Yixin Zhu [pdf] [supp] [arXiv] [bibtex]
Reducing Domain Gap by Reducing Style Bias Hyeonseob Nam, HyunJae Lee, Jongchan Park, Wonjun Yoon, Donggeun Yoo [pdf] [arXiv] [bibtex]
Efficient Regional Memory Network for Video Object Segmentation Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun [pdf] [arXiv] [bibtex]
Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-Localization in Large Scenes From Body-Mounted Sensors Vladimir Guzov, Aymen Mir, Torsten Sattler, Gerard Pons-Moll [pdf] [supp] [arXiv] [bibtex]
Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection Chenchen Zhu, Fangyi Chen, Uzair Ahmed, Zhiqiang Shen, Marios Savvides [pdf] [supp] [arXiv] [bibtex]
Online Multiple Object Tracking With Cross-Task Synergy Song Guo, Jingya Wang, Xinchao Wang, Dacheng Tao [pdf] [arXiv] [bibtex]
Discovering Relationships Between Object Categories via Universal Canonical Maps Natalia Neverova, Artsiom Sanakoyeu, Patrick Labatut, David Novotny, Andrea Vedaldi [pdf] [supp] [bibtex]
Prior Based Human Completion Zibo Zhao, Wen Liu, Yanyu Xu, Xianing Chen, Weixin Luo, Lei Jin, Bohui Zhu, Tong Liu, Binqiang Zhao, Shenghua Gao [pdf] [supp] [bibtex]
Neural Response Interpretation Through the Lens of Critical Pathways Ashkan Khakzar, Soroosh Baselizadeh, Saurabh Khanduja, Christian Rupprecht, Seong Tae Kim, Nassir Navab [pdf] [supp] [arXiv] [bibtex]
Rethinking and Improving the Robustness of Image Style Transfer Pei Wang, Yijun Li, Nuno Vasconcelos [pdf] [supp] [arXiv] [bibtex]
FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang [pdf] [supp] [arXiv] [bibtex]
Cross-Domain Similarity Learning for Face Recognition in Unseen Domains Masoud Faraki, Xiang Yu, Yi-Hsuan Tsai, Yumin Suh, Manmohan Chandraker [pdf] [arXiv] [bibtex]
Learning 3D Shape Feature for Texture-Insensitive Person Re-Identification Jiaxing Chen, Xinyang Jiang, Fudong Wang, Jun Zhang, Feng Zheng, Xing Sun, Wei-Shi Zheng [pdf] [supp] [bibtex]
Virtual Fully-Connected Layer: Training a Large-Scale Face Recognition Dataset With Limited Computational Resources Pengyu Li, Biao Wang, Lei Zhang [pdf] [supp] [bibtex]
Multi-Person Implicit Reconstruction From a Single Image Armin Mustafa, Akin Caliskan, Lourdes Agapito, Adrian Hilton [pdf] [arXiv] [bibtex]
OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection Tingting Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling [pdf] [arXiv] [bibtex]
Bridge To Answer: Structure-Aware Graph Interaction Network for Video Question Answering Jungin Park, Jiyoung Lee, Kwanghoon Sohn [pdf] [arXiv] [bibtex]
Learning Compositional Radiance Fields of Dynamic Human Heads Ziyan Wang, Timur Bagautdinov, Stephen Lombardi, Tomas Simon, Jason Saragih, Jessica Hodgins, Michael Zollhofer [pdf] [supp] [arXiv] [bibtex]
Partial Person Re-Identification With Part-Part Correspondence Learning Tianyu He, Xu Shen, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua [pdf] [supp] [bibtex]
Monte Carlo Scene Search for 3D Scene Understanding Shreyas Hampali, Sinisa Stekovic, Sayan Deb Sarkar, Chetan S. Kumar, Friedrich Fraundorfer, Vincent Lepetit [pdf] [supp] [arXiv] [bibtex]
Coarse-To-Fine Person Re-Identification With Auxiliary-Domain Classification and Second-Order Information Bottleneck Anguo Zhang, Yueming Gao, Yuzhen Niu, Wenxi Liu, Yongcheng Zhou [pdf] [supp] [bibtex]
Transformer Tracking Xin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu [pdf] [arXiv] [bibtex]
Structured Multi-Level Interaction Network for Video Moment Localization via Language Query Hao Wang, Zheng-Jun Zha, Liang Li, Dong Liu, Jiebo Luo [pdf] [bibtex]
Structured Scene Memory for Vision-Language Navigation Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen [pdf] [arXiv] [bibtex]
Unsupervised Pre-Training for Person Re-Identification Dengpan Fu, Dongdong Chen, Jianmin Bao, Hao Yang, Lu Yuan, Lei Zhang, Houqiang Li, Dong Chen [pdf] [supp] [arXiv] [bibtex]
Progressive Stage-Wise Learning for Unsupervised Feature Representation Enhancement Zefan Li, Chenxi Liu, Alan Yuille, Bingbing Ni, Wenjun Zhang, Wen Gao [pdf] [bibtex]
Domain-Specific Suppression for Adaptive Object Detection Yu Wang, Rui Zhang, Shuo Zhang, Miao Li, Yangyang Xia, Xishan Zhang, Shaoli Liu [pdf] [arXiv] [bibtex]
Few-Shot Object Detection via Classification Refinement and Distractor Retreatment Yiting Li, Haiyue Zhu, Yu Cheng, Wenxin Wang, Chek Sing Teo, Cheng Xiang, Prahlad Vadakkepat, Tong Heng Lee [pdf] [supp] [bibtex]
D2IM-Net: Learning Detail Disentangled Implicit Fields From Single Images Manyi Li, Hao Zhang [pdf] [bibtex]
Not Just Compete, but Collaborate: Local Image-to-Image Translation via Cooperative Mask Prediction Daejin Kim, Mohammad Azam Khan, Jaegul Choo [pdf] [bibtex]
Behavior-Driven Synthesis of Human Dynamics Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Bjorn Ommer [pdf] [supp] [arXiv] [bibtex]
GAIA: A Transfer Learning System of Object Detection That Fits Your Needs Xingyuan Bu, Junran Peng, Junjie Yan, Tieniu Tan, Zhaoxiang Zhang [pdf] [supp] [bibtex]
IronMask: Modular Architecture for Protecting Deep Face Template Sunpill Kim, Yunseong Jeong, Jinsu Kim, Jungkon Kim, Hyung Tae Lee, Jae Hong Seo [pdf] [supp] [arXiv] [bibtex]
Learning To Recommend Frame for Interactive Video Object Segmentation in the Wild Zhaoyuan Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanling Zhang, Shenghua Gao [pdf] [supp] [arXiv] [bibtex]
DSRNA: Differentiable Search of Robust Neural Architectures Ramtin Hosseini, Xingyi Yang, Pengtao Xie [pdf] [arXiv] [bibtex]
Reconstructing 3D Human Pose by Watching Humans in the Mirror Qi Fang, Qing Shuai, Junting Dong, Hujun Bao, Xiaowei Zhou [pdf] [arXiv] [bibtex]
Spk2ImgNet: Learning To Reconstruct Dynamic Scene From Continuous Spike Stream Jing Zhao, Ruiqin Xiong, Hangfan Liu, Jian Zhang, Tiejun Huang [pdf] [supp] [bibtex]
MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation Hansheng Chen, Yuyao Huang, Wei Tian, Zhong Gao, Lu Xiong [pdf] [supp] [arXiv] [bibtex]
Complete & Label: A Domain Adaptation Approach to Semantic Segmentation of LiDAR Point Clouds Li Yi, Boqing Gong, Thomas Funkhouser [pdf] [supp] [arXiv] [bibtex]
GMOT-40: A Benchmark for Generic Multiple Object Tracking Hexin Bai, Wensheng Cheng, Peng Chu, Juehuan Liu, Kai Zhang, Haibin Ling [pdf] [supp] [bibtex]
Few-Shot Image Generation via Cross-Domain Correspondence Utkarsh Ojha, Yijun Li, Jingwan Lu, Alexei A. Efros, Yong Jae Lee, Eli Shechtman, Richard Zhang [pdf] [supp] [arXiv] [bibtex]
Hierarchical Lovasz Embeddings for Proposal-Free Panoptic Segmentation Tommi Kerola, Jie Li, Atsushi Kanehira, Yasunori Kudo, Alexis Vallet, Adrien Gaidon [pdf] [supp] [bibtex]
Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans Sida Peng, Yuanqing Zhang, Yinghao Xu, Qianqian Wang, Qing Shuai, Hujun Bao, Xiaowei Zhou [pdf] [arXiv] [bibtex]
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting Lingbo Liu, Jiaqi Chen, Hefeng Wu, Guanbin Li, Chenglong Li, Liang Lin [pdf] [supp] [arXiv] [bibtex]
Weakly Supervised Video Salient Object Detection Wangbo Zhao, Jing Zhang, Long Li, Nick Barnes, Nian Liu, Junwei Han [pdf] [supp] [arXiv] [bibtex]
Pixel-Wise Anomaly Detection in Complex Driving Scenes Giancarlo Di Biase, Hermann Blum, Roland Siegwart, Cesar Cadena [pdf] [supp] [arXiv] [bibtex]
Learning To Associate Every Segment for Video Panoptic Segmentation Sanghyun Woo, Dahun Kim, Joon-Young Lee, In So Kweon [pdf] [bibtex]
Variational Transformer Networks for Layout Generation Diego Martin Arroyo, Janis Postels, Federico Tombari [pdf] [supp] [arXiv] [bibtex]
Mitigating Face Recognition Bias via Group Adaptive Classifier Sixue Gong, Xiaoming Liu, Anil K. Jain [pdf] [supp] [arXiv] [bibtex]
A Peek Into the Reasoning of Neural Networks: Interpreting With Structural Visual Concepts Yunhao Ge, Yao Xiao, Zhi Xu, Meng Zheng, Srikrishna Karanam, Terrence Chen, Laurent Itti, Ziyan Wu [pdf] [supp] [arXiv] [bibtex]
Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations Zhihui Li, Lina Yao [pdf] [bibtex]
A Dual Iterative Refinement Method for Non-Rigid Shape Matching Rui Xiang, Rongjie Lai, Hongkai Zhao [pdf] [supp] [arXiv] [bibtex]
Image Super-Resolution With Non-Local Sparse Attention Yiqun Mei, Yuchen Fan, Yuqian Zhou [pdf] [supp] [bibtex]
3D Video Stabilization With Depth Estimation by CNN-Based Optimization Yao-Chih Lee, Kuan-Wei Tseng, Yu-Ta Chen, Chien-Cheng Chen, Chu-Song Chen, Yi-Ping Hung [pdf] [supp] [bibtex]
Predicting Human Scanpaths in Visual Question Answering Xianyu Chen, Ming Jiang, Qi Zhao [pdf] [supp] [bibtex]
DetectoRS: Detecting Objects With Recursive Feature Pyramid and Switchable Atrous Convolution Siyuan Qiao, Liang-Chieh Chen, Alan Yuille [pdf] [arXiv] [bibtex]
SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks Shunsuke Saito, Jinlong Yang, Qianli Ma, Michael J. Black [pdf] [supp] [arXiv] [bibtex]
Improving Accuracy of Binary Neural Networks Using Unbalanced Activation Distribution Hyungjun Kim, Jihoon Park, Changhun Lee, Jae-Joon Kim [pdf] [supp] [arXiv] [bibtex]
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation Xinge Zhu, Hui Zhou, Tai Wang, Fangzhou Hong, Yuexin Ma, Wei Li, Hongsheng Li, Dahua Lin [pdf] [arXiv] [bibtex]
SMPLicit: Topology-Aware Generative Model for Clothed People Enric Corona, Albert Pumarola, Guillem Alenya, Gerard Pons-Moll, Francesc Moreno-Noguer [pdf] [supp] [arXiv] [bibtex]
Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization Long Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris Metaxas, Ting Liu [pdf] [supp] [arXiv] [bibtex]
Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation Yazhou Yao, Tao Chen, Guo-Sen Xie, Chuanyi Zhang, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang [pdf] [arXiv] [bibtex]
DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation Xing Shen, Jirui Yang, Chunbo Wei, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Xiaoliang Cheng, Kewei Liang [pdf] [bibtex]
Bridging the Visual Gap: Wide-Range Image Blending Chia-Ni Lu, Ya-Chu Chang, Wei-Chen Chiu [pdf] [supp] [arXiv] [bibtex]