ICCV 2017 accepted papers

Produced by Zhuanzhi (专知)

ICCV 2017 papers on the web

Original link: http://openaccess.thecvf.com/ICCV2017.py

Papers

Globally-Optimal Inlier Set Maximisation for Simultaneous Camera Pose and Feature Correspondence: Dylan Campbell, Lars Petersson, Laurent Kneip, Hongdong Li; [ pdf] [ supp]
Robust Pseudo Random Fields for Light-Field Stereo Matching: Chao-Tsung Huang; [ pdf]
A Lightweight Approach for On-The-Fly Reflectance Estimation: Kihwan Kim, Jinwei Gu, Stephen Tyree, Pavlo Molchanov, Matthias Niessner, Jan Kautz; [ pdf] [ supp]
Distributed Very Large Scale Bundle Adjustment by Global Camera Consensus: Runze Zhang, Siyu Zhu, Tian Fang, Long Quan; [ pdf] [ supp]
Practical Projective Structure From Motion (P2SfM): Ludovic Magerand, Alessio Del Bue; [ pdf] [ supp]
Anticipating Daily Intention Using On-Wrist Motion Triggered Sensing: Tz-Ying Wu, Ting-An Chien, Cheng-Sheng Chan, Chan-Wei Hu, Min Sun; [ pdf] [ supp]
Rethinking Reprojection: Closing the Loop for Pose-Aware Shape Reconstruction From a Single Image: Rui Zhu, Hamed Kiani Galoogahi, Chaoyang Wang, Simon Lucey; [ pdf] [ supp]
End-To-End Learning of Geometry and Context for Deep Stereo Regression: Alex Kendall, Hayk Martirosyan, Saumitro Dasgupta, Peter Henry, Ryan Kennedy, Abraham Bachrach, Adam Bry; [ pdf]
Using Sparse Elimination for Solving Minimal Problems in Computer Vision: Janne Heikkila; [ pdf]
High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference: Xiaoguang Han, Zhen Li, Haibin Huang, Evangelos Kalogerakis, Yizhou Yu; [ pdf] [ supp] [ arXiv]
Temporal Tessellation: A Unified Approach for Video Analysis: Dotan Kaufman, Gil Levi, Tal Hassner, Lior Wolf; [ pdf] [ arXiv]
Learning Policies for Adaptive Tracking With Deep Feature Cascades: Chen Huang, Simon Lucey, Deva Ramanan; [ pdf] [ supp] [ arXiv]
Temporal Shape Super-Resolution by Intra-Frame Motion Encoding Using High-Fps Structured Light: Yuki Shiba, Satoshi Ono, Ryo Furukawa, Shinsaku Hiura, Hiroshi Kawasaki; [ pdf] [ supp]
Real-Time Monocular Pose Estimation of 3D Objects Using Temporally Consistent Local Color Histograms: Henning Tjaden, Ulrich Schwanecke, Elmar Schomer; [ pdf] [ supp]
CAD Priors for Accurate and Flexible Instance Reconstruction: Tolga Birdal, Slobodan Ilic; [ pdf] [ arXiv]
Colored Point Cloud Registration Revisited: Jaesik Park, Qian-Yi Zhou, Vladlen Koltun; [ pdf]
Learning Compact Geometric Features: Marc Khoury, Qian-Yi Zhou, Vladlen Koltun; [ pdf] [ arXiv]
Joint Layout Estimation and Global Multi-View Registration for Indoor Reconstruction: Jeong-Kyun Lee, Jaewon Yea, Min-Gyu Park, Kuk-Jin Yoon; [ pdf] [ supp] [ arXiv]
A Geometric Framework for Statistical Analysis of Trajectories With Distinct Temporal Spans: Rudrasis Chakraborty, Vikas Singh, Nagesh Adluru, Baba C. Vemuri; [ pdf]
An Optimal Transportation Based Univariate Neuroimaging Index: Liang Mi, Wen Zhang, Junwei Zhang, Yonghui Fan, Dhruman Goradia, Kewei Chen, Eric M. Reiman, Xianfeng Gu, Yalin Wang; [ pdf]
S3FD: Single Shot Scale-Invariant Face Detector: Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li; [ pdf] [ supp]
Amulet: Aggregating Multi-Level Convolutional Features for Salient Object Detection: Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, Xiang Ruan; [ pdf] [ supp] [ arXiv]
Learning Uncertain Convolutional Features for Accurate Saliency Detection: Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, Baocai Yin; [ pdf] [ supp] [ arXiv]
Zero-Order Reverse Filtering: Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, Jiaya Jia; [ pdf] [ arXiv]
Learning Blind Motion Deblurring: Patrick Wieschollek, Michael Hirsch, Bernhard Scholkopf, Hendrik P. A. Lensch; [ pdf] [ supp] [ arXiv]
Joint Adaptive Sparsity and Low-Rankness on the Fly: An Online Tensor Reconstruction Scheme for Video Denoising: Bihan Wen, Yanjun Li, Luke Pfister, Yoram Bresler; [ pdf] [ supp]
Learning to Super-Resolve Blurry Face and Text Images: Xiangyu Xu, Deqing Sun, Jinshan Pan, Yujin Zhang, Hanspeter Pfister, Ming-Hsuan Yang; [ pdf]
Video Frame Interpolation via Adaptive Separable Convolution: Simon Niklaus, Long Mai, Feng Liu; [ pdf] [ arXiv]
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection: Pierre Baque, Francois Fleuret, Pascal Fua; [ pdf] [ supp] [ arXiv]
Encouraging LSTMs to Anticipate Actions Very Early: Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Basura Fernando, Lars Petersson, Lars Andersson; [ pdf] [ supp] [ arXiv]
PathTrack: Fast Trajectory Annotation With Path Supervision: Santiago Manen, Michael Gygli, Dengxin Dai, Luc Van Gool; [ pdf] [ supp] [ arXiv]
Tracking the Untrackable: Learning to Track Multiple Cues With Long-Term Dependencies: Amir Sadeghian, Alexandre Alahi, Silvio Savarese; [ pdf] [ arXiv]
MirrorFlow: Exploiting Symmetries in Joint Optical Flow and Occlusion Estimation: Junhwa Hur, Stefan Roth; [ pdf] [ supp] [ arXiv]
Tracking as Online Decision-Making: Learning a Policy From Streaming Videos With Reinforcement Learning: James Supancic, III, Deva Ramanan; [ pdf] [ supp]
Non-Convex Rank/Sparsity Regularization and Local Minima: Carl Olsson, Marcus Carlsson, Fredrik Andersson, Viktor Larsson; [ pdf] [ supp] [ arXiv]
A Revisit of Sparse Coding Based Anomaly Detection in Stacked RNN Framework: Weixin Luo, Wen Liu, Shenghua Gao; [ pdf]
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis: Xihui Liu, Haiyu Zhao, Maoqing Tian, Lu Sheng, Jing Shao, Shuai Yi, Junjie Yan, Xiaogang Wang; [ pdf]
No Fuss Distance Metric Learning Using Proxies: Yair Movshovitz-Attias, Alexander Toshev, Thomas K. Leung, Sergey Ioffe, Saurabh Singh; [ pdf] [ arXiv]
Benchmarking and Error Diagnosis in Multi-Instance Pose Estimation: Matteo Ruggero Ronchi, Pietro Perona; [ pdf] [ supp] [ arXiv]
Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-Identification: Zhongdao Wang, Luming Tang, Xihui Liu, Zhuliang Yao, Shuai Yi, Jing Shao, Junjie Yan, Shengjin Wang, Hongsheng Li, Xiaogang Wang; [ pdf]
Fashion Forward: Forecasting Visual Style in Fashion: Ziad Al-Halah, Rainer Stiefelhagen, Kristen Grauman; [ pdf] [ supp] [ arXiv]
Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach: Xingyi Zhou, Qixing Huang, Xiao Sun, Xiangyang Xue, Yichen Wei; [ pdf] [ supp] [ arXiv]
Flow-Guided Feature Aggregation for Video Object Detection: Xizhou Zhu, Yujie Wang, Jifeng Dai, Lu Yuan, Yichen Wei; [ pdf] [ arXiv]
Reasoning About Fine-Grained Attribute Phrases Using Reference Games: Jong-Chyi Su, Chenyun Wu, Huaizu Jiang, Subhransu Maji; [ pdf] [ supp] [ arXiv]
DeNet: Scalable Real-Time Object Detection With Directed Sparse Sampling: Lachlan Tychsen-Smith, Lars Petersson; [ pdf] [ supp] [ arXiv]
MIHash: Online Hashing With Mutual Information: Fatih Cakir, Kun He, Sarah Adel Bargal, Stan Sclaroff; [ pdf] [ supp] [ arXiv]
SafetyNet: Detecting and Rejecting Adversarial Examples Robustly: Jiajun Lu, Theerasit Issaranon, David Forsyth; [ pdf] [ supp] [ arXiv]
Recurrent Models for Situation Recognition: Arun Mallya, Svetlana Lazebnik; [ pdf] [ arXiv]
Multi-Label Image Recognition by Recurrently Discovering Attentional Regions: Zhouxia Wang, Tianshui Chen, Guanbin Li, Ruijia Xu, Liang Lin; [ pdf]
Deep Determinantal Point Process for Large-Scale Multi-Label Classification: Pengtao Xie, Ruslan Salakhutdinov, Luntian Mou, Eric P. Xing; [ pdf]
Visual Semantic Planning Using Deep Successor Representations: Yuke Zhu, Daniel Gordon, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi; [ pdf] [ arXiv]
Neural Person Search Machines: Hao Liu, Jiashi Feng, Zequn Jie, Karlekar Jayashree, Bo Zhao, Meibin Qi, Jianguo Jiang, Shuicheng Yan; [ pdf] [ arXiv]
DualNet: Learn Complementary Features for Image Recognition: Saihui Hou, Xu Liu, Zilei Wang; [ pdf] [ supp]
Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization: Sijia Cai, Wangmeng Zuo, Lei Zhang; [ pdf]
Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner: Tseng-Hung Chen, Yuan-Hong Liao, Ching-Yao Chuang, Wan-Ting Hsu, Jianlong Fu, Min Sun; [ pdf] [ supp] [ arXiv]
Attribute Recognition by Joint Recurrent Learning of Context and Correlation: Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li; [ pdf] [ arXiv]
VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization: Saihui Hou, Yushan Feng, Zilei Wang; [ pdf] [ supp]
Increasing CNN Robustness to Occlusions by Reducing Filter Support: Elad Osherov, Michael Lindenbaum; [ pdf]
Exploiting Multi-Grain Ranking Constraints for Precisely Searching Visually-Similar Vehicles: Ke Yan, Yonghong Tian, Yaowei Wang, Wei Zeng, Tiejun Huang; [ pdf]
Recurrent Scale Approximation for Object Detection in CNN: Yu Liu, Hongyang Li, Junjie Yan, Fangyin Wei, Xiaogang Wang, Xiaoou Tang; [ pdf] [ arXiv]
Embedding 3D Geometric Features for Rigid Object Part Segmentation: Yafei Song, Xiaowu Chen, Jia Li, Qinping Zhao; [ pdf]
Towards Context-Aware Interaction Recognition for Visual Relationship Detection: Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid; [ pdf]
When Unsupervised Domain Adaptation Meets Tensor Representations: Hao Lu, Lei Zhang, Zhiguo Cao, Wei Wei, Ke Xian, Chunhua Shen, Anton van den Hengel; [ pdf] [ supp] [ arXiv]
Look, Listen and Learn: Relja Arandjelovic, Andrew Zisserman; [ pdf] [ arXiv]
Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization: Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra; [ pdf] [ supp]
Image-Based Localization Using LSTMs for Structured Feature Correlation: Florian Walch, Caner Hazirbas, Laura Leal-Taixe, Torsten Sattler, Sebastian Hilsenbeck, Daniel Cremers; [ pdf] [ supp] [ arXiv]
Personalized Image Aesthetics: Jian Ren, Xiaohui Shen, Zhe Lin, Radomir Mech, David J. Foran; [ pdf] [ supp]
Predicting Deeper Into the Future of Semantic Segmentation: Pauline Luc, Natalia Neverova, Camille Couprie, Jakob Verbeek, Yann LeCun; [ pdf] [ supp] [ arXiv]
Coordinating Filters for Faster Deep Neural Networks: Wei Wen, Cong Xu, Chunpeng Wu, Yandan Wang, Yiran Chen, Hai Li; [ pdf] [ arXiv]
Unsupervised Representation Learning by Sorting Sequences: Hsin-Ying Lee, Jia-Bin Huang, Maneesh Singh, Ming-Hsuan Yang; [ pdf] [ arXiv]
A Read-Write Memory Network for Movie Story Understanding: Seil Na, Sangho Lee, Jisung Kim, Gunhee Kim; [ pdf] [ supp]
SegFlow: Joint Learning for Video Object Segmentation and Optical Flow: Jingchun Cheng, Yi-Hsuan Tsai, Shengjin Wang, Ming-Hsuan Yang; [ pdf] [ supp] [ arXiv]
Unsupervised Action Discovery and Localization in Videos: Khurram Soomro, Mubarak Shah; [ pdf]
Dense-Captioning Events in Videos: Ranjay Krishna, Kenji Hata, Frederic Ren, Li Fei-Fei, Juan Carlos Niebles; [ pdf] [ supp] [ arXiv]
Learning Long-Term Dependencies for Action Recognition With a Biologically-Inspired Deep Network: Yemin Shi, Yonghong Tian, Yaowei Wang, Wei Zeng, Tiejun Huang; [ pdf] [ arXiv]
Compressive Quantization for Fast Object Instance Search in Videos: Tan Yu, Zhenzhen Wang, Junsong Yuan; [ pdf]
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos: Hehe Fan, Xiaojun Chang, De Cheng, Yi Yang, Dong Xu, Alexander G. Hauptmann; [ pdf]
Deep Direct Regression for Multi-Oriented Scene Text Detection: Wenhao He, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu; [ pdf] [ arXiv]
Open Set Domain Adaptation: Pau Panareda Busto, Juergen Gall; [ pdf] [ supp]
Deformable Convolutional Networks: Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei; [ pdf] [ arXiv]
Ensemble Diffusion for Retrieval: Song Bai, Zhichao Zhou, Jingdong Wang, Xiang Bai, Longin Jan Latecki, Qi Tian; [ pdf] [ supp]
FoveaNet: Perspective-Aware Urban Scene Parsing: Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng; [ pdf] [ supp] [ arXiv]
Beyond Planar Symmetry: Modeling Human Perception of Reflection and Rotation Symmetries in the Wild: Christopher Funk, Yanxi Liu; [ pdf] [ arXiv]
Learning to Reason: End-To-End Module Networks for Visual Question Answering: Ronghang Hu, Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Kate Saenko; [ pdf] [ supp]
Hard-Aware Deeply Cascaded Embedding: Yuhui Yuan, Kuiyuan Yang, Chao Zhang; [ pdf] [ arXiv]
Query-Guided Regression Network With Context Policy for Phrase Grounding: Kan Chen, Rama Kovvuri, Ram Nevatia; [ pdf] [ arXiv]
SUBIC: A Supervised, Structured Binary Code for Image Search: Himalaya Jain, Joaquin Zepeda, Patrick Perez, Remi Gribonval; [ pdf] [ arXiv]
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era: Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta; [ pdf] [ supp] [ arXiv]
A Generative Model of People in Clothing: Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler; [ pdf] [ supp] [ arXiv]
Escape From Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models: Roman Klokov, Victor Lempitsky; [ pdf] [ arXiv]
Improved Image Captioning via Policy Gradient Optimization of SPIDEr: Siqi Liu, Zhenhai Zhu, Ning Ye, Sergio Guadarrama, Kevin Murphy; [ pdf] [ arXiv]
Rolling Shutter Correction in Manhattan World: Pulak Purkait, Christopher Zach, Ales Leonardis; [ pdf] [ supp]
Local-To-Global Point Cloud Registration Using a Dictionary of Viewpoint Descriptors: David Avidar, David Malah, Meir Barzohar; [ pdf]
3D-PRNN: Generating Shape Primitives With Recurrent Neural Networks: Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem; [ pdf]
BodyFusion: Real-Time Capture of Human Motion and Surface Geometry Using a Single Depth Camera: Tao Yu, Kaiwen Guo, Feng Xu, Yuan Dong, Zhaoqi Su, Jianhui Zhao, Jianguo Li, Qionghai Dai, Yebin Liu; [ pdf]
Quasiconvex Plane Sweep for Triangulation With Outliers: Qianggong Zhang, Tat-Jun Chin, David Suter; [ pdf] [ supp]
"Maximizing Rigidity" Revisited: A Convex Programming Approach for Generic 3D Shape Reconstruction From Multiple Perspective Views: Pan Ji, Hongdong Li, Yuchao Dai, Ian Reid; [ pdf] [ arXiv]
Surface Registration via Foliation: Xiaopeng Zheng, Chengfeng Wen, Na Lei, Ming Ma, Xianfeng Gu; [ pdf]
Rolling-Shutter-Aware Differential SfM and Image Rectification: Bingbing Zhuang, Loong-Fah Cheong, Gim Hee Lee; [ pdf] [ supp]
Corner-Based Geometric Calibration of Multi-Focus Plenoptic Cameras: Sotiris Nousias, Francois Chadebecq, Jonas Pichat, Pearse Keane, Sebastien Ourselin, Christos Bergeles; [ pdf]
Focal Track: Depth and Accommodation With Oscillating Lens Deformation: Qi Guo, Emma Alexander, Todd Zickler; [ pdf]
Reconfiguring the Imaging Pipeline for Computer Vision: Mark Buckler, Suren Jayasuriya, Adrian Sampson; [ pdf] [ supp] [ arXiv]
Catadioptric HyperSpectral Light Field Imaging: Yujia Xue, Kang Zhu, Qiang Fu, Xilin Chen, Jingyi Yu; [ pdf] [ supp]
Cross-View Asymmetric Metric Learning for Unsupervised Person Re-Identification: Hong-Xing Yu, Ancong Wu, Wei-Shi Zheng; [ pdf] [ supp] [ arXiv]
Real Time Eye Gaze Tracking With 3D Deformable Eye-Face Model: Kang Wang, Qiang Ji; [ pdf]
Ensemble Deep Learning for Skeleton-Based Action Recognition Using Temporal Sliding LSTM Networks: Inwoong Lee, Doyoung Kim, Seoungyoon Kang, Sanghoon Lee; [ pdf]
How Far Are We From Solving the 2D & 3D Face Alignment Problem? (And a Dataset of 230,000 3D Facial Landmarks): Adrian Bulat, Georgios Tzimiropoulos; [ pdf] [ supp]
Large Pose 3D Face Reconstruction From a Single Image via Direct Volumetric CNN Regression: Aaron S. Jackson, Adrian Bulat, Vasileios Argyriou, Georgios Tzimiropoulos; [ pdf] [ supp] [ arXiv]
RankIQA: Learning From Rankings for No-Reference Image Quality Assessment: Xialei Liu, Joost van de Weijer, Andrew D. Bagdanov; [ pdf] [ supp] [ arXiv]
Look, Perceive and Segment: Finding the Salient Objects in Images via Two-Stream Fixation-Semantic CNNs: Xiaowu Chen, Anlin Zheng, Jia Li, Feng Lu; [ pdf]
Delving Into Salient Object Subitizing and Detection: Shengfeng He, Jianbo Jiao, Xiaodan Zhang, Guoqiang Han, Rynson W.H. Lau; [ pdf]
Learning Discriminative Data Fitting Functions for Blind Image Deblurring: Jinshan Pan, Jiangxin Dong, Yu-Wing Tai, Zhixun Su, Ming-Hsuan Yang; [ pdf] [ supp]
Video Deblurring via Semantic Segmentation and Pixel-Wise Non-Linear Kernel: Wenqi Ren, Jinshan Pan, Xiaochun Cao, Ming-Hsuan Yang; [ pdf] [ arXiv]
On-Demand Learning for Deep Image Restoration: Ruohan Gao, Kristen Grauman; [ pdf]
Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising: Jun Xu, Lei Zhang, David Zhang, Xiangchu Feng; [ pdf] [ supp] [ arXiv]
Coherent Online Video Style Transfer: Dongdong Chen, Jing Liao, Lu Yuan, Nenghai Yu, Gang Hua; [ pdf] [ arXiv]
SHaPE: A Novel Graph Theoretic Algorithm for Making Consensus-Based Decisions in Person Re-Identification Systems: Arko Barman, Shishir K. Shah; [ pdf] [ supp]
Need for Speed: A Benchmark for Higher Frame Rate Object Tracking: Hamed Kiani Galoogahi, Ashton Fagg, Chen Huang, Deva Ramanan, Simon Lucey; [ pdf] [ supp] [ arXiv]
Learning Background-Aware Correlation Filters for Visual Tracking: Hamed Kiani Galoogahi, Ashton Fagg, Simon Lucey; [ pdf] [ supp] [ arXiv]
Robust Object Tracking Based on Temporal and Spatial Deep Networks: Zhu Teng, Junliang Xing, Qiang Wang, Congyan Lang, Songhe Feng, Yi Jin; [ pdf]
Real-Time Hand Tracking Under Occlusion From an Egocentric RGB-D Sensor: Franziska Mueller, Dushyant Mehta, Oleksandr Sotnychenko, Srinath Sridhar, Dan Casas, Christian Theobalt; [ pdf] [ supp]
Predicting Human Activities Using Stochastic Grammar: Siyuan Qi, Siyuan Huang, Ping Wei, Song-Chun Zhu; [ pdf] [ arXiv]
ProbFlow: Joint Optical Flow and Uncertainty Estimation: Anne S. Wannenwetsch, Margret Keuper, Stefan Roth; [ pdf] [ supp] [ arXiv]
Sublabel-Accurate Discretization of Nonconvex Free-Discontinuity Problems: Thomas Mollenhoff, Daniel Cremers; [ pdf] [ supp] [ arXiv]
DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding: Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, Jianxiong Xiao; [ pdf] [ supp]
BAM! The Behance Artistic Media Dataset for Recognition Beyond Photography: Michael J. Wilber, Chen Fang, Hailin Jin, Aaron Hertzmann, John Collomosse, Serge Belongie; [ pdf] [ arXiv]
Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation: Yu Chen, Chunhua Shen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang; [ pdf] [ arXiv]
An Empirical Study of Language CNN for Image Captioning: Jiuxiang Gu, Gang Wang, Jianfei Cai, Tsuhan Chen; [ pdf] [ arXiv]
Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning: Berkan Demirel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis; [ pdf] [ arXiv]
Areas of Attention for Image Captioning: Marco Pedersoli, Thomas Lucas, Cordelia Schmid, Jakob Verbeek; [ pdf] [ supp] [ arXiv]
Generative Modeling of Audible Shapes for Object Perception: Zhoutong Zhang, Jiajun Wu, Qiujia Li, Zhengjia Huang, James Traer, Josh H. McDermott, Joshua B. Tenenbaum, William T. Freeman; [ pdf]
Scene Graph Generation From Objects, Phrases and Region Captions: Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, Xiaogang Wang; [ pdf] [ arXiv]
Recurrent Multimodal Interaction for Referring Image Segmentation: Chenxi Liu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Alan Yuille; [ pdf] [ supp] [ arXiv]
Learning Feature Pyramids for Human Pose Estimation: Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, Xiaogang Wang; [ pdf] [ supp] [ arXiv]
Structured Attentions for Visual Question Answering: Chen Zhu, Yanpeng Zhao, Shuaiyi Huang, Kewei Tu, Yi Ma; [ pdf] [ arXiv]
Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection: Debidatta Dwibedi, Ishan Misra, Martial Hebert; [ pdf] [ arXiv]
Cascaded Feature Network for Semantic Segmentation of RGB-D Images: Di Lin, Guangyong Chen, Daniel Cohen-Or, Pheng-Ann Heng, Hui Huang; [ pdf]
Encoder Based Lifelong Learning: Amal Rannen, Rahaf Aljundi, Matthew B. Blaschko, Tinne Tuytelaars; [ pdf] [ supp] [ arXiv]
Transitive Invariance for Self-Supervised Visual Representation Learning: Xiaolong Wang, Kaiming He, Abhinav Gupta; [ pdf] [ arXiv]
Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction: Stepan Tulyakov, Anton Ivanov, Francois Fleuret; [ pdf]
Fine-Grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach: Timnit Gebru, Judy Hoffman, Li Fei-Fei; [ pdf] [ arXiv]
SORT: Second-Order Response Transform for Visual Recognition: Yan Wang, Lingxi Xie, Chenxi Liu, Siyuan Qiao, Ya Zhang, Wenjun Zhang, Qi Tian, Alan Yuille; [ pdf] [ arXiv]
Adversarial Examples for Semantic Segmentation and Object Detection: Cihang Xie, Jianyu Wang, Zhishuai Zhang, Yuyin Zhou, Lingxi Xie, Alan Yuille; [ pdf] [ supp] [ arXiv]
Genetic CNN: Lingxi Xie, Alan Yuille; [ pdf] [ arXiv]
Channel Pruning for Accelerating Very Deep Neural Networks: Yihui He, Xiangyu Zhang, Jian Sun; [ pdf] [ arXiv]
Infinite Latent Feature Selection: A Probabilistic Latent Graph-Based Ranking Approach: Giorgio Roffo, Simone Melzi, Umberto Castellani, Alessandro Vinciarelli; [ pdf] [ arXiv]
Video Fill in the Blank Using LR/RL LSTMs With Spatial-Temporal Attentions: Amir Mazaheri, Dong Zhang, Mubarak Shah; [ pdf] [ arXiv]
Primary Video Object Segmentation via Complementary CNNs and Neighborhood Reversible Flow: Jia Li, Anlin Zheng, Xiaowu Chen, Bin Zhou; [ pdf]
Attentive Semantic Video Generation Using Captions: Tanya Marwah, Gaurav Mittal, Vineeth N. Balasubramanian; [ pdf] [ arXiv]
Following Gaze in Video: Adria Recasens, Carl Vondrick, Aditya Khosla, Antonio Torralba; [ pdf]
Adaptive RNN Tree for Large-Scale Human Action Recognition: Wenbo Li, Longyin Wen, Ming-Ching Chang, Ser Nam Lim, Siwei Lyu; [ pdf]
Spatio-Temporal Person Retrieval via Natural Language Queries: Masataka Yamaguchi, Kuniaki Saito, Yoshitaka Ushiku, Tatsuya Harada; [ pdf] [ supp] [ arXiv]
Automatic Spatially-Aware Fashion Concept Discovery: Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry S. Davis; [ pdf] [ arXiv]
ChromaTag: A Colored Marker and Fast Detection Algorithm: Joseph DeGol, Timothy Bretl, Derek Hoiem; [ pdf] [ supp] [ arXiv]
Adversarial Image Perturbation for Privacy Protection -- A Game Theory Perspective: Seong Joon Oh, Mario Fritz, Bernt Schiele; [ pdf] [ supp] [ arXiv]
WeText: Scene Text Detection Under Weak Supervision: Shangxuan Tian, Shijian Lu, Chongshou Li; [ pdf]
Arbitrary Style Transfer in Real-Time With Adaptive Instance Normalization: Xun Huang, Serge Belongie; [ pdf] [ supp] [ arXiv]
Photographic Image Synthesis With Cascaded Refinement Networks: Qifeng Chen, Vladlen Koltun; [ pdf] [ arXiv]
SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again: Wadim Kehl, Fabian Manhardt, Federico Tombari, Slobodan Ilic, Nassir Navab; [ pdf] [ supp]
Unsupervised Creation of Parameterized Avatars: Lior Wolf, Yaniv Taigman, Adam Polyak; [ pdf] [ supp] [ arXiv]
Learning for Active 3D Mapping: Karel Zimmermann, Tomas Petricek, Vojtech Salansky, Tomas Svoboda; [ pdf] [ supp] [ arXiv]
Toward Perceptually-Consistent Stereo: A Scanline Study: Jialiang Wang, Daniel Glasner, Todd Zickler; [ pdf]
Surface Normals in the Wild: Weifeng Chen, Donglai Xiang, Jia Deng; [ pdf] [ supp] [ arXiv]
Unsupervised Learning of Stereo Matching: Chao Zhou, Hong Zhang, Xiaoyong Shen, Jiaya Jia; [ pdf]
Unrestricted Facial Geometry Reconstruction Using Image-To-Image Translation: Matan Sela, Elad Richardson, Ron Kimmel; [ pdf] [ supp]
Learned Multi-Patch Similarity: Wilfried Hartmann, Silvano Galliani, Michal Havlena, Luc Van Gool, Konrad Schindler; [ pdf] [ supp] [ arXiv]
Click Here: Human-Localized Keypoints as Guidance for Viewpoint Estimation: Ryan Szeto, Jason J. Corso; [ pdf] [ supp] [ arXiv]
Unsupervised Adaptation for Deep Stereo: Alessio Tonioni, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano; [ pdf] [ supp]
Composite Focus Measure for High Quality Depth Maps: Parikshit Sakurikar, P. J. Narayanan; [ pdf]
Reconstruction-Based Disentanglement for Pose-Invariant Face Recognition: Xi Peng, Xiang Yu, Kihyuk Sohn, Dimitris N. Metaxas, Manmohan Chandraker; [ pdf] [ supp] [ arXiv]
Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection: Shengtao Xiao, Jiashi Feng, Luoqi Liu, Xuecheng Nie, Wei Wang, Shuicheng Yan, Ashraf Kassim; [ pdf] [ supp]
Anchored Regression Networks Applied to Age Estimation and Super Resolution: Eirikur Agustsson, Radu Timofte, Luc Van Gool; [ pdf]
Infant Footprint Recognition: Eryun Liu; [ pdf]
Self-Paced Kernel Estimation for Robust Blind Image Deblurring: Dong Gong, Mingkui Tan, Yanning Zhang, Anton van den Hengel, Qinfeng Shi; [ pdf]
Super-Trajectory for Video Segmentation: Wenguan Wang, Jianbing Shen, Jianwen Xie, Fatih Porikli; [ pdf] [ arXiv]
Be Your Own Prada: Fashion Synthesis With Structural Coherence: Shizhan Zhu, Raquel Urtasun, Sanja Fidler, Dahua Lin, Chen Change Loy; [ pdf]
Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution: Huaibo Huang, Ran He, Zhenan Sun, Tieniu Tan; [ pdf]
Learning Gaze Transitions From Depth to Improve Video Saliency Estimation: George Leifman, Dmitry Rudoy, Tristan Swedish, Eduardo Bayro-Corrochano, Ramesh Raskar; [ pdf] [ supp] [ arXiv]
Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation: Shuhang Gu, Deyu Meng, Wangmeng Zuo, Lei Zhang; [ pdf] [ supp]
Modelling the Scene Dependent Imaging in Cameras With a Deep Neural Network: Seonghyeon Nam, Seon Joo Kim; [ pdf] [ supp] [ arXiv]
Transformed Low-Rank Model for Line Pattern Noise Removal: Yi Chang, Luxin Yan, Sheng Zhong; [ pdf]
Weakly Supervised Manifold Learning for Dense Semantic Object Correspondence: Utkarsh Gaur, B. S. Manjunath; [ pdf]
Dual Motion GAN for Future-Flow Embedded Video Prediction: Xiaodan Liang, Lisa Lee, Wei Dai, Eric P. Xing; [ pdf] [ arXiv]
Online Robust Image Alignment via Subspace Learning From Gradient Orientations: Qingqing Zheng, Yi Wang, Pheng-Ann Heng; [ pdf] [ supp]
Learning Dynamic Siamese Network for Visual Object Tracking: Qing Guo, Wei Feng, Ce Zhou, Rui Huang, Liang Wan, Song Wang; [ pdf]
High Order Tensor Formulation for Convolutional Sparse Coding: Adel Bibi, Bernard Ghanem; [ pdf] [ supp]
Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems: Tim Meinhardt, Michael Moller, Caner Hazirbas, Daniel Cremers; [ pdf] [ supp] [ arXiv]
ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond: Siyuan Qiao, Wei Shen, Weichao Qiu, Chenxi Liu, Alan Yuille; [ pdf] [ arXiv]
Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection: Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, Abhinav Gupta; [ pdf] [ supp] [ arXiv]
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation: Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, Boqing Gong; [ pdf] [ arXiv]
Multi-Modal Factorized Bilinear Pooling With Co-Attention Learning for Visual Question Answering: Zhou Yu, Jun Yu, Jianping Fan, Dacheng Tao; [ pdf] [ arXiv]
SCNet: Learning Semantic Correspondence: Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, Minsu Cho, Cordelia Schmid, Jean Ponce; [ pdf] [ arXiv]
Soft Proposal Networks for Weakly Supervised Object Localization: Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao; [ pdf] [ arXiv]
Class Rectification Hard Mining for Imbalanced Deep Learning: Qi Dong, Shaogang Gong, Xiatian Zhu; [ pdf]
Generating High-Quality Crowd Density Maps Using Contextual Pyramid CNNs: Vishwanath A. Sindagi, Vishal M. Patel; [ pdf] [ arXiv]
See the Glass Half Full: Reasoning About Liquid Containers, Their Volume and Content: Roozbeh Mottaghi, Connor Schenck, Dieter Fox, Ali Farhadi; [ pdf] [ arXiv]
Hierarchical Multimodal LSTM for Dense Visual-Semantic Embedding: Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, Gang Hua; [ pdf]
Identity-Aware Textual-Visual Matching With Latent Co-Attention: Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, Xiaogang Wang; [ pdf] [ arXiv]
Learning Deep Neural Networks for Vehicle Re-ID With Visual-Spatio-Temporal Path Proposals: Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang; [ pdf]
Learning From Noisy Labels With Distillation: Yuncheng Li, Jianchao Yang, Yale Song, Liangliang Cao, Jiebo Luo, Li-Jia Li; [ pdf] [ arXiv]
DSOD: Learning Deeply Supervised Object Detectors From Scratch: Zhiqiang Shen, Zhuang Liu, Jianguo Li, Yu-Gang Jiang, Yurong Chen, Xiangyang Xue; [ pdf] [ arXiv]
Phrase Localization and Visual Relationship Detection With Comprehensive Image-Language Cues: Bryan A. Plummer, Arun Mallya, Christopher M. Cervantes, Julia Hockenmaier, Svetlana Lazebnik; [ pdf] [ supp] [ arXiv]
Chained Cascade Network for Object Detection: Wanli Ouyang, Kun Wang, Xin Zhu, Xiaogang Wang; [ pdf]
VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition: Seokju Lee, Junsik Kim, Jae Shin Yoon, Seunghak Shin, Oleksandr Bailo, Namil Kim, Tae-Hee Lee, Hyun Seok Hong, Seung-Hoon Han, In So Kweon; [ pdf] [ supp]
Unsupervised Learning of Important Objects From First-Person Videos: Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi; [ pdf] [ arXiv]
An Analysis of Visual Question Answering Algorithms: Kushal Kafle, Christopher Kanan; [ pdf] [ arXiv]
Visual Relationship Detection With Internal and External Linguistic Knowledge Distillation: Ruichi Yu, Ang Li, Vlad I. Morariu, Larry S. Davis; [ pdf] [ supp] [ arXiv]
A Two Stream Siamese Convolutional Neural Network for Person Re-Identification: Dahjung Chung, Khalid Tahboub, Edward J. Delp; [ pdf]
No More Discrimination: Cross City Adaptation of Road Scene Segmenters: Yi-Hsin Chen, Wei-Yu Chen, Yu-Ting Chen, Bo-Cheng Tsai, Yu-Chiang Frank Wang, Min Sun; [ pdf] [ supp] [ arXiv]
Open Vocabulary Scene Parsing: Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, Antonio Torralba; [ pdf] [ supp] [ arXiv]
Learned Watershed: End-To-End Learning of Seeded Segmentation: Steffen Wolf, Lukas Schott, Ullrich Kothe, Fred Hamprecht; [ pdf] [ supp]
Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes: Yang Zhang, Philip David, Boqing Gong; [ pdf] [ arXiv]
Scale-Adaptive Convolutions for Scene Parsing: Rui Zhang, Sheng Tang, Yongdong Zhang, Jintao Li, Shuicheng Yan; [ pdf]
Privacy-Preserving Visual Learning Using Doubly Permuted Homomorphic Encryption: Ryo Yonetani, Vishnu Naresh Boddeti, Kris M. Kitani, Yoichi Sato; [ pdf] [ supp] [ arXiv]
Multi-Task Self-Supervised Visual Learning: Carl Doersch, Andrew Zisserman; [ pdf] [ arXiv]
A Self-Balanced Min-Cut Algorithm for Image Clustering: Xiaojun Chen, Joshua Zhexue Huang, Feiping Nie, Renjie Chen, Qingyao Wu; [ pdf]
Is Second-Order Information Helpful for Large-Scale Visual Recognition?: Peihua Li, Jiangtao Xie, Qilong Wang, Wangmeng Zuo; [ pdf] [ arXiv]
Factorized Bilinear Models for Image Recognition: Yanghao Li, Naiyan Wang, Jiaying Liu, Xiaodi Hou; [ pdf] [ supp] [ arXiv]
Octree Generating Networks: Efficient Convolutional Architectures for High-Resolution 3D Outputs: Maxim Tatarchenko, Alexey Dosovitskiy, Thomas Brox; [ pdf] [ supp] [ arXiv]
Truncating Wide Networks Using Binary Tree Architectures: Yan Zhang, Mete Ozay, Shuohao Li, Takayuki Okatani; [ pdf] [ supp] [ arXiv]
Bringing Background Into the Foreground: Making All Classes Equal in Weakly-Supervised Video Semantic Segmentation: Fatemeh Sadat Saleh, Mohammad Sadegh Aliakbarian, Mathieu Salzmann, Lars Petersson, Jose M. Alvarez; [ pdf] [ arXiv]
View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition From Skeleton Data: Pengfei Zhang, Cuiling Lan, Junliang Xing, Wenjun Zeng, Jianru Xue, Nanning Zheng; [ pdf] [ arXiv]
Joint Discovery of Object States and Manipulation Actions: Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, Simon Lacoste-Julien; [ pdf] [ arXiv]
What Actions Are Needed for Understanding Human Actions in Videos?: Gunnar A. Sigurdsson, Olga Russakovsky, Abhinav Gupta; [ pdf]
Lattice Long Short-Term Memory for Human Action Recognition: Lin Sun, Kui Jia, Kevin Chen, Dit-Yan Yeung, Bertram E. Shi, Silvio Savarese; [ pdf] [ arXiv]
Common Action Discovery and Localization in Unconstrained Videos: Jiong Yang, Junsong Yuan; [ pdf]
Pixel-Level Matching for Video Object Segmentation Using Convolutional Neural Networks: Jae Shin Yoon, Francois Rameau, Junsik Kim, Seokju Lee, Seunghak Shin, In So Kweon; [ pdf] [ arXiv]
Am I a Baller? Basketball Performance Assessment From First-Person Videos: Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi; [ pdf] [ arXiv]
Deep Cropping via Attention Box Prediction and Aesthetics Assessment: Wenguan Wang, Jianbing Shen; [ pdf]
Raster-To-Vector: Revisiting Floorplan Transformation: Chen Liu, Jiajun Wu, Pushmeet Kohli, Yasutaka Furukawa; [ pdf] [ supp]
Deep TextSpotter: An End-To-End Trainable Scene Text Localization and Recognition Framework: Michal Busta, Lukas Neumann, Jiri Matas; [ pdf]
Playing for Benchmarks: Stephan R. Richter, Zeeshan Hayder, Vladlen Koltun; [ pdf] [ arXiv]
Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks: Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros; [ pdf]
GANs for Biological Image Synthesis: Anton Osokin, Anatole Chessel, Rafael E. Carazo Salas, Federico Vaggi; [ pdf] [ arXiv]
Learning to Synthesize a 4D RGBD Light Field From a Single Image: Pratul P. Srinivasan, Tongzhou Wang, Ashwin Sreelal, Ravi Ramamoorthi, Ren Ng; [ pdf] [ supp] [ arXiv]
Neural EPI-Volume Networks for Shape From Light Field: Stefan Heber, Wei Yu, Thomas Pock; [ pdf]
Material Editing Using a Physically Based Rendering Network: Guilin Liu, Duygu Ceylan, Ersin Yumer, Jimei Yang, Jyh-Ming Lien; [ pdf] [ supp] [ arXiv]
Turning Corners Into Cameras: Principles and Methods: Katherine L. Bouman, Vickie Ye, Adam B. Yedidia, Fredo Durand, Gregory W. Wornell, Antonio Torralba, William T. Freeman; [ pdf]
Linear Differential Constraints for Photo-Polarimetric Height Estimation: Silvia Tozza, William A. P. Smith, Dizhong Zhu, Ravi Ramamoorthi, Edwin R. Hancock; [ pdf] [ arXiv]
Polynomial Solvers for Saturated Ideals: Viktor Larsson, Kalle Astrom, Magnus Oskarsson; [ pdf]
Shape Inpainting Using 3D Generative Adversarial Network and Recurrent Convolutional Networks: Weiyue Wang, Qiangui Huang, Suya You, Chao Yang, Ulrich Neumann; [ pdf]
SurfaceNet: An End-To-End 3D Neural Network for Multiview Stereopsis: Mengqi Ji, Juergen Gall, Haitian Zheng, Yebin Liu, Lu Fang; [ pdf]
Making Minimal Solvers for Absolute Pose Estimation Compact and Robust: Viktor Larsson, Zuzana Kukelova, Yinqiang Zheng; [ pdf] [ supp]
3D Surface Detail Enhancement From a Single Normal Map: Wuyuan Xie, Miaohui Wang, Xianbiao Qi, Lei Zhang; [ pdf] [ supp]
RMPE: Regional Multi-Person Pose Estimation: Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, Cewu Lu; [ pdf] [ arXiv]
Online Video Object Detection Using Association LSTM: Yongyi Lu, Cewu Lu, Chi-Keung Tang; [ pdf]
PolyFit: Polygonal Surface Reconstruction From Point Clouds: Liangliang Nan, Peter Wonka; [ pdf] [ supp]
Progressive Large Scale-Invariant Image Matching in Scale Space: Lei Zhou, Siyu Zhu, Tianwei Shen, Jinglu Wang, Tian Fang, Long Quan; [ pdf]
Efficient Global 2D-3D Matching for Camera Localization in a Large-Scale 3D Map: Liu Liu, Hongdong Li, Yuchao Dai; [ pdf] [ supp]
Multi-View Non-Rigid Refinement and Normal Selection for High Quality 3D Reconstruction: Sk. Mohammadul Haque, Venu Madhav Govindu; [ pdf] [ supp]
Multi-Stage Multi-Recursive-Input Fully Convolutional Networks for Neuronal Boundary Detection: Wei Shen, Bin Wang, Yuan Jiang, Yan Wang, Alan Yuille; [ pdf]
Depth and Image Restoration From Light Field in a Scattering Medium: Jiandong Tian, Zachary Murez, Tong Cui, Zhen Zhang, David Kriegman, Ravi Ramamoorthi; [ pdf]
Video Reflection Removal Through Spatio-Temporal Optimization: Ajay Nandoriya, Mohamed Elgharib, Changil Kim, Mohamed Hefeeda, Wojciech Matusik; [ pdf]
Efficient Online Local Metric Adaptation via Negative Samples for Person Re-Identification: Jiahuan Zhou, Pei Yu, Wei Tang, Ying Wu; [ pdf] [ supp]
Stepwise Metric Promotion for Unsupervised Video Person Re-Identification: Zimo Liu, Dong Wang, Huchuan Lu; [ pdf]
Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis: Rui Huang, Shu Zhang, Tianyu Li, Ran He; [ pdf] [ supp] [ arXiv]
Group Re-Identification via Unsupervised Transfer of Sparse Features Encoding: Giuseppe Lisanti, Niki Martinel, Alberto Del Bimbo, Gian Luca Foresti; [ pdf] [ arXiv]
Visual Transformation Aided Contrastive Learning for Video-Based Kinship Verification: Hamdi Dibeklioglu; [ pdf]
Decoder Network Over Lightweight Reconstructed Feature for Fast Semantic Style Transfer: Ming Lu, Hao Zhao, Anbang Yao, Feng Xu, Yurong Chen, Li Zhang; [ pdf]
Blind Image Deblurring With Outlier Handling: Jiangxin Dong, Jinshan Pan, Zhixun Su, Ming-Hsuan Yang; [ pdf] [ supp]
Paying Attention to Descriptions Generated by Image Captioning Models: Hamed R. Tavakoli, Rakshith Shetty, Ali Borji, Jorma Laaksonen; [ pdf] [ arXiv]
Fast Image Processing With Fully-Convolutional Networks: Qifeng Chen, Jia Xu, Vladlen Koltun; [ pdf] [ arXiv]
Robust Video Super-Resolution With Learned Temporal Dynamics: Ding Liu, Zhaowen Wang, Yuchen Fan, Xianming Liu, Zhangyang Wang, Shiyu Chang, Thomas Huang; [ pdf]
Should We Encode Rain Streaks in Video as Deterministic or Stochastic?: Wei Wei, Lixuan Yi, Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu; [ pdf] [ supp]
Joint Bi-Layer Optimization for Single-Image Rain Streak Removal: Lei Zhu, Chi-Wing Fu, Dani Lischinski, Pheng-Ann Heng; [ pdf] [ supp]
Low-Dimensionality Calibration Through Local Anisotropic Scaling for Robust Hand Model Personalization: Edoardo Remelli, Anastasia Tkach, Andrea Tagliasacchi, Mark Pauly; [ pdf] [ supp]
Non-Markovian Globally Consistent Multi-Object Tracking: Andrii Maksai, Xinchao Wang, Francois Fleuret, Pascal Fua; [ pdf] [ supp]
CREST: Convolutional Residual Learning for Visual Tracking: Yibing Song, Chao Ma, Lijun Gong, Jiawei Zhang, Rynson W. H. Lau, Ming-Hsuan Yang; [ pdf] [ arXiv]
Volumetric Flow Estimation for Incompressible Fluids Using the Stationary Stokes Equations: Katrin Lasinger, Christoph Vogel, Konrad Schindler; [ pdf] [ supp]
Bounding Boxes, Segmentations and Object Coordinates: How Important Is Recognition for 3D Scene Flow Estimation in Autonomous Driving Scenarios?: Aseem Behl, Omid Hosseini Jafari, Siva Karthik Mustikovela, Hassan Abu Alhaija, Carsten Rother, Andreas Geiger; [ pdf] [ supp]
Performance Guaranteed Network Acceleration via High-Order Residual Quantization: Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, Wen Gao; [ pdf] [ arXiv]
Deep Metric Learning With Angular Loss: Jian Wang, Feng Zhou, Shilei Wen, Xiao Liu, Yuanqing Lin; [ pdf] [ arXiv]
Compositional Human Pose Regression: Xiao Sun, Jiaxiang Shang, Shuang Liang, Yichen Wei; [ pdf] [ arXiv]
MUTAN: Multimodal Tucker Fusion for Visual Question Answering: Hedi Ben-younes, Remi Cadene, Matthieu Cord, Nicolas Thome; [ pdf] [ arXiv]
Revisiting IM2GPS in the Deep Learning Era: Nam Vo, Nathan Jacobs, James Hays; [ pdf] [ supp] [ arXiv]
Scene Parsing With Global Context Embedding: Wei-Chih Hung, Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang; [ pdf] [ supp]
A Simple yet Effective Baseline for 3D Human Pose Estimation: Julieta Martinez, Rayat Hossain, Javier Romero, James J. Little; [ pdf] [ arXiv]
Dual-Glance Model for Deciphering Social Relationships: Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli; [ pdf]
Sketching With Style: Visual Search With Sketches and Aesthetic Context: John Collomosse, Tu Bui, Michael J. Wilber, Chen Fang, Hailin Jin; [ pdf] [ supp]
Point Set Registration With Global-Local Correspondence and Transformation Estimation: Su Zhang, Yang Yang, Kun Yang, Yi Luo, Sim-Heng Ong; [ pdf]
SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-Training on Indoor Segmentation?: John McCormac, Ankur Handa, Stefan Leutenegger, Andrew J. Davison; [ pdf]
A Unified Model for Near and Remote Sensing: Scott Workman, Menghua Zhai, David J. Crandall, Nathan Jacobs; [ pdf] [ supp] [ arXiv]
Directionally Convolutional Networks for 3D Shape Segmentation: Haotian Xu, Ming Dong, Zichun Zhong; [ pdf]
AMAT: Medial Axis Transform for Natural Images: Stavros Tsogkas, Sven Dickinson; [ pdf] [ supp] [ arXiv]
Deep Dual Learning for Semantic Image Segmentation: Ping Luo, Guangrun Wang, Liang Lin, Xiaogang Wang; [ pdf]
Regional Interactive Image Segmentation Networks: Jun Hao Liew, Yunchao Wei, Wei Xiong, Sim-Heng Ong, Jiashi Feng; [ pdf] [ supp]
Learning Efficient Convolutional Networks Through Network Slimming: Zhuang Liu, Jianguo Li, Zhiqiang Shen, Gao Huang, Shoumeng Yan, Changshui Zhang; [ pdf] [ supp] [ arXiv]
CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training: Jianmin Bao, Dong Chen, Fang Wen, Houqiang Li, Gang Hua; [ pdf] [ supp]
Universal Adversarial Perturbations Against Semantic Image Segmentation: Jan Hendrik Metzen, Mummadi Chaithanya Kumar, Thomas Brox, Volker Fischer; [ pdf] [ supp] [ arXiv]
Associative Domain Adaptation: Philip Haeusser, Thomas Frerix, Alexander Mordvintsev, Daniel Cremers; [ pdf] [ supp] [ arXiv]
Introspective Neural Networks for Generative Modeling: Justin Lazarow, Long Jin, Zhuowen Tu; [ pdf]
Towards a Unified Compositional Model for Visual Pattern Modeling: Wei Tang, Pei Yu, Jiahuan Zhou, Ying Wu; [ pdf] [ supp]
Least Squares Generative Adversarial Networks: Xudong Mao, Qing Li, Haoran Xie, Raymond Y.K. Lau, Zhen Wang, Stephen Paul Smolley; [ pdf] [ supp] [ arXiv]
Centered Weight Normalization in Accelerating Training of Deep Neural Networks: Lei Huang, Xianglong Liu, Yang Liu, Bo Lang, Dacheng Tao; [ pdf] [ supp]
Deep Growing Learning: Guangcong Wang, Xiaohua Xie, Jianhuang Lai, Jiaxuan Zhuo; [ pdf]
Smart Mining for Deep Metric Learning: Ben Harwood, Vijay Kumar B G, Gustavo Carneiro, Ian Reid, Tom Drummond; [ pdf] [ supp] [ arXiv]
Temporal Generative Adversarial Nets With Singular Value Clipping: Masaki Saito, Eiichi Matsumoto, Shunta Saito; [ pdf] [ arXiv]
Sampling Matters in Deep Embedding Learning: Chao-Yuan Wu, R. Manmatha, Alexander J. Smola, Philipp Krahenbuhl; [ pdf] [ supp] [ arXiv]
DualGAN: Unsupervised Dual Learning for Image-To-Image Translation: Zili Yi, Hao Zhang, Ping Tan, Minglun Gong; [ pdf]
Learning View-Invariant Features for Person Identification in Temporally Synchronized Videos Taken by Wearable Cameras: Kang Zheng, Xiaochuan Fan, Yuewei Lin, Hao Guo, Hongkai Yu, Dazhou Guo, Song Wang; [ pdf]
MarioQA: Answering Questions by Watching Gameplay Videos: Jonghwan Mun, Paul Hongsuck Seo, Ilchae Jung, Bohyung Han; [ pdf] [ supp] [ arXiv]
SBGAR: Semantics Based Group Activity Recognition: Xin Li, Mooi Choo Chuah; [ pdf]
Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video: Davide Moltisanti, Michael Wray, Walterio Mayol-Cuevas, Dima Damen; [ pdf] [ supp] [ arXiv]
Unmasking the Abnormal Events in Video: Radu Tudor Ionescu, Sorina Smeureanu, Bogdan Alexe, Marius Popescu; [ pdf] [ arXiv]
Chained Multi-Stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection: Mohammadreza Zolfaghari, Gabriel L. Oliveira, Nima Sedaghat, Thomas Brox; [ pdf] [ supp] [ arXiv]
Temporal Action Detection With Structured Segment Networks: Yue Zhao, Yuanjun Xiong, Limin Wang, Zhirong Wu, Xiaoou Tang, Dahua Lin; [ pdf] [ supp] [ arXiv]
Jointly Recognizing Object Fluents and Tasks in Egocentric Videos: Yang Liu, Ping Wei, Song-Chun Zhu; [ pdf]
Transferring Objects: Joint Inference of Container and Human Pose: Hanqing Wang, Wei Liang, Lap-Fai Yu; [ pdf]
Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention: Jinkyu Kim, John Canny; [ pdf] [ arXiv]
Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning: Abhishek Das, Satwik Kottur, Jose M. F. Moura, Stefan Lee, Dhruv Batra; [ pdf] [ supp] [ arXiv]
Mask R-CNN: Kaiming He, Georgia Gkioxari, Piotr Dollar, Ross Girshick; [ pdf] [ arXiv]
Towards Diverse and Natural Image Descriptions via a Conditional GAN: Bo Dai, Sanja Fidler, Raquel Urtasun, Dahua Lin; [ pdf] [ arXiv]
Focal Loss for Dense Object Detection: Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollar; [ pdf] [ arXiv]
Inferring and Executing Programs for Visual Reasoning: Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Judy Hoffman, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick; [ pdf] [ supp] [ arXiv]
Visual Forecasting by Imitating Dynamics in Natural Sequences: Kuo-Hao Zeng, William B. Shen, De-An Huang, Min Sun, Juan Carlos Niebles; [ pdf] [ supp] [ arXiv]
TorontoCity: Seeing the World With a Million Eyes: Shenlong Wang, Min Bai, Gellert Mattyus, Hang Chu, Wenjie Luo, Bin Yang, Justin Liang, Joel Cheverie, Sanja Fidler, Raquel Urtasun; [ pdf] [ arXiv]
Low-Shot Visual Recognition by Shrinking and Hallucinating Features: Bharath Hariharan, Ross Girshick; [ pdf] [ supp] [ arXiv]
A Coarse-Fine Network for Keypoint Localization: Shaoli Huang, Mingming Gong, Dacheng Tao; [ pdf]
Detect to Track and Track to Detect: Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman; [ pdf]
Single Shot Text Detector With Regional Attention: Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, Xiaolin Li; [ pdf] [ arXiv]
SubUNets: End-To-End Hand Shape and Continuous Sign Language Recognition: Necati Cihan Camgoz, Simon Hadfield, Oscar Koller, Richard Bowden; [ pdf]
A Spatiotemporal Oriented Energy Network for Dynamic Texture Recognition: Isma Hadji, Richard P. Wildes; [ pdf] [ supp] [ arXiv]
Probabilistic Structure From Motion With Objects (PSfMO): Paul Gay, Cosimo Rubino, Vaibhav Bansal, Alessio Del Bue; [ pdf]
A 3D Morphable Model of Craniofacial Shape and Texture Variation: Hang Dai, Nick Pears, William A. P. Smith, Christian Duncan; [ pdf]
Multi-View Dynamic Shape Refinement Using Local Temporal Integration: Vincent Leroy, Jean-Sebastien Franco, Edmond Boyer; [ pdf]
Learning Hand Articulations by Hallucinating Heat Distribution: Chiho Choi, Sangpil Kim, Karthik Ramani; [ pdf] [ supp]
Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization With Spatially-Varying Lighting: Robert Maier, Kihwan Kim, Daniel Cremers, Jan Kautz, Matthias Niessner; [ pdf] [ supp] [ arXiv]
Robust Hand Pose Estimation During the Interaction With an Unknown Object: Chiho Choi, Sang Ho Yoon, Chin-Ning Chen, Karthik Ramani; [ pdf] [ supp]
Detailed Surface Geometry and Albedo Recovery From RGB-D Video Under Natural Illumination: Xinxin Zuo, Sen Wang, Jiangbin Zheng, Ruigang Yang; [ pdf] [ arXiv]
Monocular Free-Head 3D Gaze Tracking With Deep Learning and Geometry Constraints: Wangjiang Zhu, Haoping Deng; [ pdf]
Filter Selection for Hyperspectral Estimation: Boaz Arad, Ohad Ben-Shahar; [ pdf]
A Microfacet-Based Reflectance Model for Photometric Stereo With Highly Specular Surfaces: Lixiong Chen, Yinqiang Zheng, Boxin Shi, Art Subpa-Asa, Imari Sato; [ pdf]
Detecting Faces Using Inside Cascaded Contextual CNN: Kaipeng Zhang, Zhanpeng Zhang, Hao Wang, Zhifeng Li, Yu Qiao, Wei Liu; [ pdf]
A Novel Space-Time Representation on the Positive Semidefinite Cone for Facial Expression Recognition: Anis Kacem, Mohamed Daoudi, Boulbaba Ben Amor, Juan Carlos Alvarez-Paiva; [ pdf]
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding: Dieu Linh Tran, Robert Walecki, Ognjen (Oggi) Rudovic, Stefanos Eleftheriadis, Bjorn Schuller, Maja Pantic; [ pdf] [ arXiv]
Pose-Invariant Face Alignment With a Single CNN: Amin Jourabloo, Mao Ye, Xiaoming Liu, Liu Ren; [ pdf] [ arXiv]
Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos: Kihyuk Sohn, Sifei Liu, Guangyu Zhong, Xiang Yu, Ming-Hsuan Yang, Manmohan Chandraker; [ pdf] [ supp] [ arXiv]
Deeply-Learned Part-Aligned Representations for Person Re-Identification: Liming Zhao, Xi Li, Yueting Zhuang, Jingdong Wang; [ pdf] [ arXiv]
Semantic Line Detection and Its Applications: Jun-Tae Lee, Han-Ul Kim, Chul Lee, Chang-Su Kim; [ pdf]
A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing: Qingnan Fan, Jiaolong Yang, Gang Hua, Baoquan Chen, David Wipf; [ pdf] [ supp] [ arXiv]
Revisiting Cross-Channel Information Transfer for Chromatic Aberration Correction: Tiancheng Sun, Yifan Peng, Wolfgang Heidrich; [ pdf] [ supp]
High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits: Xiaoyong Shen, Hongyun Gao, Xin Tao, Chao Zhou, Jiaya Jia; [ pdf] [ arXiv]
Learning Visual Attention to Identify People With Autism Spectrum Disorder: Ming Jiang, Qi Zhao; [ pdf]
DSLR-Quality Photos on Mobile Devices With Deep Convolutional Networks: Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc Van Gool; [ pdf] [ supp] [ arXiv]
Non-Uniform Blind Deblurring by Reblurring: Yuval Bahat, Netalee Efrat, Michal Irani; [ pdf]
Misalignment-Robust Joint Filter for Cross-Modal Image Pairs: Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi; [ pdf] [ supp]
Low-Rank Tensor Completion: A Pseudo-Bayesian Learning Approach: Wei Chen, Nan Song; [ pdf]
DeepCD: Learning Deep Complementary Descriptors for Patch Representations: Tsun-Yi Yang, Jo-Han Hsu, Yen-Yu Lin, Yung-Yu Chuang; [ pdf]
Beyond Standard Benchmarks: Parameterizing Performance Evaluation in Visual Object Tracking: Luka Cehovin Zajc, Alan Lukezic, Ales Leonardis, Matej Kristan; [ pdf]
The Pose Knows: Video Forecasting by Generating Pose Futures: Jacob Walker, Kenneth Marino, Abhinav Gupta, Martial Hebert; [ pdf] [ arXiv]
What Will Happen Next? Forecasting Player Moves in Sports Videos: Panna Felsen, Pulkit Agrawal, Jitendra Malik; [ pdf]
Robust Kronecker-Decomposable Component Analysis for Low-Rank Modeling: Mehdi Bahri, Yannis Panagakis, Stefanos Zafeiriou; [ pdf] [ arXiv]
Recurrent Topic-Transition GAN for Visual Paragraph Generation: Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, Eric P. Xing; [ pdf] [ arXiv]
A Two-Streamed Network for Estimating Fine-Scaled Depth Maps From Single RGB Images: Jun Li, Reinhard Klein, Angela Yao; [ pdf] [ supp] [ arXiv]
Weakly Supervised Object Localization Using Things and Stuff Transfer: Miaojing Shi, Holger Caesar, Vittorio Ferrari; [ pdf] [ supp] [ arXiv]
Single Image Action Recognition Using Semantic Body Part Actions: Zhichen Zhao, Huimin Ma, Shaodi You; [ pdf] [ arXiv]
Incremental Learning of Object Detectors Without Catastrophic Forgetting: Konstantin Shmelkov, Cordelia Schmid, Karteek Alahari; [ pdf] [ arXiv]
Generative Adversarial Networks Conditioned by Brain Signals: Simone Palazzo, Concetto Spampinato, Isaak Kavasidis, Daniela Giordano, Mubarak Shah; [ pdf]
Learning to Disambiguate by Asking Discriminative Questions: Yining Li, Chen Huang, Xiaoou Tang, Chen Change Loy; [ pdf] [ supp] [ arXiv]
Interpretable Explanations of Black Boxes by Meaningful Perturbation: Ruth C. Fong, Andrea Vedaldi; [ pdf] [ supp] [ arXiv]
DeepRoadMapper: Extracting Road Topology From Aerial Images: Gellert Mattyus, Wenjie Luo, Raquel Urtasun; [ pdf]
Monocular 3D Human Pose Estimation by Predicting Depth on Joints: Bruce Xiaohan Nie, Ping Wei, Song-Chun Zhu; [ pdf]
Large-Scale Image Retrieval With Attentive Deep Local Features: Hyeonwoo Noh, Andre Araujo, Jack Sim, Tobias Weyand, Bohyung Han; [ pdf] [ arXiv]
Deep Globally Constrained MRFs for Human Pose Estimation: Ioannis Marras, Petar Palasek, Ioannis Patras; [ pdf]
Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning: Soravit Changpinyo, Wei-Lun Chao, Fei Sha; [ pdf] [ supp] [ arXiv]
Multi-Label Learning of Part Detectors for Heavily Occluded Pedestrian Detection: Chunluan Zhou, Junsong Yuan; [ pdf] [ supp]
SGN: Sequential Grouping Networks for Instance Segmentation: Shu Liu, Jiaya Jia, Sanja Fidler, Raquel Urtasun; [ pdf]
Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors: Hong-Yu Zhou, Bin-Bin Gao, Jianxin Wu; [ pdf] [ arXiv]
Aesthetic Critiques Generation for Photos: Kuang-Yu Chang, Kung-Hung Lu, Chu-Song Chen; [ pdf] [ supp]
Hide-And-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization: Krishna Kumar Singh, Yong Jae Lee; [ pdf] [ supp]
Two-Phase Learning for Weakly Supervised Object Localization: Dahun Kim, Donghyeon Cho, Donggeun Yoo, In So Kweon; [ pdf] [ arXiv]
Curriculum Dropout: Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, Rene Vidal, Vittorio Murino; [ pdf] [ supp] [ arXiv]
Predictor Combination at Test Time: Kwang In Kim, James Tompkin, Christian Richardt; [ pdf] [ supp]
Guided Perturbations: Self-Corrective Behavior in Convolutional Neural Networks: Swami Sankaranarayanan, Arpit Jain, Ser Nam Lim; [ pdf]
Learning Robust Visual-Semantic Embeddings: Yao-Hung Hubert Tsai, Liang-Kang Huang, Ruslan Salakhutdinov; [ pdf] [ supp] [ arXiv]
PUnDA: Probabilistic Unsupervised Domain Adaptation for Knowledge Transfer Across Visual Categories: Behnam Gholami, Ognjen (Oggi) Rudovic, Vladimir Pavlovic; [ pdf] [ supp]
Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses: Christian Rupprecht, Iro Laina, Robert DiPietro, Maximilian Baust, Federico Tombari, Nassir Navab, Gregory D. Hager; [ pdf] [ arXiv]
CDTS: Collaborative Detection, Tracking, and Segmentation for Online Multiple Object Segmentation in Videos: Yeong Jun Koh, Chang-Su Kim; [ pdf]
Temporal Superpixels Based on Proximity-Weighted Patch Matching: Se-Ho Lee, Won-Dong Jang, Chang-Su Kim; [ pdf]
Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge: Ryota Hinami, Tao Mei, Shin'ichi Satoh; [ pdf] [ supp] [ arXiv]
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals: Jiyang Gao, Zhenheng Yang, Kan Chen, Chen Sun, Ram Nevatia; [ pdf] [ supp] [ arXiv]
Online Real-Time Multiple Spatiotemporal Action Localisation and Prediction: Gurkirt Singh, Suman Saha, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin; [ pdf] [ supp] [ arXiv]
Leveraging Weak Semantic Relevance for Complex Video Event Classification: Chao Li, Jiewei Cao, Zi Huang, Lei Zhu, Heng Tao Shen; [ pdf]
Weakly Supervised Summarization of Web Videos: Rameswar Panda, Abir Das, Ziyan Wu, Jan Ernst, Amit K. Roy-Chowdhury; [ pdf] [ supp]
FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras: Shanghang Zhang, Guanhang Wu, Joao P. Costeira, Jose M. F. Moura; [ pdf]
Fast Face-Swap Using Convolutional Neural Networks: Iryna Korshunova, Wenzhe Shi, Joni Dambre, Lucas Theis; [ pdf] [ arXiv]
Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images: Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz; [ pdf] [ supp]
First-Person Activity Forecasting With Online Inverse Reinforcement Learning: Nicholas Rhinehart, Kris M. Kitani; [ pdf] [ supp] [ arXiv]
Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment With Limited Resources: Adrian Bulat, Georgios Tzimiropoulos; [ pdf] [ supp]
MoFA: Model-Based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction: Ayush Tewari, Michael Zollhofer, Hyeongwoo Kim, Pablo Garrido, Florian Bernard, Patrick Perez, Christian Theobalt; [ pdf] [ supp] [ arXiv]
RPAN: An End-To-End Recurrent Pose-Attention Network for Action Recognition in Videos: Wenbin Du, Yali Wang, Yu Qiao; [ pdf] [ supp]
Temporal Non-Volume Preserving Approach to Facial Age-Progression and Age-Invariant Face Recognition: Chi Nhan Duong, Kha Gia Quach, Khoa Luu, Ngan Le, Marios Savvides; [ pdf] [ arXiv]
Attribute-Enhanced Face Recognition With Neural Tensor Fusion Networks: Guosheng Hu, Yang Hua, Yang Yuan, Zhihong Zhang, Zheng Lu, Sankha S. Mukherjee, Timothy M. Hospedales, Neil M. Robertson, Yongxin Yang; [ pdf]
Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro: Zhedong Zheng, Liang Zheng, Yi Yang; [ pdf] [ arXiv]
Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks With Spatiotemporal Transformer Modules: Congqi Cao, Yifan Zhang, Yi Wu, Hanqing Lu, Jian Cheng; [ pdf]
Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition: Wanglong Wu, Meina Kan, Xin Liu, Yi Yang, Shiguang Shan, Xilin Chen; [ pdf]
Learning Discriminative Aggregation Network for Video-Based Face Recognition: Yongming Rao, Ji Lin, Jiwen Lu, Jie Zhou; [ pdf]
Synergy Between Face Alignment and Tracking via Discriminative Global Consensus Optimization: Muhammad Haris Khan, John McDonagh, Georgios Tzimiropoulos; [ pdf]
SVDNet for Pedestrian Retrieval: Yifan Sun, Liang Zheng, Weijian Deng, Shengjin Wang; [ pdf] [ arXiv]
Towards More Accurate Iris Recognition Using Deeply Learned Spatially Corresponding Features: Zijing Zhao, Ajay Kumar; [ pdf] [ supp]
Semantically Informed Multiview Surface Refinement: Maros Blaha, Mathias Rothermel, Martin R. Oswald, Torsten Sattler, Audrey Richard, Jan D. Wegner, Marc Pollefeys, Konrad Schindler; [ pdf] [ supp] [ arXiv]
BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects Without Using Depth: Mahdi Rad, Vincent Lepetit; [ pdf] [ arXiv]
Modeling Urban Scenes From Pointclouds: William Nguatem, Helmut Mayer; [ pdf] [ supp]
Parameter-Free Lens Distortion Calibration of Central Cameras: Filippo Bergamasco, Luca Cosmo, Andrea Gasparetto, Andrea Albarelli, Andrea Torsello; [ pdf]
Pose Guided RGBD Feature Learning for 3D Object Pose Estimation: Vassileios Balntas, Andreas Doumanoglou, Caner Sahin, Juil Sock, Rigas Kouskouridas, Tae-Kyun Kim; [ pdf]
Efficient Global Illumination for Morphable Models: Andreas Schneider, Sandro Schonborn, Lavrenti Frobeen, Bernhard Egger, Thomas Vetter; [ pdf]
Low Compute and Fully Parallel Computer Vision With HashMatch: Sean Ryan Fanello, Julien Valentin, Adarsh Kowdle, Christoph Rhemann, Vladimir Tankovich, Carlo Ciliberto, Philip Davidson, Shahram Izadi; [ pdf] [ supp]
Dense Non-Rigid Structure-From-Motion and Shading With Unknown Albedos: Mathias Gallardo, Toby Collins, Adrien Bartoli; [ pdf] [ supp]
From Point Clouds to Mesh Using Regression: Lubor Ladicky, Olivier Saurer, SoHyeon Jeong, Fabio Maninchedda, Marc Pollefeys; [ pdf]
Stereo DSO: Large-Scale Direct Sparse Visual Odometry With Stereo Cameras: Rui Wang, Martin Schworer, Daniel Cremers; [ pdf] [ supp] [ arXiv]
Space-Time Localization and Mapping: Minhaeng Lee, Charless C. Fowlkes; [ pdf] [ supp]
Benchmarking Single-Image Reflection Removal Algorithms: Renjie Wan, Boxin Shi, Ling-Yu Duan, Ah-Hwee Tan, Alex C. Kot; [ pdf]
Attention-Aware Deep Reinforcement Learning for Video Face Recognition: Yongming Rao, Jiwen Lu, Jie Zhou; [ pdf]
Learning to Fuse 2D and 3D Image Cues for Monocular Body Pose Estimation: Bugra Tekin, Pablo Marquez-Neila, Mathieu Salzmann, Pascal Fua; [ pdf] [ supp] [ arXiv]
Deep Facial Action Unit Recognition From Partially Labeled Data: Shan Wu, Shangfei Wang, Bowen Pan, Qiang Ji; [ pdf]
Pose-Driven Deep Convolutional Model for Person Re-Identification: Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian; [ pdf] [ arXiv]
Recognition of Action Units in the Wild With Deep Nets and a New Global-Local Loss: C. Fabian Benitez-Quiroz, Yan Wang, Aleix M. Martinez; [ pdf] [ supp]
Faster Than Real-Time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses: Chandrasekhar Bhagavatula, Chenchen Zhu, Khoa Luu, Marios Savvides; [ pdf] [ arXiv]
Towards Large-Pose Face Frontalization in the Wild: Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, Manmohan Chandraker; [ pdf] [ supp] [ arXiv]
A Joint Intrinsic-Extrinsic Prior Model for Retinex: Bolun Cai, Xianming Xu, Kailing Guo, Kui Jia, Bin Hu, Dacheng Tao; [ pdf]
Going Unconstrained With Rolling Shutter Deblurring: Mahesh Mohan M. R., A. N. Rajagopalan, Gunasekaran Seetharaman; [ pdf] [ supp]
A Stagewise Refinement Model for Detecting Salient Objects in Images: Tiantian Wang, Ali Borji, Lihe Zhang, Pingping Zhang, Huchuan Lu; [ pdf]
From Square Pieces to Brick Walls: The Next Challenge in Solving Jigsaw Puzzles: Shir Gur, Ohad Ben-Shahar; [ pdf]
Online Video Deblurring via Dynamic Temporal Blending Network: Tae Hyun Kim, Kyoung Mu Lee, Bernhard Scholkopf, Michael Hirsch; [ pdf] [ arXiv]
Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector: Dingwen Zhang, Junwei Han, Yu Zhang; [ pdf]
Fast Multi-Image Matching via Density-Based Clustering: Roberto Tron, Xiaowei Zhou, Carlos Esteves, Kostas Daniilidis; [ pdf]
Characterizing and Improving Stability in Neural Style Transfer: Agrim Gupta, Justin Johnson, Alexandre Alahi, Li Fei-Fei; [ pdf] [ supp] [ arXiv]
Cross-Modal Deep Variational Hashing: Venice Erin Liong, Jiwen Lu, Yap-Peng Tan, Jie Zhou; [ pdf]
Spatial Memory for Context Reasoning in Object Detection: Xinlei Chen, Abhinav Gupta; [ pdf] [ arXiv]
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval: Yuming Shen, Li Liu, Ling Shao, Jingkuan Song; [ pdf] [ arXiv]
Learning a Recurrent Residual Fusion Network for Multimodal Matching: Yu Liu, Yanming Guo, Erwin M. Bakker, Michael S. Lew; [ pdf]
Rotational Subgroup Voting and Pose Clustering for Robust 3D Object Recognition: Anders Glent Buch, Lilita Kiforenko, Dirk Kraft; [ pdf] [ arXiv]
CoupleNet: Coupling Global Structure With Local Parts for Object Detection: Yousong Zhu, Chaoyang Zhao, Jinqiao Wang, Xu Zhao, Yi Wu, Hanqing Lu; [ pdf] [ arXiv]
Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training: Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, Bernt Schiele; [ pdf] [ supp] [ arXiv]
Drone-Based Object Counting by Spatially Regularized Regional Proposal Network: Meng-Ru Hsieh, Yen-Liang Lin, Winston H. Hsu; [ pdf] [ arXiv]
BlitzNet: A Real-Time Deep Network for Scene Understanding: Nikita Dvornik, Konstantin Shmelkov, Julien Mairal, Cordelia Schmid; [ pdf] [ supp]
Joint Learning of Object and Action Detectors: Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, Cordelia Schmid; [ pdf]
Situation Recognition With Graph Neural Networks: Ruiyu Li, Makarand Tapaswi, Renjie Liao, Jiaya Jia, Raquel Urtasun, Sanja Fidler; [ pdf] [ supp] [ arXiv]
Learning Visual N-Grams From Web Data: Ang Li, Allan Jabri, Armand Joulin, Laurens van der Maaten; [ pdf] [ supp] [ arXiv]
Attention-Based Multimodal Fusion for Video Description: Chiori Hori, Takaaki Hori, Teng-Yok Lee, Ziming Zhang, Bret Harsham, John R. Hershey, Tim K. Marks, Kazuhiko Sumi; [ pdf] [ supp] [ arXiv]
Learning the Latent "Look": Unsupervised Discovery of a Style-Coherent Embedding From Fashion Images: Wei-Lin Hsiao, Kristen Grauman; [ pdf] [ arXiv]
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks: Tanmay Gupta, Kevin Shih, Saurabh Singh, Derek Hoiem; [ pdf] [ arXiv]
Learning Discriminative Latent Attributes for Zero-Shot Classification: Huajie Jiang, Ruiping Wang, Shiguang Shan, Yi Yang, Xilin Chen; [ pdf] [ supp]
PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN: Hanwang Zhang, Zawlin Kyaw, Jinyang Yu, Shih-Fu Chang; [ pdf]
Higher-Order Minimum Cost Lifted Multicuts for Motion Segmentation: Margret Keuper; [ pdf] [ arXiv]
Deep Free-Form Deformation Network for Object-Mask Registration: Haoyang Zhang, Xuming He; [ pdf]
Region-Based Correspondence Between 3D Shapes via Spatially Smooth Biclustering: Matteo Denitto, Simone Melzi, Manuele Bicego, Umberto Castellani, Alessandro Farinelli, Mario A. T. Figueiredo, Yanir Kleiman, Maks Ovsjanikov; [ pdf]
Learning Discriminative αβ-Divergences for Positive Definite Matrices: Anoop Cherian, Panagiotis Stanitsas, Mehrtash Harandi, Vassilios Morellas, Nikolaos Papanikolopoulos; [ pdf]
Consensus Convolutional Sparse Coding: Biswarup Choudhury, Robin Swanson, Felix Heide, Gordon Wetzstein, Wolfgang Heidrich; [ pdf] [ supp]
Domain-Adaptive Deep Network Compression: Marc Masana, Joost van de Weijer, Luis Herranz, Andrew D. Bagdanov, Jose M. Alvarez; [ pdf] [ arXiv]
Self-Supervised Learning of Pose Embeddings From Spatiotemporal Relations in Videos: Omer Sumer, Tobias Dencker, Bjorn Ommer; [ pdf] [ supp] [ arXiv]
Approximate Grassmannian Intersections: Subspace-Valued Subspace Learning: Calvin Murdock, Fernando De la Torre; [ pdf]
Side Information in Robust Principal Component Analysis: Algorithms and Applications: Niannan Xue, Yannis Panagakis, Stefanos Zafeiriou; [ pdf] [ supp] [ arXiv]
Summarization and Classification of Wearable Camera Streams by Learning the Distributions Over Deep Features of Out-Of-Sample Image Sequences: Alessandro Perina, Sadegh Mohammadi, Nebojsa Jojic, Vittorio Murino; [ pdf] [ supp]
Unsupervised Learning From Video to Detect Foreground Objects in Single Images: Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu; [ pdf] [ arXiv]
Supplementary Meta-Learning: Towards a Dynamic Model for Deep Neural Networks: Feihu Zhang, Benjamin W. Wah; [ pdf]
Adversarial Inverse Graphics Networks: Learning 2D-To-3D Lifting and Image-To-Image Translation From Unpaired Supervision: Hsiao-Yu Fish Tung, Adam W. Harley, William Seto, Katerina Fragkiadaki; [ pdf] [ supp]
Active Learning for Human Pose Estimation: Buyu Liu, Vittorio Ferrari; [ pdf]
Interleaved Group Convolutions: Ting Zhang, Guo-Jun Qi, Bin Xiao, Jingdong Wang; [ pdf] [ supp]
Learning-Based Cloth Material Recovery From Video: Shan Yang, Junbang Liang, Ming C. Lin; [ pdf]
Unsupervised Video Understanding by Reconciliation of Posture Similarities: Timo Milbich, Miguel Bautista, Ekaterina Sutter, Bjorn Ommer; [ pdf] [ supp]
Action Tubelet Detector for Spatio-Temporal Action Localization: Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, Cordelia Schmid; [ pdf] [ arXiv]
AMTnet: Action-Micro-Tube Regression by End-To-End Trainable Deep Architecture: Suman Saha, Gurkirt Singh, Fabio Cuzzolin; [ pdf] [ supp]
Constrained Convolutional Sparse Coding for Parametric Based Reconstruction of Line Drawings: Sara Shaheen, Lama Affara, Bernard Ghanem; [ pdf] [ supp]
Neural Ctrl-F: Segmentation-Free Query-By-String Word Spotting in Handwritten Manuscript Collections: Tomas Wilkinson, Jonas Lindstrom, Anders Brun; [ pdf]
Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions: Pascal Mettes, Cees G. M. Snoek; [ pdf] [ arXiv]
Semantic Video CNNs Through Representation Warping: Raghudeep Gadde, Varun Jampani, Peter V. Gehler; [ pdf] [ supp] [ arXiv]
Video Frame Synthesis Using Deep Voxel Flow: Ziwei Liu, Raymond A. Yeh, Xiaoou Tang, Yiming Liu, Aseem Agarwala; [ pdf] [ arXiv]
Detail-Revealing Deep Video Super-Resolution: Xin Tao, Hongyun Gao, Renjie Liao, Jue Wang, Jiaya Jia; [ pdf] [ arXiv]
Learning Video Object Segmentation With Visual Memory: Pavel Tokmakov, Karteek Alahari, Cordelia Schmid; [ pdf] [ arXiv]
EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis: Mehdi S. M. Sajjadi, Bernhard Scholkopf, Michael Hirsch; [ pdf] [ supp] [ arXiv]
Makeup-Go: Blind Reversion of Portrait Edit: Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia; [ pdf]
Shadow Detection With Conditional Generative Adversarial Networks: Vu Nguyen, Tomas F. Yago Vicente, Maozheng Zhao, Minh Hoai, Dimitris Samaras; [ pdf]
Learning High Dynamic Range From Outdoor Panoramas: Jinsong Zhang, Jean-Francois Lalonde; [ pdf] [ arXiv]
DCTM: Discrete-Continuous Transformation Matching for Semantic Flow: Seungryong Kim, Dongbo Min, Stephen Lin, Kwanghoon Sohn; [ pdf] [ arXiv]
MemNet: A Persistent Memory Network for Image Restoration: Ying Tai, Jian Yang, Xiaoming Liu, Chunyan Xu; [ pdf] [ arXiv]
Structure-Measure: A New Way to Evaluate Foreground Maps: Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, Ali Borji; [ pdf]
Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting: Donghyeon Cho, Jinsun Park, Tae-Hyun Oh, Yu-Wing Tai, In So Kweon; [ pdf] [ arXiv]
Practical and Efficient Multi-View Matching: Eleonora Maset, Federica Arrigoni, Andrea Fusiello; [ pdf]
Unrolled Memory Inner-Products: An Abstract GPU Operator for Efficient Vision-Related Computations: Yu-Sheng Lin, Wei-Chao Chen, Shao-Yi Chien; [ pdf]
Learning to Push the Limits of Efficient FFT-Based Image Deconvolution: Jakob Kruse, Carsten Rother, Uwe Schmidt; [ pdf] [ supp]
Learning Spread-Out Local Feature Descriptors: Xu Zhang, Felix X. Yu, Sanjiv Kumar, Shih-Fu Chang; [ pdf] [ arXiv]
Visual Odometry for Pixel Processor Arrays: Laurie Bose, Jianing Chen, Stephen J. Carey, Piotr Dudek, Walterio Mayol-Cuevas; [ pdf]
Joint Estimation of Camera Pose, Depth, Deblurring, and Super-Resolution From a Blurred Image Sequence: Haesol Park, Kyoung Mu Lee; [ pdf] [ supp] [ arXiv]
2D-Driven 3D Object Detection in RGB-D Images: Jean Lahoud, Bernard Ghanem; [ pdf] [ supp]
Ray Space Features for Plenoptic Structure-From-Motion: Yingliang Zhang, Peihong Yu, Wei Yang, Yuanxi Ma, Jingyi Yu; [ pdf]
Depth Estimation Using Structured Light Flow -- Analysis of Projected Pattern Flow on an Object's Surface: Ryo Furukawa, Ryusuke Sagawa, Hiroshi Kawasaki; [ pdf] [ supp]
Monocular Dense 3D Reconstruction of a Complex Dynamic Scene From Two Perspective Frames: Suryansh Kumar, Yuchao Dai, Hongdong Li; [ pdf] [ supp] [ arXiv]
Optimal Transformation Estimation With Semantic Cues: Danda Pani Paudel, Adlane Habed, Luc Van Gool; [ pdf]
Dynamics Enhanced Multi-Camera Motion Segmentation From Unsynchronized Videos: Xikang Zhang, Bengisu Ozbay, Mario Sznaier, Octavia Camps; [ pdf]
Taking the Scenic Route to 3D: Optimising Reconstruction From Moving Cameras: Oscar Mendez, Simon Hadfield, Nicolas Pugeault, Richard Bowden; [ pdf] [ supp]
FLaME: Fast Lightweight Mesh Estimation Using Variational Smoothing on Delaunay Graphs: W. Nicholas Greene, Nicholas Roy; [ pdf]
Efficient Algorithms for Moral Lineage Tracing: Markus Rempfler, Jan-Hendrik Lange, Florian Jug, Corinna Blasse, Eugene W. Myers, Bjoern H. Menze, Bjoern Andres; [ pdf] [ supp] [ arXiv]
From RGB to Spectrum for Natural Scenes via Manifold-Based Mapping: Yan Jia, Yinqiang Zheng, Lin Gu, Art Subpa-Asa, Antony Lam, Yoichi Sato, Imari Sato; [ pdf]
DeepFuse: A Deep Unsupervised Approach for Exposure Fusion With Extreme Exposure Image Pairs: K. Ram Prabhakar, V Sai Srikar, R. Venkatesh Babu; [ pdf] [ supp]
Learning Dense Facial Correspondences in Unconstrained Images: Ronald Yu, Shunsuke Saito, Haoxiang Li, Duygu Ceylan, Hao Li; [ pdf] [ supp] [ arXiv]
Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-Identification: Shuangjie Xu, Yu Cheng, Kang Gu, Yang Yang, Shiyu Chang, Pan Zhou; [ pdf] [ arXiv]
Automatic Content-Aware Projection for 360° Videos: Yeong Won Kim, Chang-Ryeol Lee, Dae-Yong Cho, Yong Hoon Kwon, Hyeok-Jae Choi, Kuk-Jin Yoon; [ pdf] [ supp]
Blur-Invariant Deep Learning for Blind-Deblurring: T. M. Nimisha, Akash Kumar Singh, A. N. Rajagopalan; [ pdf] [ supp]
Non-Linear Convolution Filters for CNN-Based Learning: Georgios Zoumpourlis, Alexandros Doumanoglou, Nicholas Vretos, Petros Daras; [ pdf] [ arXiv]
AOD-Net: All-In-One Dehazing Network: Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng; [ pdf]
Simultaneous Detection and Removal of High Altitude Clouds From an Image: Tushar Sandhan, Jin Young Choi; [ pdf]
Understanding Low- and High-Level Contributions to Fixation Prediction: Matthias Kummerer, Thomas S. A. Wallis, Leon A. Gatys, Matthias Bethge; [ pdf] [ supp]
Image Super-Resolution Using Dense Skip Connections: Tong Tong, Gen Li, Xiejie Liu, Qinquan Gao; [ pdf]
Convergence Analysis of MAP Based Blur Kernel Estimation: Sunghyun Cho, Seungyong Lee; [ pdf] [ supp] [ arXiv]
Blob Reconstruction Using Unilateral Second Order Gaussian Kernels With Application to High-ISO Long-Exposure Image Denoising: Gang Wang, Carlos Lopez-Molina, Bernard De Baets; [ pdf]
Deep Generative Adversarial Compression Artifact Removal: Leonardo Galteri, Lorenzo Seidenari, Marco Bertini, Alberto Del Bimbo; [ pdf] [ arXiv]
Online Multi-Object Tracking Using CNN-Based Single Object Tracker With Spatial-Temporal Attention Mechanism: Qi Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang, Bin Liu, Nenghai Yu; [ pdf] [ arXiv]
Mutual Enhancement for Detection of Multiple Logos in Sports Videos: Yuan Liao, Xiaoqing Lu, Chengcui Zhang, Yongtao Wang, Zhi Tang; [ pdf]
Referring Expression Generation and Comprehension via Attributes: Jingyu Liu, Liang Wang, Ming-Hsuan Yang; [ pdf] [ supp]
RoomNet: End-To-End Room Layout Estimation: Chen-Yu Lee, Vijay Badrinarayanan, Tomasz Malisiewicz, Andrew Rabinovich; [ pdf]
SSH: Single Stage Headless Face Detector: Mahyar Najibi, Pouya Samangouei, Rama Chellappa, Larry S. Davis; [ pdf] [ arXiv]
AnnArbor: Approximate Nearest Neighbors Using Arborescence Coding: Artem Babenko, Victor Lempitsky; [ pdf]
Boosting Image Captioning With Attributes: Ting Yao, Yingwei Pan, Yehao Li, Zhaofan Qiu, Tao Mei; [ pdf] [ arXiv]
Learning to Estimate 3D Hand Pose From Single RGB Images: Christian Zimmermann, Thomas Brox; [ pdf] [ supp]
Locally-Transferred Fisher Vectors for Texture Classification: Yang Song, Fan Zhang, Qing Li, Heng Huang, Lauren J. O'Donnell, Weidong Cai; [ pdf]
Object-Level Proposals: Jianxiang Ma, Anlong Ming, Zilong Huang, Xinggang Wang, Yu Zhou; [ pdf]
Extreme Clicking for Efficient Object Annotation: Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari; [ pdf] [ supp] [ arXiv]
WordSup: Exploiting Word Annotations for Character Based Text Detection: Han Hu, Chengquan Zhang, Yuxuan Luo, Yuzhuo Wang, Junyu Han, Errui Ding; [ pdf] [ arXiv]
Illuminating Pedestrians via Simultaneous Detection & Segmentation: Garrick Brazil, Xi Yin, Xiaoming Liu; [ pdf]
Generalized Orderless Pooling Performs Implicit Salient Matching: Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, Erik Rodner; [ pdf] [ supp] [ arXiv]
Exploiting Spatial Structure for Localizing Manipulated Image Regions: Jawadul H. Bappy, Amit K. Roy-Chowdhury, Jason Bunk, Lakshmanan Nataraj, B. S. Manjunath; [ pdf]
RDFNet: RGB-D Multi-Level Residual Feature Fusion for Indoor Semantic Segmentation: Seong-Jin Park, Ki-Sang Hong, Seungyong Lee; [ pdf] [ supp]
The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes: Gerhard Neuhold, Tobias Ollmann, Samuel Rota Bulo, Peter Kontschieder; [ pdf] [ supp]
Self-Organized Text Detection With Minimal Post-Processing via Border Learning: Yue Wu, Prem Natarajan; [ pdf]
Sparse Exact PGA on Riemannian Manifolds: Monami Banerjee, Rudrasis Chakraborty, Baba C. Vemuri; [ pdf]
Tensor RPCA by Bayesian CP Factorization With Complex Noise: Qiong Luo, Zhi Han, Xi'ai Chen, Yao Wang, Deyu Meng, Dong Liang, Yandong Tang; [ pdf] [ supp]
Multimodal Gaussian Process Latent Variable Models With Harmonization: Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian; [ pdf]
Segmentation-Aware Convolutional Networks Using Local Attention Masks: Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos; [ pdf] [ supp] [ arXiv]
Rotation Equivariant Vector Field Networks: Diego Marcos, Michele Volpi, Nikos Komodakis, Devis Tuia; [ pdf] [ arXiv]
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression: Jian-Hao Luo, Jianxin Wu, Weiyao Lin; [ pdf] [ arXiv]
AutoDIAL: Automatic DomaIn Alignment Layers: Fabio Maria Carlucci, Lorenzo Porzi, Barbara Caputo, Elisa Ricci, Samuel Rota Bulo; [ pdf] [ supp] [ arXiv]
Focusing Attention: Towards Accurate Text Recognition in Natural Images: Zhanzhan Cheng, Fan Bai, Yunlu Xu, Gang Zheng, Shiliang Pu, Shuigeng Zhou; [ pdf] [ arXiv]
Unsupervised Object Segmentation in Video by Efficient Selection of Highly Probable Positive Features: Emanuela Haller, Marius Leordeanu; [ pdf] [ arXiv]
Nonparametric Variational Auto-Encoders for Hierarchical Representation Learning: Prasoon Goyal, Zhiting Hu, Xiaodan Liang, Chenyu Wang, Eric P. Xing; [ pdf] [ arXiv]
Dense and Low-Rank Gaussian CRFs Using Deep Embeddings: Siddhartha Chandra, Nicolas Usunier, Iasonas Kokkinos; [ pdf]
A Multimodal Deep Regression Bayesian Network for Affective Video Content Analyses: Quan Gan, Shangfei Wang, Longfei Hao, Qiang Ji; [ pdf]
Moving Object Detection in Time-Lapse or Motion Trigger Image Sequences Using Low-Rank and Invariant Sparse Decomposition: Moein Shakeri, Hong Zhang; [ pdf] [ supp]
A Multilayer-Based Framework for Online Background Subtraction With Freely Moving Cameras: Yizhe Zhu, Ahmed Elgammal; [ pdf] [ supp] [ arXiv]
Dynamic Label Graph Matching for Unsupervised Video Re-Identification: Mang Ye, Andy J. Ma, Liang Zheng, Jiawei Li, Pong C. Yuen; [ pdf] [ arXiv]
Spatiotemporal Modeling for Crowd Counting in Videos: Feng Xiong, Xingjian Shi, Dit-Yan Yeung; [ pdf] [ supp] [ arXiv]
Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning: Tae-Hyun Oh, Kyungdon Joo, Neel Joshi, Baoyuan Wang, In So Kweon, Sing Bing Kang; [ pdf] [ supp]
What Is Around the Camera?: Stamatios Georgoulis, Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Tinne Tuytelaars, Luc Van Gool; [ pdf] [ arXiv]
Weakly-Supervised Learning of Visual Relations: Julia Peyre, Josef Sivic, Ivan Laptev, Cordelia Schmid; [ pdf] [ arXiv]
BIER - Boosting Independent Embeddings Robustly: Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof; [ pdf] [ supp]
3D Graph Neural Networks for RGBD Semantic Segmentation: Xiaojuan Qi, Renjie Liao, Jiaya Jia, Sanja Fidler, Raquel Urtasun; [ pdf]
Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition: Heliang Zheng, Jianlong Fu, Tao Mei, Jiebo Luo; [ pdf]
Learning 3D Object Categories by Looking Around Them: David Novotny, Diane Larlus, Andrea Vedaldi; [ pdf] [ supp] [ arXiv]
Quantitative Evaluation of Confidence Measures in a Machine Learning World: Matteo Poggi, Fabio Tosi, Stefano Mattoccia; [ pdf] [ supp]
Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks: Hui Li, Peng Wang, Chunhua Shen; [ pdf]
DeepSetNet: Predicting Sets With Deep Neural Networks: S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, Ian Reid; [ pdf] [ supp] [ arXiv]
Learning From Video and Text via Large-Scale Discriminative Clustering: Antoine Miech, Jean-Baptiste Alayrac, Piotr Bojanowski, Ivan Laptev, Josef Sivic; [ pdf] [ arXiv]
TALL: Temporal Activity Localization via Language Query: Jiyang Gao, Chen Sun, Zhenheng Yang, Ram Nevatia; [ pdf] [ supp] [ arXiv]
End-To-End Face Detection and Cast Grouping in Movies Using Erdos-Renyi Clustering: SouYoung Jin, Hang Su, Chris Stauffer, Erik Learned-Miller; [ pdf] [ supp]
Active Decision Boundary Annotation With Deep Generative Models: Miriam Huijser, Jan C. van Gemert; [ pdf] [ supp] [ arXiv]
Convolutional Dictionary Learning via Local Processing: Vardan Papyan, Yaniv Romano, Jeremias Sulam, Michael Elad; [ pdf] [ supp] [ arXiv]
Editable Parametric Dense Foliage From 3D Capture: Gaurav Chaurasia, Paul Beardsley; [ pdf]
Refractive Structure-From-Motion Through a Flat Refractive Interface: Francois Chadebecq, Francisco Vasconcelos, George Dwyer, Rene Lacher, Sebastien Ourselin, Tom Vercauteren, Danail Stoyanov; [ pdf]
Submodular Trajectory Optimization for Aerial 3D Scanning: Mike Roberts, Debadeepta Dey, Anh Truong, Sudipta Sinha, Shital Shah, Ashish Kapoor, Pat Hanrahan, Neel Joshi; [ pdf] [ supp] [ arXiv]
Camera Calibration by Global Constraints on the Motion of Silhouettes: Gil Ben-Artzi; [ pdf] [ arXiv]
Deltille Grids for Geometric Camera Calibration: Hyowon Ha, Michal Perdoch, Hatem Alismail, In So Kweon, Yaser Sheikh; [ pdf]
A Lightweight Single-Camera Polarization Compass With Covariance Estimation: Wolfgang Sturzl; [ pdf] [ supp]
Reflectance Capture Using Univariate Sampling of BRDFs: Zhuo Hui, Kalyan Sunkavalli, Joon-Young Lee, Sunil Hadap, Jian Wang, Aswin C. Sankaranarayanan; [ pdf] [ supp]
Estimating Defocus Blur via Rank of Local Patches: Guodong Xu, Yuhui Quan, Hui Ji; [ pdf] [ supp]
RGB-Infrared Cross-Modality Person Re-Identification: Ancong Wu, Wei-Shi Zheng, Hong-Xing Yu, Shaogang Gong, Jianhuang Lai; [ pdf] [ supp]
Intrinsic 3D Dynamic Surface Tracking Based on Dynamic Ricci Flow and Teichmuller Map: Xiaokang Yu, Na Lei, Yalin Wang, Xianfeng Gu; [ pdf]
Multi-Scale Deep Learning Architectures for Person Re-Identification: Xuelin Qian, Yanwei Fu, Yu-Gang Jiang, Tao Xiang, Xiangyang Xue; [ pdf] [ supp] [ arXiv]
Range Loss for Deep Face Recognition With Long-Tailed Training Data: Xiao Zhang, Zhiyuan Fang, Yandong Wen, Zhifeng Li, Yu Qiao; [ pdf]
Face Sketch Matching via Coupled Deep Transform Learning: Shruti Nagpal, Maneet Singh, Richa Singh, Mayank Vatsa, Afzel Noore, Angshul Majumdar; [ pdf]
Realistic Dynamic Facial Textures From a Single Image Using GANs: Kyle Olszewski, Zimo Li, Chao Yang, Yi Zhou, Ronald Yu, Zeng Huang, Sitao Xiang, Shunsuke Saito, Pushmeet Kohli, Hao Li; [ pdf] [ supp]
Pixel Recursive Super Resolution: Ryan Dahl, Mohammad Norouzi, Jonathon Shlens; [ pdf] [ supp] [ arXiv]
PanNet: A Deep Network Architecture for Pan-Sharpening: Junfeng Yang, Xueyang Fu, Yuwen Hu, Yue Huang, Xinghao Ding, John Paisley; [ pdf]
Recurrent Color Constancy: Yanlin Qian, Ke Chen, Jarno Nikkanen, Joni-Kristian Kamarainen, Jiri Matas; [ pdf] [ supp]
Saliency Pattern Detection by Ranking Structured Trees: Lei Zhu, Haibin Ling, Jin Wu, Huiping Deng, Jin Liu; [ pdf]
Monocular Video-Based Trailer Coupler Detection Using Multiplexer Convolutional Neural Network: Yousef Atoum, Joseph Roth, Michael Bliss, Wende Zhang, Xiaoming Liu; [ pdf]
Parallel Tracking and Verifying: A Framework for Real-Time and High Accuracy Visual Tracking: Heng Fan, Haibin Ling; [ pdf] [ supp] [ arXiv]
Non-Rigid Object Tracking via Deformable Patches Using Shape-Preserved KCF and Level Sets: Xin Sun, Ngai-Man Cheung, Hongxun Yao, Yiluan Guo; [ pdf]
A Discriminative View of MRF Pre-Processing Algorithms: Chen Wang, Charles Herrmann, Ramin Zabih; [ pdf] [ supp] [ arXiv]
Offline Handwritten Signature Modeling and Verification Based on Archetypal Analysis: Elias N. Zois, Ilias Theodorakopoulos, George Economou; [ pdf]
Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization: Huseyin Coskun, Felix Achilles, Robert DiPietro, Nassir Navab, Federico Tombari; [ pdf]
Learning Spatio-Temporal Representation With Pseudo-3D Residual Networks: Zhaofan Qiu, Ting Yao, Tao Mei; [ pdf]
Deeper, Broader and Artier Domain Generalization: Da Li, Yongxin Yang, Yi-Zhe Song, Timothy M. Hospedales; [ pdf]
Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval: Jifei Song, Qian Yu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales; [ pdf] [ supp]
Soft-NMS -- Improving Object Detection With One Line of Code: Navaneeth Bodla, Bharat Singh, Rama Chellappa, Larry S. Davis; [ pdf] [ arXiv]
Semantic Jitter: Dense Supervision for Visual Comparisons via Synthetic Images: Aron Yu, Kristen Grauman; [ pdf] [ supp] [ arXiv]
Video Scene Parsing With Predictive Feature Learning: Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan; [ pdf] [ supp] [ arXiv]
Understanding and Mapping Natural Beauty: Scott Workman, Richard Souvenir, Nathan Jacobs; [ pdf] [ supp]
Human Pose Estimation Using Global and Local Normalization: Ke Sun, Cuiling Lan, Junliang Xing, Wenjun Zeng, Dong Liu, Jingdong Wang; [ pdf] [ supp] [ arXiv]
HashNet: Deep Learning to Hash by Continuation: Zhangjie Cao, Mingsheng Long, Jianmin Wang, Philip S. Yu; [ pdf] [ supp] [ arXiv]
Scaling the Scattering Transform: Deep Hybrid Networks: Edouard Oyallon, Eugene Belilovsky, Sergey Zagoruyko; [ pdf] [ supp] [ arXiv]
Flip-Invariant Motion Representation: Takumi Kobayashi; [ pdf] [ supp]
Scene Categorization With Spectral Features: Salman H. Khan, Munawar Hayat, Fatih Porikli; [ pdf]
Image2song: Song Retrieval via Bridging Image Content and Lyric Words: Xuelong Li, Di Hu, Xiaoqiang Lu; [ pdf] [ supp] [ arXiv]
Deep Functional Maps: Structured Prediction for Dense Shape Correspondence: Or Litany, Tal Remez, Emanuele Rodola, Alex Bronstein, Michael Bronstein; [ pdf] [ arXiv]
Training Deep Networks to Be Spatially Sensitive: Nicholas Kolkin, Eli Shechtman, Gregory Shakhnarovich; [ pdf] [ arXiv]
3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds: Fangyu Liu, Shuaipeng Li, Liqiang Zhang, Chenghu Zhou, Rongtian Ye, Yuebin Wang, Jiwen Lu; [ pdf] [ supp] [ arXiv]
Semi Supervised Semantic Segmentation Using Generative Adversarial Network: Nasim Souly, Concetto Spampinato, Mubarak Shah; [ pdf]
Efficient Low Rank Tensor Ring Completion: Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron; [ pdf] [ supp] [ arXiv]
Semantic Image Synthesis via Adversarial Learning: Hao Dong, Simiao Yu, Chao Wu, Yike Guo; [ pdf] [ supp] [ arXiv]
Unified Deep Supervised Domain Adaptation and Generalization: Saeid Motiian, Marco Piccirilli, Donald A. Adjeroh, Gianfranco Doretto; [ pdf] [ arXiv]
Interpretable Transformations With Encoder-Decoder Networks: Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow; [ pdf]
Deep Clustering via Joint Convolutional Autoencoder Embedding and Relative Entropy Minimization: Kamran Ghasedi Dizaji, Amirhossein Herandi, Cheng Deng, Weidong Cai, Heng Huang; [ pdf] [ arXiv]
Deep Scene Image Classification With the MFAFVNet: Yunsheng Li, Mandar Dixit, Nuno Vasconcelos; [ pdf]
Learning Bag-Of-Features Pooling for Deep Convolutional Neural Networks: Nikolaos Passalis, Anastasios Tefas; [ pdf]
Adversarial Examples Detection in Deep Networks With Convolutional Filter Statistics: Xin Li, Fuxin Li; [ pdf] [ arXiv]
Joint Prediction of Activity Labels and Starting Times in Untrimmed Videos: Tahmida Mahmud, Mahmudul Hasan, Amit K. Roy-Chowdhury; [ pdf] [ supp]
R-C3D: Region Convolutional 3D Network for Temporal Activity Detection: Huijuan Xu, Abir Das, Kate Saenko; [ pdf] [ supp] [ arXiv]
Temporal Context Network for Activity Localization in Videos: Xiyang Dai, Bharat Singh, Guyue Zhang, Larry S. Davis, Yan Qiu Chen; [ pdf] [ arXiv]
Localizing Moments in Video With Natural Language: Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, Trevor Darrell, Bryan Russell; [ pdf] [ supp] [ arXiv]
TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal: Hongyuan Zhu, Romain Vial, Shijian Lu; [ pdf]
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos: Rui Hou, Chen Chen, Mubarak Shah; [ pdf]
Learning Action Recognition Model From Depth and Skeleton Videos: Hossein Rahmani, Mohammed Bennamoun; [ pdf]
The "Something Something" Video Database for Learning and Evaluating Visual Common Sense: Raghav Goyal, Samira Ebrahimi Kahou, Vincent Michalski, Joanna Materzynska, Susanne Westphal, Heuna Kim, Valentin Haenel, Ingo Fruend, Peter Yianilos, Moritz Mueller-Freitag, Florian Hoppe, Christian Thurau, Ingo Bax, Roland Memisevic; [ pdf] [ supp] [ arXiv]
GPLAC: Generalizing Vision-Based Robotic Skills Using Weakly Labeled Images: Avi Singh, Larry Yang, Sergey Levine; [ pdf] [ supp] [ arXiv]
Semi-Global Weighted Least Squares in Image Filtering: Wei Liu, Xiaogang Chen, Chuanhua Shen, Zhi Liu, Jie Yang; [ pdf] [ arXiv]
Scale Recovery for Monocular Visual Odometry Using Depth Estimated With Deep Convolutional Neural Fields: Xiaochuan Yin, Xiangwei Wang, Xiaoguo Du, Qijun Chen; [ pdf]
Deep Adaptive Image Clustering: Jianlong Chang, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, Chunhong Pan; [ pdf] [ supp]
One Network to Solve Them All -- Solving Linear Inverse Problems Using Deep Projection Models: J. H. Rick Chang, Chun-Liang Li, Barnabas Poczos, B. V. K. Vijaya Kumar, Aswin C. Sankaranarayanan; [ pdf] [ supp]
Representation Learning by Learning to Count: Mehdi Noroozi, Hamed Pirsiavash, Paolo Favaro; [ pdf] [ arXiv]
StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks: Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris N. Metaxas; [ pdf] [ arXiv]
Unsupervised Learning of Object Landmarks by Factorized Spatial Embeddings: James Thewlis, Hakan Bilen, Andrea Vedaldi; [ pdf] [ supp]
