成为VIP会员查看完整内容
VIP会员码认证
首页
主题
发现
会员
服务
注册
·
登录
0
ICCV 2017 accepted papers
专知出品
ICCV 2017 papers on the web
原文链接:http://openaccess.thecvf.com/ICCV2017.py
Papers
Globally-Optimal Inlier Set Maximisation for Simultaneous Camera Pose and Feature Correspondence
Dylan Campbell, Lars Petersson, Laurent Kneip, Hongdong Li
[
pdf
] [
supp
]
Robust Pseudo Random Fields for Light-Field Stereo Matching
Chao-Tsung Huang
[
pdf
]
A Lightweight Approach for On-The-Fly Reflectance Estimation
Kihwan Kim, Jinwei Gu, Stephen Tyree, Pavlo Molchanov, Matthias Niessner, Jan Kautz
[
pdf
] [
supp
]
Distributed Very Large Scale Bundle Adjustment by Global Camera Consensus
Runze Zhang, Siyu Zhu, Tian Fang, Long Quan
[
pdf
] [
supp
]
Practical Projective Structure From Motion (P2SfM)
Ludovic Magerand, Alessio Del Bue
[
pdf
] [
supp
]
Anticipating Daily Intention Using On-Wrist Motion Triggered Sensing
Tz-Ying Wu, Ting-An Chien, Cheng-Sheng Chan, Chan-Wei Hu, Min Sun
[
pdf
] [
supp
]
Rethinking Reprojection: Closing the Loop for Pose-Aware Shape Reconstruction From a Single Image
Rui Zhu, Hamed Kiani Galoogahi, Chaoyang Wang, Simon Lucey
[
pdf
] [
supp
]
End-To-End Learning of Geometry and Context for Deep Stereo Regression
Alex Kendall, Hayk Martirosyan, Saumitro Dasgupta, Peter Henry, Ryan Kennedy, Abraham Bachrach, Adam Bry
[
pdf
]
Using Sparse Elimination for Solving Minimal Problems in Computer Vision
Janne Heikkila
[
pdf
]
High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference
Xiaoguang Han, Zhen Li, Haibin Huang, Evangelos Kalogerakis, Yizhou Yu
[
pdf
] [
supp
] [
arXiv
]
Temporal Tessellation: A Unified Approach for Video Analysis
Dotan Kaufman, Gil Levi, Tal Hassner, Lior Wolf
[
pdf
] [
arXiv
]
Learning Policies for Adaptive Tracking With Deep Feature Cascades
Chen Huang, Simon Lucey, Deva Ramanan
[
pdf
] [
supp
] [
arXiv
]
Temporal Shape Super-Resolution by Intra-Frame Motion Encoding Using High-Fps Structured Light
Yuki Shiba, Satoshi Ono, Ryo Furukawa, Shinsaku Hiura, Hiroshi Kawasaki
[
pdf
] [
supp
]
Real-Time Monocular Pose Estimation of 3D Objects Using Temporally Consistent Local Color Histograms
Henning Tjaden, Ulrich Schwanecke, Elmar Schomer
[
pdf
] [
supp
]
CAD Priors for Accurate and Flexible Instance Reconstruction
Tolga Birdal, Slobodan Ilic
[
pdf
] [
arXiv
]
Colored Point Cloud Registration Revisited
Jaesik Park, Qian-Yi Zhou, Vladlen Koltun
[
pdf
]
Learning Compact Geometric Features
Marc Khoury, Qian-Yi Zhou, Vladlen Koltun
[
pdf
] [
arXiv
]
Joint Layout Estimation and Global Multi-View Registration for Indoor Reconstruction
Jeong-Kyun Lee, Jaewon Yea, Min-Gyu Park, Kuk-Jin Yoon
[
pdf
] [
supp
] [
arXiv
]
A Geometric Framework for Statistical Analysis of Trajectories With Distinct Temporal Spans
Rudrasis Chakraborty, Vikas Singh, Nagesh Adluru, Baba C. Vemuri
[
pdf
]
An Optimal Transportation Based Univariate Neuroimaging Index
Liang Mi, Wen Zhang, Junwei Zhang, Yonghui Fan, Dhruman Goradia, Kewei Chen, Eric M. Reiman, Xianfeng Gu, Yalin Wang
[
pdf
]
S3FD: Single Shot Scale-Invariant Face Detector
Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
[
pdf
] [
supp
]
Amulet: Aggregating Multi-Level Convolutional Features for Salient Object Detection
Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, Xiang Ruan
[
pdf
] [
supp
] [
arXiv
]
Learning Uncertain Convolutional Features for Accurate Saliency Detection
Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, Baocai Yin
[
pdf
] [
supp
] [
arXiv
]
Zero-Order Reverse Filtering
Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, Jiaya Jia
[
pdf
] [
arXiv
]
Learning Blind Motion Deblurring
Patrick Wieschollek, Michael Hirsch, Bernhard Scholkopf, Hendrik P. A. Lensch
[
pdf
] [
supp
] [
arXiv
]
Joint Adaptive Sparsity and Low-Rankness on the Fly: An Online Tensor Reconstruction Scheme for Video Denoising
Bihan Wen, Yanjun Li, Luke Pfister, Yoram Bresler
[
pdf
] [
supp
]
Learning to Super-Resolve Blurry Face and Text Images
Xiangyu Xu, Deqing Sun, Jinshan Pan, Yujin Zhang, Hanspeter Pfister, Ming-Hsuan Yang
[
pdf
]
Video Frame Interpolation via Adaptive Separable Convolution
Simon Niklaus, Long Mai, Feng Liu
[
pdf
] [
arXiv
]
Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection
Pierre Baque, Francois Fleuret, Pascal Fua
[
pdf
] [
supp
] [
arXiv
]
Encouraging LSTMs to Anticipate Actions Very Early
Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Basura Fernando, Lars Petersson, Lars Andersson
[
pdf
] [
supp
] [
arXiv
]
PathTrack: Fast Trajectory Annotation With Path Supervision
Santiago Manen, Michael Gygli, Dengxin Dai, Luc Van Gool
[
pdf
] [
supp
] [
arXiv
]
Tracking the Untrackable: Learning to Track Multiple Cues With Long-Term Dependencies
Amir Sadeghian, Alexandre Alahi, Silvio Savarese
[
pdf
] [
arXiv
]
MirrorFlow: Exploiting Symmetries in Joint Optical Flow and Occlusion Estimation
Junhwa Hur, Stefan Roth
[
pdf
] [
supp
] [
arXiv
]
Tracking as Online Decision-Making: Learning a Policy From Streaming Videos With Reinforcement Learning
James Supancic,III, Deva Ramanan
[
pdf
] [
supp
]
Non-Convex Rank/Sparsity Regularization and Local Minima
Carl Olsson, Marcus Carlsson, Fredrik Andersson, Viktor Larsson
[
pdf
] [
supp
] [
arXiv
]
A Revisit of Sparse Coding Based Anomaly Detection in Stacked RNN Framework
Weixin Luo, Wen Liu, Shenghua Gao
[
pdf
]
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
Xihui Liu, Haiyu Zhao, Maoqing Tian, Lu Sheng, Jing Shao, Shuai Yi, Junjie Yan, Xiaogang Wang
[
pdf
]
No Fuss Distance Metric Learning Using Proxies
Yair Movshovitz-Attias, Alexander Toshev, Thomas K. Leung, Sergey Ioffe, Saurabh Singh
[
pdf
] [
arXiv
]
Benchmarking and Error Diagnosis in Multi-Instance Pose Estimation
Matteo Ruggero Ronchi, Pietro Perona
[
pdf
] [
supp
] [
arXiv
]
Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-Identification
Zhongdao Wang, Luming Tang, Xihui Liu, Zhuliang Yao, Shuai Yi, Jing Shao, Junjie Yan, Shengjin Wang, Hongsheng Li, Xiaogang Wang
[
pdf
]
Fashion Forward: Forecasting Visual Style in Fashion
Ziad Al-Halah, Rainer Stiefelhagen, Kristen Grauman
[
pdf
] [
supp
] [
arXiv
]
Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach
Xingyi Zhou, Qixing Huang, Xiao Sun, Xiangyang Xue, Yichen Wei
[
pdf
] [
supp
] [
arXiv
]
Flow-Guided Feature Aggregation for Video Object Detection
Xizhou Zhu, Yujie Wang, Jifeng Dai, Lu Yuan, Yichen Wei
[
pdf
] [
arXiv
]
Reasoning About Fine-Grained Attribute Phrases Using Reference Games
Jong-Chyi Su, Chenyun Wu, Huaizu Jiang, Subhransu Maji
[
pdf
] [
supp
] [
arXiv
]
DeNet: Scalable Real-Time Object Detection With Directed Sparse Sampling
Lachlan Tychsen-Smith, Lars Petersson
[
pdf
] [
supp
] [
arXiv
]
MIHash: Online Hashing With Mutual Information
Fatih Cakir, Kun He, Sarah Adel Bargal, Stan Sclaroff
[
pdf
] [
supp
] [
arXiv
]
SafetyNet: Detecting and Rejecting Adversarial Examples Robustly
Jiajun Lu, Theerasit Issaranon, David Forsyth
[
pdf
] [
supp
] [
arXiv
]
Recurrent Models for Situation Recognition
Arun Mallya, Svetlana Lazebnik
[
pdf
] [
arXiv
]
Multi-Label Image Recognition by Recurrently Discovering Attentional Regions
Zhouxia Wang, Tianshui Chen, Guanbin Li, Ruijia Xu, Liang Lin
[
pdf
]
Deep Determinantal Point Process for Large-Scale Multi-Label Classification
Pengtao Xie, Ruslan Salakhutdinov, Luntian Mou, Eric P. Xing
[
pdf
]
Visual Semantic Planning Using Deep Successor Representations
Yuke Zhu, Daniel Gordon, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi
[
pdf
] [
arXiv
]
Neural Person Search Machines
Hao Liu, Jiashi Feng, Zequn Jie, Karlekar Jayashree, Bo Zhao, Meibin Qi, Jianguo Jiang, Shuicheng Yan
[
pdf
] [
arXiv
]
DualNet: Learn Complementary Features for Image Recognition
Saihui Hou, Xu Liu, Zilei Wang
[
pdf
] [
supp
]
Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization
Sijia Cai, Wangmeng Zuo, Lei Zhang
[
pdf
]
Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner
Tseng-Hung Chen, Yuan-Hong Liao, Ching-Yao Chuang, Wan-Ting Hsu, Jianlong Fu, Min Sun
[
pdf
] [
supp
] [
arXiv
]
Attribute Recognition by Joint Recurrent Learning of Context and Correlation
Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li
[
pdf
] [
arXiv
]
VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization
Saihui Hou, Yushan Feng, Zilei Wang
[
pdf
] [
supp
]
Increasing CNN Robustness to Occlusions by Reducing Filter Support
Elad Osherov, Michael Lindenbaum
[
pdf
]
Exploiting Multi-Grain Ranking Constraints for Precisely Searching Visually-Similar Vehicles
Ke Yan, Yonghong Tian, Yaowei Wang, Wei Zeng, Tiejun Huang
[
pdf
]
Recurrent Scale Approximation for Object Detection in CNN
Yu Liu, Hongyang Li, Junjie Yan, Fangyin Wei, Xiaogang Wang, Xiaoou Tang
[
pdf
] [
arXiv
]
Embedding 3D Geometric Features for Rigid Object Part Segmentation
Yafei Song, Xiaowu Chen, Jia Li, Qinping Zhao
[
pdf
]
Towards Context-Aware Interaction Recognition for Visual Relationship Detection
Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid
[
pdf
]
When Unsupervised Domain Adaptation Meets Tensor Representations
Hao Lu, Lei Zhang, Zhiguo Cao, Wei Wei, Ke Xian, Chunhua Shen, Anton van den Hengel
[
pdf
] [
supp
] [
arXiv
]
Look, Listen and Learn
Relja Arandjelovic, Andrew Zisserman
[
pdf
] [
arXiv
]
Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization
Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra
[
pdf
] [
supp
]
Image-Based Localization Using LSTMs for Structured Feature Correlation
Florian Walch, Caner Hazirbas, Laura Leal-Taixe, Torsten Sattler, Sebastian Hilsenbeck, Daniel Cremers
[
pdf
] [
supp
] [
arXiv
]
Personalized Image Aesthetics
Jian Ren, Xiaohui Shen, Zhe Lin, Radomir Mech, David J. Foran
[
pdf
] [
supp
]
Predicting Deeper Into the Future of Semantic Segmentation
Pauline Luc, Natalia Neverova, Camille Couprie, Jakob Verbeek, Yann LeCun
[
pdf
] [
supp
] [
arXiv
]
Coordinating Filters for Faster Deep Neural Networks
Wei Wen, Cong Xu, Chunpeng Wu, Yandan Wang, Yiran Chen, Hai Li
[
pdf
] [
arXiv
]
Unsupervised Representation Learning by Sorting Sequences
Hsin-Ying Lee, Jia-Bin Huang, Maneesh Singh, Ming-Hsuan Yang
[
pdf
] [
arXiv
]
A Read-Write Memory Network for Movie Story Understanding
Seil Na, Sangho Lee, Jisung Kim, Gunhee Kim
[
pdf
] [
supp
]
SegFlow: Joint Learning for Video Object Segmentation and Optical Flow
Jingchun Cheng, Yi-Hsuan Tsai, Shengjin Wang, Ming-Hsuan Yang
[
pdf
] [
supp
] [
arXiv
]
Unsupervised Action Discovery and Localization in Videos
Khurram Soomro, Mubarak Shah
[
pdf
]
Dense-Captioning Events in Videos
Ranjay Krishna, Kenji Hata, Frederic Ren, Li Fei-Fei, Juan Carlos Niebles
[
pdf
] [
supp
] [
arXiv
]
Learning Long-Term Dependencies for Action Recognition With a Biologically-Inspired Deep Network
Yemin Shi, Yonghong Tian, Yaowei Wang, Wei Zeng, Tiejun Huang
[
pdf
] [
arXiv
]
Compressive Quantization for Fast Object Instance Search in Videos
Tan Yu, Zhenzhen Wang, Junsong Yuan
[
pdf
]
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos
Hehe Fan, Xiaojun Chang, De Cheng, Yi Yang, Dong Xu, Alexander G. Hauptmann
[
pdf
]
Deep Direct Regression for Multi-Oriented Scene Text Detection
Wenhao He, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu
[
pdf
] [
arXiv
]
Open Set Domain Adaptation
Pau Panareda Busto, Juergen Gall
[
pdf
] [
supp
]
Deformable Convolutional Networks
Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei
[
pdf
] [
arXiv
]
Ensemble Diffusion for Retrieval
Song Bai, Zhichao Zhou, Jingdong Wang, Xiang Bai, Longin Jan Latecki, Qi Tian
[
pdf
] [
supp
]
FoveaNet: Perspective-Aware Urban Scene Parsing
Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng
[
pdf
] [
supp
] [
arXiv
]
Beyond Planar Symmetry: Modeling Human Perception of Reflection and Rotation Symmetries in the Wild
Christopher Funk, Yanxi Liu
[
pdf
] [
arXiv
]
Learning to Reason: End-To-End Module Networks for Visual Question Answering
Ronghang Hu, Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Kate Saenko
[
pdf
] [
supp
]
Hard-Aware Deeply Cascaded Embedding
Yuhui Yuan, Kuiyuan Yang, Chao Zhang
[
pdf
] [
arXiv
]
Query-Guided Regression Network With Context Policy for Phrase Grounding
Kan Chen, Rama Kovvuri, Ram Nevatia
[
pdf
] [
arXiv
]
SUBIC: A Supervised, Structured Binary Code for Image Search
Himalaya Jain, Joaquin Zepeda, Patrick Perez, Remi Gribonval
[
pdf
] [
arXiv
]
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta
[
pdf
] [
supp
] [
arXiv
]
A Generative Model of People in Clothing
Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler
[
pdf
] [
supp
] [
arXiv
]
Escape From Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models
Roman Klokov, Victor Lempitsky
[
pdf
] [
arXiv
]
Improved Image Captioning via Policy Gradient Optimization of SPIDEr
Siqi Liu, Zhenhai Zhu, Ning Ye, Sergio Guadarrama, Kevin Murphy
[
pdf
] [
arXiv
]
Rolling Shutter Correction in Manhattan World
Pulak Purkait, Christopher Zach, Ales Leonardis
[
pdf
] [
supp
]
Local-To-Global Point Cloud Registration Using a Dictionary of Viewpoint Descriptors
David Avidar, David Malah, Meir Barzohar
[
pdf
]
3D-PRNN: Generating Shape Primitives With Recurrent Neural Networks
Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem
[
pdf
]
BodyFusion: Real-Time Capture of Human Motion and Surface Geometry Using a Single Depth Camera
Tao Yu, Kaiwen Guo, Feng Xu, Yuan Dong, Zhaoqi Su, Jianhui Zhao, Jianguo Li, Qionghai Dai, Yebin Liu
[
pdf
]
Quasiconvex Plane Sweep for Triangulation With Outliers
Qianggong Zhang, Tat-Jun Chin, David Suter
[
pdf
] [
supp
]
"Maximizing Rigidity" Revisited: A Convex Programming Approach for Generic 3D Shape Reconstruction From Multiple Perspective Views
Pan Ji, Hongdong Li, Yuchao Dai, Ian Reid
[
pdf
] [
arXiv
]
Surface Registration via Foliation
Xiaopeng Zheng, Chengfeng Wen, Na Lei, Ming Ma, Xianfeng Gu
[
pdf
]
Rolling-Shutter-Aware Differential SfM and Image Rectification
Bingbing Zhuang, Loong-Fah Cheong, Gim Hee Lee
[
pdf
] [
supp
]
Corner-Based Geometric Calibration of Multi-Focus Plenoptic Cameras
Sotiris Nousias, Francois Chadebecq, Jonas Pichat, Pearse Keane, Sebastien Ourselin, Christos Bergeles
[
pdf
]
Focal Track: Depth and Accommodation With Oscillating Lens Deformation
Qi Guo, Emma Alexander, Todd Zickler
[
pdf
]
Reconfiguring the Imaging Pipeline for Computer Vision
Mark Buckler, Suren Jayasuriya, Adrian Sampson
[
pdf
] [
supp
] [
arXiv
]
Catadioptric HyperSpectral Light Field Imaging
Yujia Xue, Kang Zhu, Qiang Fu, Xilin Chen, Jingyi Yu
[
pdf
] [
supp
]
Cross-View Asymmetric Metric Learning for Unsupervised Person Re-Identification
Hong-Xing Yu, Ancong Wu, Wei-Shi Zheng
[
pdf
] [
supp
] [
arXiv
]
Real Time Eye Gaze Tracking With 3D Deformable Eye-Face Model
Kang Wang, Qiang Ji
[
pdf
]
Ensemble Deep Learning for Skeleton-Based Action Recognition Using Temporal Sliding LSTM Networks
Inwoong Lee, Doyoung Kim, Seoungyoon Kang, Sanghoon Lee
[
pdf
]
How Far Are We From Solving the 2D & 3D Face Alignment Problem? (And a Dataset of 230,000 3D Facial Landmarks)
Adrian Bulat, Georgios Tzimiropoulos
[
pdf
] [
supp
]
Large Pose 3D Face Reconstruction From a Single Image via Direct Volumetric CNN Regression
Aaron S. Jackson, Adrian Bulat, Vasileios Argyriou, Georgios Tzimiropoulos
[
pdf
] [
supp
] [
arXiv
]
RankIQA: Learning From Rankings for No-Reference Image Quality Assessment
Xialei Liu, Joost van de Weijer, Andrew D. Bagdanov
[
pdf
] [
supp
] [
arXiv
]
Look, Perceive and Segment: Finding the Salient Objects in Images via Two-Stream Fixation-Semantic CNNs
Xiaowu Chen, Anlin Zheng, Jia Li, Feng Lu
[
pdf
]
Delving Into Salient Object Subitizing and Detection
Shengfeng He, Jianbo Jiao, Xiaodan Zhang, Guoqiang Han, Rynson W.H. Lau
[
pdf
]
Learning Discriminative Data Fitting Functions for Blind Image Deblurring
Jinshan Pan, Jiangxin Dong, Yu-Wing Tai, Zhixun Su, Ming-Hsuan Yang
[
pdf
] [
supp
]
Video Deblurring via Semantic Segmentation and Pixel-Wise Non-Linear Kernel
Wenqi Ren, Jinshan Pan, Xiaochun Cao, Ming-Hsuan Yang
[
pdf
] [
arXiv
]
On-Demand Learning for Deep Image Restoration
Ruohan Gao, Kristen Grauman
[
pdf
]
Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising
Jun Xu, Lei Zhang, David Zhang, Xiangchu Feng
[
pdf
] [
supp
] [
arXiv
]
Coherent Online Video Style Transfer
Dongdong Chen, Jing Liao, Lu Yuan, Nenghai Yu, Gang Hua
[
pdf
] [
arXiv
]
SHaPE: A Novel Graph Theoretic Algorithm for Making Consensus-Based Decisions in Person Re-Identification Systems
Arko Barman, Shishir K. Shah
[
pdf
] [
supp
]
Need for Speed: A Benchmark for Higher Frame Rate Object Tracking
Hamed Kiani Galoogahi, Ashton Fagg, Chen Huang, Deva Ramanan, Simon Lucey
[
pdf
] [
supp
] [
arXiv
]
Learning Background-Aware Correlation Filters for Visual Tracking
Hamed Kiani Galoogahi, Ashton Fagg, Simon Lucey
[
pdf
] [
supp
] [
arXiv
]
Robust Object Tracking Based on Temporal and Spatial Deep Networks
Zhu Teng, Junliang Xing, Qiang Wang, Congyan Lang, Songhe Feng, Yi Jin
[
pdf
]
Real-Time Hand Tracking Under Occlusion From an Egocentric RGB-D Sensor
Franziska Mueller, Dushyant Mehta, Oleksandr Sotnychenko, Srinath Sridhar, Dan Casas, Christian Theobalt
[
pdf
] [
supp
]
Predicting Human Activities Using Stochastic Grammar
Siyuan Qi, Siyuan Huang, Ping Wei, Song-Chun Zhu
[
pdf
] [
arXiv
]
ProbFlow: Joint Optical Flow and Uncertainty Estimation
Anne S. Wannenwetsch, Margret Keuper, Stefan Roth
[
pdf
] [
supp
] [
arXiv
]
Sublabel-Accurate Discretization of Nonconvex Free-Discontinuity Problems
Thomas Mollenhoff, Daniel Cremers
[
pdf
] [
supp
] [
arXiv
]
DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding
Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, Jianxiong Xiao
[
pdf
] [
supp
]
BAM! The Behance Artistic Media Dataset for Recognition Beyond Photography
Michael J. Wilber, Chen Fang, Hailin Jin, Aaron Hertzmann, John Collomosse, Serge Belongie
[
pdf
] [
arXiv
]
Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation
Yu Chen, Chunhua Shen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang
[
pdf
] [
arXiv
]
An Empirical Study of Language CNN for Image Captioning
Jiuxiang Gu, Gang Wang, Jianfei Cai, Tsuhan Chen
[
pdf
] [
arXiv
]
Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning
Berkan Demirel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis
[
pdf
] [
arXiv
]
Areas of Attention for Image Captioning
Marco Pedersoli, Thomas Lucas, Cordelia Schmid, Jakob Verbeek
[
pdf
] [
supp
] [
arXiv
]
Generative Modeling of Audible Shapes for Object Perception
Zhoutong Zhang, Jiajun Wu, Qiujia Li, Zhengjia Huang, James Traer, Josh H. McDermott, Joshua B. Tenenbaum, William T. Freeman
[
pdf
]
Scene Graph Generation From Objects, Phrases and Region Captions
Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, Xiaogang Wang
[
pdf
] [
arXiv
]
Recurrent Multimodal Interaction for Referring Image Segmentation
Chenxi Liu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Alan Yuille
[
pdf
] [
supp
] [
arXiv
]
Learning Feature Pyramids for Human Pose Estimation
Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, Xiaogang Wang
[
pdf
] [
supp
] [
arXiv
]
Structured Attentions for Visual Question Answering
Chen Zhu, Yanpeng Zhao, Shuaiyi Huang, Kewei Tu, Yi Ma
[
pdf
] [
arXiv
]
Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
Debidatta Dwibedi, Ishan Misra, Martial Hebert
[
pdf
] [
arXiv
]
Cascaded Feature Network for Semantic Segmentation of RGB-D Images
Di Lin, Guangyong Chen, Daniel Cohen-Or, Pheng-Ann Heng, Hui Huang
[
pdf
]
Encoder Based Lifelong Learning
Amal Rannen, Rahaf Aljundi, Matthew B. Blaschko, Tinne Tuytelaars
[
pdf
] [
supp
] [
arXiv
]
Transitive Invariance for Self-Supervised Visual Representation Learning
Xiaolong Wang, Kaiming He, Abhinav Gupta
[
pdf
] [
arXiv
]
Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction
Stepan Tulyakov, Anton Ivanov, Francois Fleuret
[
pdf
]
Fine-Grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach
Timnit Gebru, Judy Hoffman, Li Fei-Fei
[
pdf
] [
arXiv
]
SORT: Second-Order Response Transform for Visual Recognition
Yan Wang, Lingxi Xie, Chenxi Liu, Siyuan Qiao, Ya Zhang, Wenjun Zhang, Qi Tian, Alan Yuille
[
pdf
] [
arXiv
]
Adversarial Examples for Semantic Segmentation and Object Detection
Cihang Xie, Jianyu Wang, Zhishuai Zhang, Yuyin Zhou, Lingxi Xie, Alan Yuille
[
pdf
] [
supp
] [
arXiv
]
Genetic CNN
Lingxi Xie, Alan Yuille
[
pdf
] [
arXiv
]
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He, Xiangyu Zhang, Jian Sun
[
pdf
] [
arXiv
]
Infinite Latent Feature Selection: A Probabilistic Latent Graph-Based Ranking Approach
Giorgio Roffo, Simone Melzi, Umberto Castellani, Alessandro Vinciarelli
[
pdf
] [
arXiv
]
Video Fill in the Blank Using LR/RL LSTMs With Spatial-Temporal Attentions
Amir Mazaheri, Dong Zhang, Mubarak Shah
[
pdf
] [
arXiv
]
Primary Video Object Segmentation via Complementary CNNs and Neighborhood Reversible Flow
Jia Li, Anlin Zheng, Xiaowu Chen, Bin Zhou
[
pdf
]
Attentive Semantic Video Generation Using Captions
Tanya Marwah, Gaurav Mittal, Vineeth N. Balasubramanian
[
pdf
] [
arXiv
]
Following Gaze in Video
Adria Recasens, Carl Vondrick, Aditya Khosla, Antonio Torralba
[
pdf
]
Adaptive RNN Tree for Large-Scale Human Action Recognition
Wenbo Li, Longyin Wen, Ming-Ching Chang, Ser Nam Lim, Siwei Lyu
[
pdf
]
Spatio-Temporal Person Retrieval via Natural Language Queries
Masataka Yamaguchi, Kuniaki Saito, Yoshitaka Ushiku, Tatsuya Harada
[
pdf
] [
supp
] [
arXiv
]
Automatic Spatially-Aware Fashion Concept Discovery
Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry S. Davis
[
pdf
] [
arXiv
]
ChromaTag: A Colored Marker and Fast Detection Algorithm
Joseph DeGol, Timothy Bretl, Derek Hoiem
[
pdf
] [
supp
] [
arXiv
]
Adversarial Image Perturbation for Privacy Protection -- A Game Theory Perspective
Seong Joon Oh, Mario Fritz, Bernt Schiele
[
pdf
] [
supp
] [
arXiv
]
WeText: Scene Text Detection Under Weak Supervision
Shangxuan Tian, Shijian Lu, Chongshou Li
[
pdf
]
Arbitrary Style Transfer in Real-Time With Adaptive Instance Normalization
Xun Huang, Serge Belongie
[
pdf
] [
supp
] [
arXiv
]
Photographic Image Synthesis With Cascaded Refinement Networks
Qifeng Chen, Vladlen Koltun
[
pdf
] [
arXiv
]
SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again
Wadim Kehl, Fabian Manhardt, Federico Tombari, Slobodan Ilic, Nassir Navab
[
pdf
] [
supp
]
Unsupervised Creation of Parameterized Avatars
Lior Wolf, Yaniv Taigman, Adam Polyak
[
pdf
] [
supp
] [
arXiv
]
Learning for Active 3D Mapping
Karel Zimmermann, Tomas Petricek, Vojtech Salansky, Tomas Svoboda
[
pdf
] [
supp
] [
arXiv
]
Toward Perceptually-Consistent Stereo: A Scanline Study
Jialiang Wang, Daniel Glasner, Todd Zickler
[
pdf
]
Surface Normals in the Wild
Weifeng Chen, Donglai Xiang, Jia Deng
[
pdf
] [
supp
] [
arXiv
]
Unsupervised Learning of Stereo Matching
Chao Zhou, Hong Zhang, Xiaoyong Shen, Jiaya Jia
[
pdf
]
Unrestricted Facial Geometry Reconstruction Using Image-To-Image Translation
Matan Sela, Elad Richardson, Ron Kimmel
[
pdf
] [
supp
]
Learned Multi-Patch Similarity
Wilfried Hartmann, Silvano Galliani, Michal Havlena, Luc Van Gool, Konrad Schindler
[
pdf
] [
supp
] [
arXiv
]
Click Here: Human-Localized Keypoints as Guidance for Viewpoint Estimation
Ryan Szeto, Jason J. Corso
[
pdf
] [
supp
] [
arXiv
]
Unsupervised Adaptation for Deep Stereo
Alessio Tonioni, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano
[
pdf
] [
supp
]
Composite Focus Measure for High Quality Depth Maps
Parikshit Sakurikar, P. J. Narayanan
[
pdf
]
Reconstruction-Based Disentanglement for Pose-Invariant Face Recognition
Xi Peng, Xiang Yu, Kihyuk Sohn, Dimitris N. Metaxas, Manmohan Chandraker
[
pdf
] [
supp
] [
arXiv
]
Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection
Shengtao Xiao, Jiashi Feng, Luoqi Liu, Xuecheng Nie, Wei Wang, Shuicheng Yan, Ashraf Kassim
[
pdf
] [
supp
]
Anchored Regression Networks Applied to Age Estimation and Super Resolution
Eirikur Agustsson, Radu Timofte, Luc Van Gool
[
pdf
]
Infant Footprint Recognition
Eryun Liu
[
pdf
]
Self-Paced Kernel Estimation for Robust Blind Image Deblurring
Dong Gong, Mingkui Tan, Yanning Zhang, Anton van den Hengel, Qinfeng Shi
[
pdf
]
Super-Trajectory for Video Segmentation
Wenguan Wang, Jianbing Shen, Jianwen Xie, Fatih Porikli
[
pdf
] [
arXiv
]
Be Your Own Prada: Fashion Synthesis With Structural Coherence
Shizhan Zhu, Raquel Urtasun, Sanja Fidler, Dahua Lin, Chen Change Loy
[
pdf
]
Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution
Huaibo Huang, Ran He, Zhenan Sun, Tieniu Tan
[
pdf
]
Learning Gaze Transitions From Depth to Improve Video Saliency Estimation
George Leifman, Dmitry Rudoy, Tristan Swedish, Eduardo Bayro-Corrochano, Ramesh Raskar
[
pdf
] [
supp
] [
arXiv
]
Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation
Shuhang Gu, Deyu Meng, Wangmeng Zuo, Lei Zhang
[
pdf
] [
supp
]
Modelling the Scene Dependent Imaging in Cameras With a Deep Neural Network
Seonghyeon Nam, Seon Joo Kim
[
pdf
] [
supp
] [
arXiv
]
Transformed Low-Rank Model for Line Pattern Noise Removal
Yi Chang, Luxin Yan, Sheng Zhong
[
pdf
]
Weakly Supervised Manifold Learning for Dense Semantic Object Correspondence
Utkarsh Gaur, B. S. Manjunath
[
pdf
]
Dual Motion GAN for Future-Flow Embedded Video Prediction
Xiaodan Liang, Lisa Lee, Wei Dai, Eric P. Xing
[
pdf
] [
arXiv
]
Online Robust Image Alignment via Subspace Learning From Gradient Orientations
Qingqing Zheng, Yi Wang, Pheng-Ann Heng
[
pdf
] [
supp
]
Learning Dynamic Siamese Network for Visual Object Tracking
Qing Guo, Wei Feng, Ce Zhou, Rui Huang, Liang Wan, Song Wang
[
pdf
]
High Order Tensor Formulation for Convolutional Sparse Coding
Adel Bibi, Bernard Ghanem
[
pdf
] [
supp
]
Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems
Tim Meinhardt, Michael Moller, Caner Hazirbas, Daniel Cremers
[
pdf
] [
supp
] [
arXiv
]
ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond
Siyuan Qiao, Wei Shen, Weichao Qiu, Chenxi Liu, Alan Yuille
[
pdf
] [
arXiv
]
Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection
Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, Abhinav Gupta
[
pdf
] [
supp
] [
arXiv
]
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, Boqing Gong
[
pdf
] [
arXiv
]
Multi-Modal Factorized Bilinear Pooling With Co-Attention Learning for Visual Question Answering
Zhou Yu, Jun Yu, Jianping Fan, Dacheng Tao
[
pdf
] [
arXiv
]
SCNet: Learning Semantic Correspondence
Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, Minsu Cho, Cordelia Schmid, Jean Ponce
[
pdf
] [
arXiv
]
Soft Proposal Networks for Weakly Supervised Object Localization
Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao
[
pdf
] [
arXiv
]
Class Rectification Hard Mining for Imbalanced Deep Learning
Qi Dong, Shaogang Gong, Xiatian Zhu
[
pdf
]
Generating High-Quality Crowd Density Maps Using Contextual Pyramid CNNs
Vishwanath A. Sindagi, Vishal M. Patel
[
pdf
] [
arXiv
]
See the Glass Half Full: Reasoning About Liquid Containers, Their Volume and Content
Roozbeh Mottaghi, Connor Schenck, Dieter Fox, Ali Farhadi
[
pdf
] [
arXiv
]
Hierarchical Multimodal LSTM for Dense Visual-Semantic Embedding
Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, Gang Hua
[
pdf
]
Identity-Aware Textual-Visual Matching With Latent Co-Attention
Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, Xiaogang Wang
[
pdf
] [
arXiv
]
Learning Deep Neural Networks for Vehicle Re-ID With Visual-Spatio-Temporal Path Proposals
Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang
[
pdf
]
Learning From Noisy Labels With Distillation
Yuncheng Li, Jianchao Yang, Yale Song, Liangliang Cao, Jiebo Luo, Li-Jia Li
[
pdf
] [
arXiv
]
DSOD: Learning Deeply Supervised Object Detectors From Scratch
Zhiqiang Shen, Zhuang Liu, Jianguo Li, Yu-Gang Jiang, Yurong Chen, Xiangyang Xue
[
pdf
] [
arXiv
]
Phrase Localization and Visual Relationship Detection With Comprehensive Image-Language Cues
Bryan A. Plummer, Arun Mallya, Christopher M. Cervantes, Julia Hockenmaier, Svetlana Lazebnik
[
pdf
] [
supp
] [
arXiv
]
Chained Cascade Network for Object Detection
Wanli Ouyang, Kun Wang, Xin Zhu, Xiaogang Wang
[
pdf
]
VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition
Seokju Lee, Junsik Kim, Jae Shin Yoon, Seunghak Shin, Oleksandr Bailo, Namil Kim, Tae-Hee Lee, Hyun Seok Hong, Seung-Hoon Han, In So Kweon
[
pdf
] [
supp
]
Unsupervised Learning of Important Objects From First-Person Videos
Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi
[
pdf
] [
arXiv
]
An Analysis of Visual Question Answering Algorithms
Kushal Kafle, Christopher Kanan
[
pdf
] [
arXiv
]
Visual Relationship Detection With Internal and External Linguistic Knowledge Distillation
Ruichi Yu, Ang Li, Vlad I. Morariu, Larry S. Davis
[
pdf
] [
supp
] [
arXiv
]
A Two Stream Siamese Convolutional Neural Network for Person Re-Identification
Dahjung Chung, Khalid Tahboub, Edward J. Delp
[
pdf
]
No More Discrimination: Cross City Adaptation of Road Scene Segmenters
Yi-Hsin Chen, Wei-Yu Chen, Yu-Ting Chen, Bo-Cheng Tsai, Yu-Chiang Frank Wang, Min Sun
[
pdf
] [
supp
] [
arXiv
]
Open Vocabulary Scene Parsing
Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, Antonio Torralba
[
pdf
] [
supp
] [
arXiv
]
Learned Watershed: End-To-End Learning of Seeded Segmentation
Steffen Wolf, Lukas Schott, Ullrich Kothe, Fred Hamprecht
[
pdf
] [
supp
]
Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes
Yang Zhang, Philip David, Boqing Gong
[
pdf
] [
arXiv
]
Scale-Adaptive Convolutions for Scene Parsing
Rui Zhang, Sheng Tang, Yongdong Zhang, Jintao Li, Shuicheng Yan
[
pdf
]
Privacy-Preserving Visual Learning Using Doubly Permuted Homomorphic Encryption
Ryo Yonetani, Vishnu Naresh Boddeti, Kris M. Kitani, Yoichi Sato
[
pdf
] [
supp
] [
arXiv
]
Multi-Task Self-Supervised Visual Learning
Carl Doersch, Andrew Zisserman
[
pdf
] [
arXiv
]
A Self-Balanced Min-Cut Algorithm for Image Clustering
Xiaojun Chen, Joshua Zhexue Haung, Feiping Nie, Renjie Chen, Qingyao Wu
[
pdf
]
Is Second-Order Information Helpful for Large-Scale Visual Recognition?
Peihua Li, Jiangtao Xie, Qilong Wang, Wangmeng Zuo
[
pdf
] [
arXiv
]
Factorized Bilinear Models for Image Recognition
Yanghao Li, Naiyan Wang, Jiaying Liu, Xiaodi Hou
[
pdf
] [
supp
] [
arXiv
]
Octree Generating Networks: Efficient Convolutional Architectures for High-Resolution 3D Outputs
Maxim Tatarchenko, Alexey Dosovitskiy, Thomas Brox
[
pdf
] [
supp
] [
arXiv
]
Truncating Wide Networks Using Binary Tree Architectures
Yan Zhang, Mete Ozay, Shuohao Li, Takayuki Okatani
[
pdf
] [
supp
] [
arXiv
]
Bringing Background Into the Foreground: Making All Classes Equal in Weakly-Supervised Video Semantic Segmentation
Fatemeh Sadat Saleh, Mohammad Sadegh Aliakbarian, Mathieu Salzmann, Lars Petersson, Jose M. Alvarez
[
pdf
] [
arXiv
]
View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition From Skeleton Data
Pengfei Zhang, Cuiling Lan, Junliang Xing, Wenjun Zeng, Jianru Xue, Nanning Zheng
[
pdf
] [
arXiv
]
Joint Discovery of Object States and Manipulation Actions
Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, Simon Lacoste-Julien
[
pdf
] [
arXiv
]
What Actions Are Needed for Understanding Human Actions in Videos?
Gunnar A. Sigurdsson, Olga Russakovsky, Abhinav Gupta
[
pdf
]
Lattice Long Short-Term Memory for Human Action Recognition
Lin Sun, Kui Jia, Kevin Chen, Dit-Yan Yeung, Bertram E. Shi, Silvio Savarese
[
pdf
] [
arXiv
]
Common Action Discovery and Localization in Unconstrained Videos
Jiong Yang, Junsong Yuan
[
pdf
]
Pixel-Level Matching for Video Object Segmentation Using Convolutional Neural Networks
Jae Shin Yoon, Francois Rameau, Junsik Kim, Seokju Lee, Seunghak Shin, In So Kweon
[
pdf
] [
arXiv
]
Am I a Baller? Basketball Performance Assessment From First-Person Videos
Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi
[
pdf
] [
arXiv
]
Deep Cropping via Attention Box Prediction and Aesthetics Assessment
Wenguan Wang, Jianbing Shen
[
pdf
]
Raster-To-Vector: Revisiting Floorplan Transformation
Chen Liu, Jiajun Wu, Pushmeet Kohli, Yasutaka Furukawa
[
pdf
] [
supp
]
Deep TextSpotter: An End-To-End Trainable Scene Text Localization and Recognition Framework
Michal Busta, Lukas Neumann, Jiri Matas
[
pdf
]
Playing for Benchmarks
Stephan R. Richter, Zeeshan Hayder, Vladlen Koltun
[
pdf
] [
arXiv
]
Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks
Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros
[
pdf
]
GANs for Biological Image Synthesis
Anton Osokin, Anatole Chessel, Rafael E. Carazo Salas, Federico Vaggi
[
pdf
] [
arXiv
]
Learning to Synthesize a 4D RGBD Light Field From a Single Image
Pratul P. Srinivasan, Tongzhou Wang, Ashwin Sreelal, Ravi Ramamoorthi, Ren Ng
[
pdf
] [
supp
] [
arXiv
]
Neural EPI-Volume Networks for Shape From Light Field
Stefan Heber, Wei Yu, Thomas Pock
[
pdf
]
Material Editing Using a Physically Based Rendering Network
Guilin Liu, Duygu Ceylan, Ersin Yumer, Jimei Yang, Jyh-Ming Lien
[
pdf
] [
supp
] [
arXiv
]
Turning Corners Into Cameras: Principles and Methods
Katherine L. Bouman, Vickie Ye, Adam B. Yedidia, Fredo Durand, Gregory W. Wornell, Antonio Torralba, William T. Freeman
[
pdf
]
Linear Differential Constraints for Photo-Polarimetric Height Estimation
Silvia Tozza, William A. P. Smith, Dizhong Zhu, Ravi Ramamoorthi, Edwin R. Hancock
[
pdf
] [
arXiv
]
Polynomial Solvers for Saturated Ideals
Viktor Larsson, Kalle Astrom, Magnus Oskarsson
[
pdf
]
Shape Inpainting Using 3D Generative Adversarial Network and Recurrent Convolutional Networks
Weiyue Wang, Qiangui Huang, Suya You, Chao Yang, Ulrich Neumann
[
pdf
]
SurfaceNet: An End-To-End 3D Neural Network for Multiview Stereopsis
Mengqi Ji, Juergen Gall, Haitian Zheng, Yebin Liu, Lu Fang
[
pdf
]
Making Minimal Solvers for Absolute Pose Estimation Compact and Robust
Viktor Larsson, Zuzana Kukelova, Yinqiang Zheng
[
pdf
] [
supp
]
3D Surface Detail Enhancement From a Single Normal Map
Wuyuan Xie, Miaohui Wang, Xianbiao Qi, Lei Zhang
[
pdf
] [
supp
]
RMPE: Regional Multi-Person Pose Estimation
Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, Cewu Lu
[
pdf
] [
arXiv
]
Online Video Object Detection Using Association LSTM
Yongyi Lu, Cewu Lu, Chi-Keung Tang
[
pdf
]
PolyFit: Polygonal Surface Reconstruction From Point Clouds
Liangliang Nan, Peter Wonka
[
pdf
] [
supp
]
Progressive Large Scale-Invariant Image Matching in Scale Space
Lei Zhou, Siyu Zhu, Tianwei Shen, Jinglu Wang, Tian Fang, Long Quan
[
pdf
]
Efficient Global 2D-3D Matching for Camera Localization in a Large-Scale 3D Map
Liu Liu, Hongdong Li, Yuchao Dai
[
pdf
] [
supp
]
Multi-View Non-Rigid Refinement and Normal Selection for High Quality 3D Reconstruction
Sk. Mohammadul Haque, Venu Madhav Govindu
[
pdf
] [
supp
]
Multi-Stage Multi-Recursive-Input Fully Convolutional Networks for Neuronal Boundary Detection
Wei Shen, Bin Wang, Yuan Jiang, Yan Wang, Alan Yuille
[
pdf
]
Depth and Image Restoration From Light Field in a Scattering Medium
Jiandong Tian, Zachary Murez, Tong Cui, Zhen Zhang, David Kriegman, Ravi Ramamoorthi
[
pdf
]
Video Reflection Removal Through Spatio-Temporal Optimization
Ajay Nandoriya, Mohamed Elgharib, Changil Kim, Mohamed Hefeeda, Wojciech Matusik
[
pdf
]
Efficient Online Local Metric Adaptation via Negative Samples for Person Re-Identification
Jiahuan Zhou, Pei Yu, Wei Tang, Ying Wu
[
pdf
] [
supp
]
Stepwise Metric Promotion for Unsupervised Video Person Re-Identification
Zimo Liu, Dong Wang, Huchuan Lu
[
pdf
]
Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis
Rui Huang, Shu Zhang, Tianyu Li, Ran He
[
pdf
] [
supp
] [
arXiv
]
Group Re-Identification via Unsupervised Transfer of Sparse Features Encoding
Giuseppe Lisanti, Niki Martinel, Alberto Del Bimbo, Gian Luca Foresti
[
pdf
] [
arXiv
]
Visual Transformation Aided Contrastive Learning for Video-Based Kinship Verification
Hamdi Dibeklioglu
[
pdf
]
Decoder Network Over Lightweight Reconstructed Feature for Fast Semantic Style Transfer
Ming Lu, Hao Zhao, Anbang Yao, Feng Xu, Yurong Chen, Li Zhang
[
pdf
]
Blind Image Deblurring With Outlier Handling
Jiangxin Dong, Jinshan Pan, Zhixun Su, Ming-Hsuan Yang
[
pdf
] [
supp
]
Paying Attention to Descriptions Generated by Image Captioning Models
Hamed R. Tavakoli, Rakshith Shetty, Ali Borji, Jorma Laaksonen
[
pdf
] [
arXiv
]
Fast Image Processing With Fully-Convolutional Networks
Qifeng Chen, Jia Xu, Vladlen Koltun
[
pdf
] [
arXiv
]
Robust Video Super-Resolution With Learned Temporal Dynamics
Ding Liu, Zhaowen Wang, Yuchen Fan, Xianming Liu, Zhangyang Wang, Shiyu Chang, Thomas Huang
[
pdf
]
Should We Encode Rain Streaks in Video as Deterministic or Stochastic?
Wei Wei, Lixuan Yi, Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu
[
pdf
] [
supp
]
Joint Bi-Layer Optimization for Single-Image Rain Streak Removal
Lei Zhu, Chi-Wing Fu, Dani Lischinski, Pheng-Ann Heng
[
pdf
] [
supp
]
Low-Dimensionality Calibration Through Local Anisotropic Scaling for Robust Hand Model Personalization
Edoardo Remelli, Anastasia Tkach, Andrea Tagliasacchi, Mark Pauly
[
pdf
] [
supp
]
Non-Markovian Globally Consistent Multi-Object Tracking
Andrii Maksai, Xinchao Wang, Francois Fleuret, Pascal Fua
[
pdf
] [
supp
]
CREST: Convolutional Residual Learning for Visual Tracking
Yibing Song, Chao Ma, Lijun Gong, Jiawei Zhang, Rynson W. H. Lau, Ming-Hsuan Yang
[
pdf
] [
arXiv
]
Volumetric Flow Estimation for Incompressible Fluids Using the Stationary Stokes Equations
Katrin Lasinger, Christoph Vogel, Konrad Schindler
[
pdf
] [
supp
]
Bounding Boxes, Segmentations and Object Coordinates: How Important Is Recognition for 3D Scene Flow Estimation in Autonomous Driving Scenarios?
Aseem Behl, Omid Hosseini Jafari, Siva Karthik Mustikovela, Hassan Abu Alhaija, Carsten Rother, Andreas Geiger
[
pdf
] [
supp
]
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, Wen Gao
[
pdf
] [
arXiv
]
Deep Metric Learning With Angular Loss
Jian Wang, Feng Zhou, Shilei Wen, Xiao Liu, Yuanqing Lin
[
pdf
] [
arXiv
]
Compositional Human Pose Regression
Xiao Sun, Jiaxiang Shang, Shuang Liang, Yichen Wei
[
pdf
] [
arXiv
]
MUTAN: Multimodal Tucker Fusion for Visual Question Answering
Hedi Ben-younes, Remi Cadene, Matthieu Cord, Nicolas Thome
[
pdf
] [
arXiv
]
Revisiting IM2GPS in the Deep Learning Era
Nam Vo, Nathan Jacobs, James Hays
[
pdf
] [
supp
] [
arXiv
]
Scene Parsing With Global Context Embedding
Wei-Chih Hung, Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang
[
pdf
] [
supp
]
A Simple yet Effective Baseline for 3D Human Pose Estimation
Julieta Martinez, Rayat Hossain, Javier Romero, James J. Little
[
pdf
] [
arXiv
]
Dual-Glance Model for Deciphering Social Relationships
Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli
[
pdf
]
Sketching With Style: Visual Search With Sketches and Aesthetic Context
John Collomosse, Tu Bui, Michael J. Wilber, Chen Fang, Hailin Jin
[
pdf
] [
supp
]
Point Set Registration With Global-Local Correspondence and Transformation Estimation
Su Zhang, Yang Yang, Kun Yang, Yi Luo, Sim-Heng Ong
[
pdf
]
SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-Training on Indoor Segmentation?
John McCormac, Ankur Handa, Stefan Leutenegger, Andrew J. Davison
[
pdf
]
A Unified Model for Near and Remote Sensing
Scott Workman, Menghua Zhai, David J. Crandall, Nathan Jacobs
[
pdf
] [
supp
] [
arXiv
]
Directionally Convolutional Networks for 3D Shape Segmentation
Haotian Xu, Ming Dong, Zichun Zhong
[
pdf
]
AMAT: Medial Axis Transform for Natural Images
Stavros Tsogkas, Sven Dickinson
[
pdf
] [
supp
] [
arXiv
]
Deep Dual Learning for Semantic Image Segmentation
Ping Luo, Guangrun Wang, Liang Lin, Xiaogang Wang
[
pdf
]
Regional Interactive Image Segmentation Networks
Jun Hao Liew, Yunchao Wei, Wei Xiong, Sim-Heng Ong, Jiashi Feng
[
pdf
] [
supp
]
Learning Efficient Convolutional Networks Through Network Slimming
Zhuang Liu, Jianguo Li, Zhiqiang Shen, Gao Huang, Shoumeng Yan, Changshui Zhang
[
pdf
] [
supp
] [
arXiv
]
CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training
Jianmin Bao, Dong Chen, Fang Wen, Houqiang Li, Gang Hua
[
pdf
] [
supp
]
Universal Adversarial Perturbations Against Semantic Image Segmentation
Jan Hendrik Metzen, Mummadi Chaithanya Kumar, Thomas Brox, Volker Fischer
[
pdf
] [
supp
] [
arXiv
]
Associative Domain Adaptation
Philip Haeusser, Thomas Frerix, Alexander Mordvintsev, Daniel Cremers
[
pdf
] [
supp
] [
arXiv
]
Introspective Neural Networks for Generative Modeling
Justin Lazarow, Long Jin, Zhuowen Tu
[
pdf
]
Towards a Unified Compositional Model for Visual Pattern Modeling
Wei Tang, Pei Yu, Jiahuan Zhou, Ying Wu
[
pdf
] [
supp
]
Least Squares Generative Adversarial Networks
Xudong Mao, Qing Li, Haoran Xie, Raymond Y.K. Lau, Zhen Wang, Stephen Paul Smolley
[
pdf
] [
supp
] [
arXiv
]
Centered Weight Normalization in Accelerating Training of Deep Neural Networks
Lei Huang, Xianglong Liu, Yang Liu, Bo Lang, Dacheng Tao
[
pdf
] [
supp
]
Deep Growing Learning
Guangcong Wang, Xiaohua Xie, Jianhuang Lai, Jiaxuan Zhuo
[
pdf
]
Smart Mining for Deep Metric Learning
Ben Harwood, Vijay Kumar B G, Gustavo Carneiro, Ian Reid, Tom Drummond
[
pdf
] [
supp
] [
arXiv
]
Temporal Generative Adversarial Nets With Singular Value Clipping
Masaki Saito, Eiichi Matsumoto, Shunta Saito
[
pdf
] [
arXiv
]
Sampling Matters in Deep Embedding Learning
Chao-Yuan Wu, R. Manmatha, Alexander J. Smola, Philipp Krahenbuhl
[
pdf
] [
supp
] [
arXiv
]
DualGAN: Unsupervised Dual Learning for Image-To-Image Translation
Zili Yi, Hao Zhang, Ping Tan, Minglun Gong
[
pdf
]
Learning View-Invariant Features for Person Identification in Temporally Synchronized Videos Taken by Wearable Cameras
Kang Zheng, Xiaochuan Fan, Yuewei Lin, Hao Guo, Hongkai Yu, Dazhou Guo, Song Wang
[
pdf
]
MarioQA: Answering Questions by Watching Gameplay Videos
Jonghwan Mun, Paul Hongsuck Seo, Ilchae Jung, Bohyung Han
[
pdf
] [
supp
] [
arXiv
]
SBGAR: Semantics Based Group Activity Recognition
Xin Li, Mooi Choo Chuah
[
pdf
]
Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video
Davide Moltisanti, Michael Wray, Walterio Mayol-Cuevas, Dima Damen
[
pdf
] [
supp
] [
arXiv
]
Unmasking the Abnormal Events in Video
Radu Tudor Ionescu, Sorina Smeureanu, Bogdan Alexe, Marius Popescu
[
pdf
] [
arXiv
]
Chained Multi-Stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection
Mohammadreza Zolfaghari, Gabriel L. Oliveira, Nima Sedaghat, Thomas Brox
[
pdf
] [
supp
] [
arXiv
]
Temporal Action Detection With Structured Segment Networks
Yue Zhao, Yuanjun Xiong, Limin Wang, Zhirong Wu, Xiaoou Tang, Dahua Lin
[
pdf
] [
supp
] [
arXiv
]
Jointly Recognizing Object Fluents and Tasks in Egocentric Videos
Yang Liu, Ping Wei, Song-Chun Zhu
[
pdf
]
Transferring Objects: Joint Inference of Container and Human Pose
Hanqing Wang, Wei Liang, Lap-Fai Yu
[
pdf
]
Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention
Jinkyu Kim, John Canny
[
pdf
] [
arXiv
]
Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning
Abhishek Das, Satwik Kottur, Jose M. F. Moura, Stefan Lee, Dhruv Batra
[
pdf
] [
supp
] [
arXiv
]
Mask R-CNN
Kaiming He, Georgia Gkioxari, Piotr Dollar, Ross Girshick
[
pdf
] [
arXiv
]
Towards Diverse and Natural Image Descriptions via a Conditional GAN
Bo Dai, Sanja Fidler, Raquel Urtasun, Dahua Lin
[
pdf
] [
arXiv
]
Focal Loss for Dense Object Detection
Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollar
[
pdf
] [
arXiv
]
Inferring and Executing Programs for Visual Reasoning
Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Judy Hoffman, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick
[
pdf
] [
supp
] [
arXiv
]
Visual Forecasting by Imitating Dynamics in Natural Sequences
Kuo-Hao Zeng, William B. Shen, De-An Huang, Min Sun, Juan Carlos Niebles
[
pdf
] [
supp
] [
arXiv
]
TorontoCity: Seeing the World With a Million Eyes
Shenlong Wang, Min Bai, Gellert Mattyus, Hang Chu, Wenjie Luo, Bin Yang, Justin Liang, Joel Cheverie, Sanja Fidler, Raquel Urtasun
[
pdf
] [
arXiv
]
Low-Shot Visual Recognition by Shrinking and Hallucinating Features
Bharath Hariharan, Ross Girshick
[
pdf
] [
supp
] [
arXiv
]
A Coarse-Fine Network for Keypoint Localization
Shaoli Huang, Mingming Gong, Dacheng Tao
[
pdf
]
Detect to Track and Track to Detect
Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman
[
pdf
]
Single Shot Text Detector With Regional Attention
Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, Xiaolin Li
[
pdf
] [
arXiv
]
SubUNets: End-To-End Hand Shape and Continuous Sign Language Recognition
Necati Cihan Camgoz, Simon Hadfield, Oscar Koller, Richard Bowden
[
pdf
]
A Spatiotemporal Oriented Energy Network for Dynamic Texture Recognition
Isma Hadji, Richard P. Wildes
[
pdf
] [
supp
] [
arXiv
]
Probabilistic Structure From Motion With Objects (PSfMO)
Paul Gay, Cosimo Rubino, Vaibhav Bansal, Alessio Del Bue
[
pdf
]
A 3D Morphable Model of Craniofacial Shape and Texture Variation
Hang Dai, Nick Pears, William A. P. Smith, Christian Duncan
[
pdf
]
Multi-View Dynamic Shape Refinement Using Local Temporal Integration
Vincent Leroy, Jean-Sebastien Franco, Edmond Boyer
[
pdf
]
Learning Hand Articulations by Hallucinating Heat Distribution
Chiho Choi, Sangpil Kim, Karthik Ramani
[
pdf
] [
supp
]
Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization With Spatially-Varying Lighting
Robert Maier, Kihwan Kim, Daniel Cremers, Jan Kautz, Matthias Niessner
[
pdf
] [
supp
] [
arXiv
]
Robust Hand Pose Estimation During the Interaction With an Unknown Object
Chiho Choi, Sang Ho Yoon, Chin-Ning Chen, Karthik Ramani
[
pdf
] [
supp
]
Detailed Surface Geometry and Albedo Recovery From RGB-D Video Under Natural Illumination
Xinxin Zuo, Sen Wang, Jiangbin Zheng, Ruigang Yang
[
pdf
] [
arXiv
]
Monocular Free-Head 3D Gaze Tracking With Deep Learning and Geometry Constraints
Wangjiang Zhu, Haoping Deng
[
pdf
]
Filter Selection for Hyperspectral Estimation
Boaz Arad, Ohad Ben-Shahar
[
pdf
]
A Microfacet-Based Reflectance Model for Photometric Stereo With Highly Specular Surfaces
Lixiong Chen, Yinqiang Zheng, Boxin Shi, Art Subpa-Asa, Imari Sato
[
pdf
]
Detecting Faces Using Inside Cascaded Contextual CNN
Kaipeng Zhang, Zhanpeng Zhang, Hao Wang, Zhifeng Li, Yu Qiao, Wei Liu
[
pdf
]
A Novel Space-Time Representation on the Positive Semidefinite Cone for Facial Expression Recognition
Anis Kacem, Mohamed Daoudi, Boulbaba Ben Amor, Juan Carlos Alvarez-Paiva
[
pdf
]
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding
Dieu Linh Tran, Robert Walecki, Ognjen (Oggi) Rudovic, Stefanos Eleftheriadis, Bjorn Schuller, Maja Pantic
[
pdf
] [
arXiv
]
Pose-Invariant Face Alignment With a Single CNN
Amin Jourabloo, Mao Ye, Xiaoming Liu, Liu Ren
[
pdf
] [
arXiv
]
Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
Kihyuk Sohn, Sifei Liu, Guangyu Zhong, Xiang Yu, Ming-Hsuan Yang, Manmohan Chandraker
[
pdf
] [
supp
] [
arXiv
]
Deeply-Learned Part-Aligned Representations for Person Re-Identification
Liming Zhao, Xi Li, Yueting Zhuang, Jingdong Wang
[
pdf
] [
arXiv
]
Semantic Line Detection and Its Applications
Jun-Tae Lee, Han-Ul Kim, Chul Lee, Chang-Su Kim
[
pdf
]
A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing
Qingnan Fan, Jiaolong Yang, Gang Hua, Baoquan Chen, David Wipf
[
pdf
] [
supp
] [
arXiv
]
Revisiting Cross-Channel Information Transfer for Chromatic Aberration Correction
Tiancheng Sun, Yifan Peng, Wolfgang Heidrich
[
pdf
] [
supp
]
High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits
Xiaoyong Shen, Hongyun Gao, Xin Tao, Chao Zhou, Jiaya Jia
[
pdf
] [
arXiv
]
Learning Visual Attention to Identify People With Autism Spectrum Disorder
Ming Jiang, Qi Zhao
[
pdf
]
DSLR-Quality Photos on Mobile Devices With Deep Convolutional Networks
Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc Van Gool
[
pdf
] [
supp
] [
arXiv
]
Non-Uniform Blind Deblurring by Reblurring
Yuval Bahat, Netalee Efrat, Michal Irani
[
pdf
]
Misalignment-Robust Joint Filter for Cross-Modal Image Pairs
Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi
[
pdf
] [
supp
]
Low-Rank Tensor Completion: A Pseudo-Bayesian Learning Approach
Wei Chen, Nan Song
[
pdf
]
DeepCD: Learning Deep Complementary Descriptors for Patch Representations
Tsun-Yi Yang, Jo-Han Hsu, Yen-Yu Lin, Yung-Yu Chuang
[
pdf
]
Beyond Standard Benchmarks: Parameterizing Performance Evaluation in Visual Object Tracking
Luka Cehovin Zajc, Alan Lukezic, Ales Leonardis, Matej Kristan
[
pdf
]
The Pose Knows: Video Forecasting by Generating Pose Futures
Jacob Walker, Kenneth Marino, Abhinav Gupta, Martial Hebert
[
pdf
] [
arXiv
]
What Will Happen Next? Forecasting Player Moves in Sports Videos
Panna Felsen, Pulkit Agrawal, Jitendra Malik
[
pdf
]
Robust Kronecker-Decomposable Component Analysis for Low-Rank Modeling
Mehdi Bahri, Yannis Panagakis, Stefanos Zafeiriou
[
pdf
] [
arXiv
]
Recurrent Topic-Transition GAN for Visual Paragraph Generation
Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, Eric P. Xing
[
pdf
] [
arXiv
]
A Two-Streamed Network for Estimating Fine-Scaled Depth Maps From Single RGB Images
Jun Li, Reinhard Klein, Angela Yao
[
pdf
] [
supp
] [
arXiv
]
Weakly Supervised Object Localization Using Things and Stuff Transfer
Miaojing Shi, Holger Caesar, Vittorio Ferrari
[
pdf
] [
supp
] [
arXiv
]
Single Image Action Recognition Using Semantic Body Part Actions
Zhichen Zhao, Huimin Ma, Shaodi You
[
pdf
] [
arXiv
]
Incremental Learning of Object Detectors Without Catastrophic Forgetting
Konstantin Shmelkov, Cordelia Schmid, Karteek Alahari
[
pdf
] [
arXiv
]
Generative Adversarial Networks Conditioned by Brain Signals
Simone Palazzo, Concetto Spampinato, Isaak Kavasidis, Daniela Giordano, Mubarak Shah
[
pdf
]
Learning to Disambiguate by Asking Discriminative Questions
Yining Li, Chen Huang, Xiaoou Tang, Chen Change Loy
[
pdf
] [
supp
] [
arXiv
]
Interpretable Explanations of Black Boxes by Meaningful Perturbation
Ruth C. Fong, Andrea Vedaldi
[
pdf
] [
supp
] [
arXiv
]
DeepRoadMapper: Extracting Road Topology From Aerial Images
Gellert Mattyus, Wenjie Luo, Raquel Urtasun
[
pdf
]
Monocular 3D Human Pose Estimation by Predicting Depth on Joints
Bruce Xiaohan Nie, Ping Wei, Song-Chun Zhu
[
pdf
]
Large-Scale Image Retrieval With Attentive Deep Local Features
Hyeonwoo Noh, Andre Araujo, Jack Sim, Tobias Weyand, Bohyung Han
[
pdf
] [
arXiv
]
Deep Globally Constrained MRFs for Human Pose Estimation
Ioannis Marras, Petar Palasek, Ioannis Patras
[
pdf
]
Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning
Soravit Changpinyo, Wei-Lun Chao, Fei Sha
[
pdf
] [
supp
] [
arXiv
]
Multi-Label Learning of Part Detectors for Heavily Occluded Pedestrian Detection
Chunluan Zhou, Junsong Yuan
[
pdf
] [
supp
]
SGN: Sequential Grouping Networks for Instance Segmentation
Shu Liu, Jiaya Jia, Sanja Fidler, Raquel Urtasun
[
pdf
]
Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors
Hong-Yu Zhou, Bin-Bin Gao, Jianxin Wu
[
pdf
] [
arXiv
]
Aesthetic Critiques Generation for Photos
Kuang-Yu Chang, Kung-Hung Lu, Chu-Song Chen
[
pdf
] [
supp
]
Hide-And-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization
Krishna Kumar Singh, Yong Jae Lee
[
pdf
] [
supp
]
Two-Phase Learning for Weakly Supervised Object Localization
Dahun Kim, Donghyeon Cho, Donggeun Yoo, In So Kweon
[
pdf
] [
arXiv
]
Curriculum Dropout
Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, Rene Vidal, Vittorio Murino
[
pdf
] [
supp
] [
arXiv
]
Predictor Combination at Test Time
Kwang In Kim, James Tompkin, Christian Richardt
[
pdf
] [
supp
]
Guided Perturbations: Self-Corrective Behavior in Convolutional Neural Networks
Swami Sankaranarayanan, Arpit Jain, Ser Nam Lim
[
pdf
]
Learning Robust Visual-Semantic Embeddings
Yao-Hung Hubert Tsai, Liang-Kang Huang, Ruslan Salakhutdinov
[
pdf
] [
supp
] [
arXiv
]
PUnDA: Probabilistic Unsupervised Domain Adaptation for Knowledge Transfer Across Visual Categories
Behnam Gholami, Ognjen (Oggi) Rudovic, Vladimir Pavlovic
[
pdf
] [
supp
]
Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses
Christian Rupprecht, Iro Laina, Robert DiPietro, Maximilian Baust, Federico Tombari, Nassir Navab, Gregory D. Hager
[
pdf
] [
arXiv
]
CDTS: Collaborative Detection, Tracking, and Segmentation for Online Multiple Object Segmentation in Videos
Yeong Jun Koh, Chang-Su Kim
[
pdf
]
Temporal Superpixels Based on Proximity-Weighted Patch Matching
Se-Ho Lee, Won-Dong Jang, Chang-Su Kim
[
pdf
]
Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge
Ryota Hinami, Tao Mei, Shin'ichi Satoh
[
pdf
] [
supp
] [
arXiv
]
TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals
Jiyang Gao, Zhenheng Yang, Kan Chen, Chen Sun, Ram Nevatia
[
pdf
] [
supp
] [
arXiv
]
Online Real-Time Multiple Spatiotemporal Action Localisation and Prediction
Gurkirt Singh, Suman Saha, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin
[
pdf
] [
supp
] [
arXiv
]
Leveraging Weak Semantic Relevance for Complex Video Event Classification
Chao Li, Jiewei Cao, Zi Huang, Lei Zhu, Heng Tao Shen
[
pdf
]
Weakly Supervised Summarization of Web Videos
Rameswar Panda, Abir Das, Ziyan Wu, Jan Ernst, Amit K. Roy-Chowdhury
[
pdf
] [
supp
]
FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras
Shanghang Zhang, Guanhang Wu, Joao P. Costeira, Jose M. F. Moura
[
pdf
]
Fast Face-Swap Using Convolutional Neural Networks
Iryna Korshunova, Wenzhe Shi, Joni Dambre, Lucas Theis
[
pdf
] [
arXiv
]
Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images
Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz
[
pdf
] [
supp
]
First-Person Activity Forecasting With Online Inverse Reinforcement Learning
Nicholas Rhinehart, Kris M. Kitani
[
pdf
] [
supp
] [
arXiv
]
Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment With Limited Resources
Adrian Bulat, Georgios Tzimiropoulos
[
pdf
] [
supp
]
MoFA: Model-Based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction
Ayush Tewari, Michael Zollhofer, Hyeongwoo Kim, Pablo Garrido, Florian Bernard, Patrick Perez, Christian Theobalt
[
pdf
] [
supp
] [
arXiv
]
RPAN: An End-To-End Recurrent Pose-Attention Network for Action Recognition in Videos
Wenbin Du, Yali Wang, Yu Qiao
[
pdf
] [
supp
]
Temporal Non-Volume Preserving Approach to Facial Age-Progression and Age-Invariant Face Recognition
Chi Nhan Duong, Kha Gia Quach, Khoa Luu, Ngan Le, Marios Savvides
[
pdf
] [
arXiv
]
Attribute-Enhanced Face Recognition With Neural Tensor Fusion Networks
Guosheng Hu, Yang Hua, Yang Yuan, Zhihong Zhang, Zheng Lu, Sankha S. Mukherjee, Timothy M. Hospedales, Neil M. Robertson, Yongxin Yang
[
pdf
]
Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro
Zhedong Zheng, Liang Zheng, Yi Yang
[
pdf
] [
arXiv
]
Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks With Spatiotemporal Transformer Modules
Congqi Cao, Yifan Zhang, Yi Wu, Hanqing Lu, Jian Cheng
[
pdf
]
Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition
Wanglong Wu, Meina Kan, Xin Liu, Yi Yang, Shiguang Shan, Xilin Chen
[
pdf
]
Learning Discriminative Aggregation Network for Video-Based Face Recognition
Yongming Rao, Ji Lin, Jiwen Lu, Jie Zhou
[
pdf
]
Synergy Between Face Alignment and Tracking via Discriminative Global Consensus Optimization
Muhammad Haris Khan, John McDonagh, Georgios Tzimiropoulos
[
pdf
]
SVDNet for Pedestrian Retrieval
Yifan Sun, Liang Zheng, Weijian Deng, Shengjin Wang
[
pdf
] [
arXiv
]
Towards More Accurate Iris Recognition Using Deeply Learned Spatially Corresponding Features
Zijing Zhao, Ajay Kumar
[
pdf
] [
supp
]
Semantically Informed Multiview Surface Refinement
Maros Blaha, Mathias Rothermel, Martin R. Oswald, Torsten Sattler, Audrey Richard, Jan D. Wegner, Marc Pollefeys, Konrad Schindler
[
pdf
] [
supp
] [
arXiv
]
BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects Without Using Depth
Mahdi Rad, Vincent Lepetit
[
pdf
] [
arXiv
]
Modeling Urban Scenes From Pointclouds
William Nguatem, Helmut Mayer
[
pdf
] [
supp
]
Parameter-Free Lens Distortion Calibration of Central Cameras
Filippo Bergamasco, Luca Cosmo, Andrea Gasparetto, Andrea Albarelli, Andrea Torsello
[
pdf
]
Pose Guided RGBD Feature Learning for 3D Object Pose Estimation
Vassileios Balntas, Andreas Doumanoglou, Caner Sahin, Juil Sock, Rigas Kouskouridas, Tae-Kyun Kim
[
pdf
]
Efficient Global Illumination for Morphable Models
Andreas Schneider, Sandro Schonborn, Lavrenti Frobeen, Bernhard Egger, Thomas Vetter
[
pdf
]
Low Compute and Fully Parallel Computer Vision With HashMatch
Sean Ryan Fanello, Julien Valentin, Adarsh Kowdle, Christoph Rhemann, Vladimir Tankovich, Carlo Ciliberto, Philip Davidson, Shahram Izadi
[
pdf
] [
supp
]
Dense Non-Rigid Structure-From-Motion and Shading With Unknown Albedos
Mathias Gallardo, Toby Collins, Adrien Bartoli
[
pdf
] [
supp
]
From Point Clouds to Mesh Using Regression
Lubor Ladicky, Olivier Saurer, SoHyeon Jeong, Fabio Maninchedda, Marc Pollefeys
[
pdf
]
Stereo DSO: Large-Scale Direct Sparse Visual Odometry With Stereo Cameras
Rui Wang, Martin Schworer, Daniel Cremers
[
pdf
] [
supp
] [
arXiv
]
Space-Time Localization and Mapping
Minhaeng Lee, Charless C. Fowlkes
[
pdf
] [
supp
]
Benchmarking Single-Image Reflection Removal Algorithms
Renjie Wan, Boxin Shi, Ling-Yu Duan, Ah-Hwee Tan, Alex C. Kot
[
pdf
]
Attention-Aware Deep Reinforcement Learning for Video Face Recognition
Yongming Rao, Jiwen Lu, Jie Zhou
[
pdf
]
Learning to Fuse 2D and 3D Image Cues for Monocular Body Pose Estimation
Bugra Tekin, Pablo Marquez-Neila, Mathieu Salzmann, Pascal Fua
[
pdf
] [
supp
] [
arXiv
]
Deep Facial Action Unit Recognition From Partially Labeled Data
Shan Wu, Shangfei Wang, Bowen Pan, Qiang Ji
[
pdf
]
Pose-Driven Deep Convolutional Model for Person Re-Identification
Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian
[
pdf
] [
arXiv
]
Recognition of Action Units in the Wild With Deep Nets and a New Global-Local Loss
C. Fabian Benitez-Quiroz, Yan Wang, Aleix M. Martinez
[
pdf
] [
supp
]
Faster Than Real-Time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses
Chandrasekhar Bhagavatula, Chenchen Zhu, Khoa Luu, Marios Savvides
[
pdf
] [
arXiv
]
Towards Large-Pose Face Frontalization in the Wild
Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, Manmohan Chandraker
[
pdf
] [
supp
] [
arXiv
]
A Joint Intrinsic-Extrinsic Prior Model for Retinex
Bolun Cai, Xianming Xu, Kailing Guo, Kui Jia, Bin Hu, Dacheng Tao
[
pdf
]
Going Unconstrained With Rolling Shutter Deblurring
Mahesh Mohan M. R., A. N. Rajagopalan, Gunasekaran Seetharaman
[
pdf
] [
supp
]
A Stagewise Refinement Model for Detecting Salient Objects in Images
Tiantian Wang, Ali Borji, Lihe Zhang, Pingping Zhang, Huchuan Lu
[
pdf
]
From Square Pieces to Brick Walls: The Next Challenge in Solving Jigsaw Puzzles
Shir Gur, Ohad Ben-Shahar
[
pdf
]
Online Video Deblurring via Dynamic Temporal Blending Network
Tae Hyun Kim, Kyoung Mu Lee, Bernhard Scholkopf, Michael Hirsch
[
pdf
] [
arXiv
]
Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector
Dingwen Zhang, Junwei Han, Yu Zhang
[
pdf
]
Fast Multi-Image Matching via Density-Based Clustering
Roberto Tron, Xiaowei Zhou, Carlos Esteves, Kostas Daniilidis
[
pdf
]
Characterizing and Improving Stability in Neural Style Transfer
Agrim Gupta, Justin Johnson, Alexandre Alahi, Li Fei-Fei
[
pdf
] [
supp
] [
arXiv
]
Cross-Modal Deep Variational Hashing
Venice Erin Liong, Jiwen Lu, Yap-Peng Tan, Jie Zhou
[
pdf
]
Spatial Memory for Context Reasoning in Object Detection
Xinlei Chen, Abhinav Gupta
[
pdf
] [
arXiv
]
Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval
Yuming Shen, Li Liu, Ling Shao, Jingkuan Song
[
pdf
] [
arXiv
]
Learning a Recurrent Residual Fusion Network for Multimodal Matching
Yu Liu, Yanming Guo, Erwin M. Bakker, Michael S. Lew
[
pdf
]
Rotational Subgroup Voting and Pose Clustering for Robust 3D Object Recognition
Anders Glent Buch, Lilita Kiforenko, Dirk Kraft
[
pdf
] [
arXiv
]
CoupleNet: Coupling Global Structure With Local Parts for Object Detection
Yousong Zhu, Chaoyang Zhao, Jinqiao Wang, Xu Zhao, Yi Wu, Hanqing Lu
[
pdf
] [
arXiv
]
Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training
Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, Bernt Schiele
[
pdf
] [
supp
] [
arXiv
]
Drone-Based Object Counting by Spatially Regularized Regional Proposal Network
Meng-Ru Hsieh, Yen-Liang Lin, Winston H. Hsu
[
pdf
] [
arXiv
]
BlitzNet: A Real-Time Deep Network for Scene Understanding
Nikita Dvornik, Konstantin Shmelkov, Julien Mairal, Cordelia Schmid
[
pdf
] [
supp
]
Joint Learning of Object and Action Detectors
Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, Cordelia Schmid
[
pdf
]
Situation Recognition With Graph Neural Networks
Ruiyu Li, Makarand Tapaswi, Renjie Liao, Jiaya Jia, Raquel Urtasun, Sanja Fidler
[
pdf
] [
supp
] [
arXiv
]
Learning Visual N-Grams From Web Data
Ang Li, Allan Jabri, Armand Joulin, Laurens van der Maaten
[
pdf
] [
supp
] [
arXiv
]
Attention-Based Multimodal Fusion for Video Description
Chiori Hori, Takaaki Hori, Teng-Yok Lee, Ziming Zhang, Bret Harsham, John R. Hershey, Tim K. Marks, Kazuhiko Sumi
[
pdf
] [
supp
] [
arXiv
]
Learning the Latent "Look": Unsupervised Discovery of a Style-Coherent Embedding From Fashion Images
Wei-Lin Hsiao, Kristen Grauman
[
pdf
] [
arXiv
]
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks
Tanmay Gupta, Kevin Shih, Saurabh Singh, Derek Hoiem
[
pdf
] [
arXiv
]
Learning Discriminative Latent Attributes for Zero-Shot Classification
Huajie Jiang, Ruiping Wang, Shiguang Shan, Yi Yang, Xilin Chen
[
pdf
] [
supp
]
PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN
Hanwang Zhang, Zawlin Kyaw, Jinyang Yu, Shih-Fu Chang
[
pdf
]
Higher-Order Minimum Cost Lifted Multicuts for Motion Segmentation
Margret Keuper
[
pdf
] [
arXiv
]
Deep Free-Form Deformation Network for Object-Mask Registration
Haoyang Zhang, Xuming He
[
pdf
]
Region-Based Correspondence Between 3D Shapes via Spatially Smooth Biclustering
Matteo Denitto, Simone Melzi, Manuele Bicego, Umberto Castellani, Alessandro Farinelli, Mario A. T. Figueiredo, Yanir Kleiman, Maks Ovsjanikov
[
pdf
]
Learning Discriminative ab-Divergences for Positive Definite Matrices
Anoop Cherian, Panagiotis Stanitsas, Mehrtash Harandi, Vassilios Morellas, Nikolaos Papanikolopoulos
[
pdf
]
Consensus Convolutional Sparse Coding
Biswarup Choudhury, Robin Swanson, Felix Heide, Gordon Wetzstein, Wolfgang Heidrich
[
pdf
] [
supp
]
Domain-Adaptive Deep Network Compression
Marc Masana, Joost van de Weijer, Luis Herranz, Andrew D. Bagdanov, Jose M. Alvarez
[
pdf
] [
arXiv
]
Self-Supervised Learning of Pose Embeddings From Spatiotemporal Relations in Videos
Omer Sumer, Tobias Dencker, Bjorn Ommer
[
pdf
] [
supp
] [
arXiv
]
Approximate Grassmannian Intersections: Subspace-Valued Subspace Learning
Calvin Murdock, Fernando De la Torre
[
pdf
]
Side Information in Robust Principal Component Analysis: Algorithms and Applications
Niannan Xue, Yannis Panagakis, Stefanos Zafeiriou
[
pdf
] [
supp
] [
arXiv
]
Summarization and Classification of Wearable Camera Streams by Learning the Distributions Over Deep Features of Out-Of-Sample Image Sequences
Alessandro Perina, Sadegh Mohammadi, Nebojsa Jojic, Vittorio Murino
[
pdf
] [
supp
]
Unsupervised Learning From Video to Detect Foreground Objects in Single Images
Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu
[
pdf
] [
arXiv
]
Supplementary Meta-Learning: Towards a Dynamic Model for Deep Neural Networks
Feihu Zhang, Benjamin W. Wah
[
pdf
]
Adversarial Inverse Graphics Networks: Learning 2D-To-3D Lifting and Image-To-Image Translation From Unpaired Supervision
Hsiao-Yu Fish Tung, Adam W. Harley, William Seto, Katerina Fragkiadaki
[
pdf
] [
supp
]
Active Learning for Human Pose Estimation
Buyu Liu, Vittorio Ferrari
[
pdf
]
Interleaved Group Convolutions
Ting Zhang, Guo-Jun Qi, Bin Xiao, Jingdong Wang
[
pdf
] [
supp
]
Learning-Based Cloth Material Recovery From Video
Shan Yang, Junbang Liang, Ming C. Lin
[
pdf
]
Unsupervised Video Understanding by Reconciliation of Posture Similarities
Timo Milbich, Miguel Bautista, Ekaterina Sutter, Bjorn Ommer
[
pdf
] [
supp
]
Action Tubelet Detector for Spatio-Temporal Action Localization
Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, Cordelia Schmid
[
pdf
] [
arXiv
]
AMTnet: Action-Micro-Tube Regression by End-To-End Trainable Deep Architecture
Suman Saha, Gurkirt Singh, Fabio Cuzzolin
[
pdf
] [
supp
]
Constrained Convolutional Sparse Coding for Parametric Based Reconstruction of Line Drawings
Sara Shaheen, Lama Affara, Bernard Ghanem
[
pdf
] [
supp
]
Neural Ctrl-F: Segmentation-Free Query-By-String Word Spotting in Handwritten Manuscript Collections
Tomas Wilkinson, Jonas Lindstrom, Anders Brun
[
pdf
]
Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions
Pascal Mettes, Cees G. M. Snoek
[
pdf
] [
arXiv
]
Semantic Video CNNs Through Representation Warping
Raghudeep Gadde, Varun Jampani, Peter V. Gehler
[
pdf
] [
supp
] [
arXiv
]
Video Frame Synthesis Using Deep Voxel Flow
Ziwei Liu, Raymond A. Yeh, Xiaoou Tang, Yiming Liu, Aseem Agarwala
[
pdf
] [
arXiv
]
Detail-Revealing Deep Video Super-Resolution
Xin Tao, Hongyun Gao, Renjie Liao, Jue Wang, Jiaya Jia
[
pdf
] [
arXiv
]
Learning Video Object Segmentation With Visual Memory
Pavel Tokmakov, Karteek Alahari, Cordelia Schmid
[
pdf
] [
arXiv
]
EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis
Mehdi S. M. Sajjadi, Bernhard Scholkopf, Michael Hirsch
[
pdf
] [
supp
] [
arXiv
]
Makeup-Go: Blind Reversion of Portrait Edit
Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia
[
pdf
]
Shadow Detection With Conditional Generative Adversarial Networks
Vu Nguyen, Tomas F. Yago Vicente, Maozheng Zhao, Minh Hoai, Dimitris Samaras
[
pdf
]
Learning High Dynamic Range From Outdoor Panoramas
Jinsong Zhang, Jean-Francois Lalonde
[
pdf
] [
arXiv
]
DCTM: Discrete-Continuous Transformation Matching for Semantic Flow
Seungryong Kim, Dongbo Min, Stephen Lin, Kwanghoon Sohn
[
pdf
] [
arXiv
]
MemNet: A Persistent Memory Network for Image Restoration
Ying Tai, Jian Yang, Xiaoming Liu, Chunyan Xu
[
pdf
] [
arXiv
]
Structure-Measure: A New Way to Evaluate Foreground Maps
Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, Ali Borji
[
pdf
]
Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
Donghyeon Cho, Jinsun Park, Tae-Hyun Oh, Yu-Wing Tai, In So Kweon
[
pdf
] [
arXiv
]
Practical and Efficient Multi-View Matching
Eleonora Maset, Federica Arrigoni, Andrea Fusiello
[
pdf
]
Unrolled Memory Inner-Products: An Abstract GPU Operator for Efficient Vision-Related Computations
Yu-Sheng Lin, Wei-Chao Chen, Shao-Yi Chien
[
pdf
]
Learning to Push the Limits of Efficient FFT-Based Image Deconvolution
Jakob Kruse, Carsten Rother, Uwe Schmidt
[
pdf
] [
supp
]
Learning Spread-Out Local Feature Descriptors
Xu Zhang, Felix X. Yu, Sanjiv Kumar, Shih-Fu Chang
[
pdf
] [
arXiv
]
Visual Odometry for Pixel Processor Arrays
Laurie Bose, Jianing Chen, Stephen J. Carey, Piotr Dudek, Walterio Mayol-Cuevas
[
pdf
]
Joint Estimation of Camera Pose, Depth, Deblurring, and Super-Resolution From a Blurred Image Sequence
Haesol Park, Kyoung Mu Lee
[
pdf
] [
supp
] [
arXiv
]
2D-Driven 3D Object Detection in RGB-D Images
Jean Lahoud, Bernard Ghanem
[
pdf
] [
supp
]
Ray Space Features for Plenoptic Structure-From-Motion
Yingliang Zhang, Peihong Yu, Wei Yang, Yuanxi Ma, Jingyi Yu
[
pdf
]
Depth Estimation Using Structured Light Flow -- Analysis of Projected Pattern Flow on an Object's Surface
Ryo Furukawa, Ryusuke Sagawa, Hiroshi Kawasaki
[
pdf
] [
supp
]
Monocular Dense 3D Reconstruction of a Complex Dynamic Scene From Two Perspective Frames
Suryansh Kumar, Yuchao Dai, Hongdong Li
[
pdf
] [
supp
] [
arXiv
]
Optimal Transformation Estimation With Semantic Cues
Danda Pani Paudel, Adlane Habed, Luc Van Gool
[
pdf
]
Dynamics Enhanced Multi-Camera Motion Segmentation From Unsynchronized Videos
Xikang Zhang, Bengisu Ozbay, Mario Sznaier, Octavia Camps
[
pdf
]
Taking the Scenic Route to 3D: Optimising Reconstruction From Moving Cameras
Oscar Mendez, Simon Hadfield, Nicolas Pugeault, Richard Bowden
[
pdf
] [
supp
]
FLaME: Fast Lightweight Mesh Estimation Using Variational Smoothing on Delaunay Graphs
W. Nicholas Greene, Nicholas Roy
[
pdf
]
Efficient Algorithms for Moral Lineage Tracing
Markus Rempfler, Jan-Hendrik Lange, Florian Jug, Corinna Blasse, Eugene W. Myers, Bjoern H. Menze, Bjoern Andres
[
pdf
] [
supp
] [
arXiv
]
From RGB to Spectrum for Natural Scenes via Manifold-Based Mapping
Yan Jia, Yinqiang Zheng, Lin Gu, Art Subpa-Asa, Antony Lam, Yoichi Sato, Imari Sato
[
pdf
]
DeepFuse: A Deep Unsupervised Approach for Exposure Fusion With Extreme Exposure Image Pairs
K. Ram Prabhakar, V Sai Srikar, R. Venkatesh Babu
[
pdf
] [
supp
]
Learning Dense Facial Correspondences in Unconstrained Images
Ronald Yu, Shunsuke Saito, Haoxiang Li, Duygu Ceylan, Hao Li
[
pdf
] [
supp
] [
arXiv
]
Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-Identification
Shuangjie Xu, Yu Cheng, Kang Gu, Yang Yang, Shiyu Chang, Pan Zhou
[
pdf
] [
arXiv
]
Automatic Content-Aware Projection for 360deg Videos
Yeong Won Kim, Chang-Ryeol Lee, Dae-Yong Cho, Yong Hoon Kwon, Hyeok-Jae Choi, Kuk-Jin Yoon
[
pdf
] [
supp
]
Blur-Invariant Deep Learning for Blind-Deblurring
T. M. Nimisha, Akash Kumar Singh, A. N. Rajagopalan
[
pdf
] [
supp
]
Non-Linear Convolution Filters for CNN-Based Learning
Georgios Zoumpourlis, Alexandros Doumanoglou, Nicholas Vretos, Petros Daras
[
pdf
] [
arXiv
]
AOD-Net: All-In-One Dehazing Network
Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng
[
pdf
]
Simultaneous Detection and Removal of High Altitude Clouds From an Image
Tushar Sandhan, Jin Young Choi
[
pdf
]
Understanding Low- and High-Level Contributions to Fixation Prediction
Matthias Kummerer, Thomas S. A. Wallis, Leon A. Gatys, Matthias Bethge
[
pdf
] [
supp
]
Image Super-Resolution Using Dense Skip Connections
Tong Tong, Gen Li, Xiejie Liu, Qinquan Gao
[
pdf
]
Convergence Analysis of MAP Based Blur Kernel Estimation
Sunghyun Cho, Seungyong Lee
[
pdf
] [
supp
] [
arXiv
]
Blob Reconstruction Using Unilateral Second Order Gaussian Kernels With Application to High-ISO Long-Exposure Image Denoising
Gang Wang, Carlos Lopez-Molina, Bernard De Baets
[
pdf
]
Deep Generative Adversarial Compression Artifact Removal
Leonardo Galteri, Lorenzo Seidenari, Marco Bertini, Alberto Del Bimbo
[
pdf
] [
arXiv
]
Online Multi-Object Tracking Using CNN-Based Single Object Tracker With Spatial-Temporal Attention Mechanism
Qi Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang, Bin Liu, Nenghai Yu
[
pdf
] [
arXiv
]
Mutual Enhancement for Detection of Multiple Logos in Sports Videos
Yuan Liao, Xiaoqing Lu, Chengcui Zhang, Yongtao Wang, Zhi Tang
[
pdf
]
Referring Expression Generation and Comprehension via Attributes
Jingyu Liu, Liang Wang, Ming-Hsuan Yang
[
pdf
] [
supp
]
RoomNet: End-To-End Room Layout Estimation
Chen-Yu Lee, Vijay Badrinarayanan, Tomasz Malisiewicz, Andrew Rabinovich
[
pdf
]
SSH: Single Stage Headless Face Detector
Mahyar Najibi, Pouya Samangouei, Rama Chellappa, Larry S. Davis
[
pdf
] [
arXiv
]
AnnArbor: Approximate Nearest Neighbors Using Arborescence Coding
Artem Babenko, Victor Lempitsky
[
pdf
]
Boosting Image Captioning With Attributes
Ting Yao, Yingwei Pan, Yehao Li, Zhaofan Qiu, Tao Mei
[
pdf
] [
arXiv
]
Learning to Estimate 3D Hand Pose From Single RGB Images
Christian Zimmermann, Thomas Brox
[
pdf
] [
supp
]
Locally-Transferred Fisher Vectors for Texture Classification
Yang Song, Fan Zhang, Qing Li, Heng Huang, Lauren J. O'Donnell, Weidong Cai
[
pdf
]
Object-Level Proposals
Jianxiang Ma, Anlong Ming, Zilong Huang, Xinggang Wang, Yu Zhou
[
pdf
]
Extreme Clicking for Efficient Object Annotation
Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari
[
pdf
] [
supp
] [
arXiv
]
WordSup: Exploiting Word Annotations for Character Based Text Detection
Han Hu, Chengquan Zhang, Yuxuan Luo, Yuzhuo Wang, Junyu Han, Errui Ding
[
pdf
] [
arXiv
]
Illuminating Pedestrians via Simultaneous Detection & Segmentation
Garrick Brazil, Xi Yin, Xiaoming Liu
[
pdf
]
Generalized Orderless Pooling Performs Implicit Salient Matching
Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, Erik Rodner
[
pdf
] [
supp
] [
arXiv
]
Exploiting Spatial Structure for Localizing Manipulated Image Regions
Jawadul H. Bappy, Amit K. Roy-Chowdhury, Jason Bunk, Lakshmanan Nataraj, B. S. Manjunath
[
pdf
]
RDFNet: RGB-D Multi-Level Residual Feature Fusion for Indoor Semantic Segmentation
Seong-Jin Park, Ki-Sang Hong, Seungyong Lee
[
pdf
] [
supp
]
The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes
Gerhard Neuhold, Tobias Ollmann, Samuel Rota Bulo, Peter Kontschieder
[
pdf
] [
supp
]
Self-Organized Text Detection With Minimal Post-Processing via Border Learning
Yue Wu, Prem Natarajan
[
pdf
]
Sparse Exact PGA on Riemannian Manifolds
Monami Banerjee, Rudrasis Chakraborty, Baba C. Vemuri
[
pdf
]
Tensor RPCA by Bayesian CP Factorization With Complex Noise
Qiong Luo, Zhi Han, Xi'ai Chen, Yao Wang, Deyu Meng, Dong Liang, Yandong Tang
[
pdf
] [
supp
]
Multimodal Gaussian Process Latent Variable Models With Harmonization
Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian
[
pdf
]
Segmentation-Aware Convolutional Networks Using Local Attention Masks
Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos
[
pdf
] [
supp
] [
arXiv
]
Rotation Equivariant Vector Field Networks
Diego Marcos, Michele Volpi, Nikos Komodakis, Devis Tuia
[
pdf
] [
arXiv
]
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
Jian-Hao Luo, Jianxin Wu, Weiyao Lin
[
pdf
] [
arXiv
]
AutoDIAL: Automatic DomaIn Alignment Layers
Fabio Maria Carlucci, Lorenzo Porzi, Barbara Caputo, Elisa Ricci, Samuel Rota Bulo
[
pdf
] [
supp
] [
arXiv
]
Focusing Attention: Towards Accurate Text Recognition in Natural Images
Zhanzhan Cheng, Fan Bai, Yunlu Xu, Gang Zheng, Shiliang Pu, Shuigeng Zhou
[
pdf
] [
arXiv
]
Unsupervised Object Segmentation in Video by Efficient Selection of Highly Probable Positive Features
Emanuela Haller, Marius Leordeanu
[
pdf
] [
arXiv
]
Nonparametric Variational Auto-Encoders for Hierarchical Representation Learning
Prasoon Goyal, Zhiting Hu, Xiaodan Liang, Chenyu Wang, Eric P. Xing
[
pdf
] [
arXiv
]
Dense and Low-Rank Gaussian CRFs Using Deep Embeddings
Siddhartha Chandra, Nicolas Usunier, Iasonas Kokkinos
[
pdf
]
A Multimodal Deep Regression Bayesian Network for Affective Video Content Analyses
Quan Gan, Shangfei Wang, Longfei Hao, Qiang Ji
[
pdf
]
Moving Object Detection in Time-Lapse or Motion Trigger Image Sequences Using Low-Rank and Invariant Sparse Decomposition
Moein Shakeri, Hong Zhang
[
pdf
] [
supp
]
A Multilayer-Based Framework for Online Background Subtraction With Freely Moving Cameras
Yizhe Zhu, Ahmed Elgammal
[
pdf
] [
supp
] [
arXiv
]
Dynamic Label Graph Matching for Unsupervised Video Re-Identification
Mang Ye, Andy J. Ma, Liang Zheng, Jiawei Li, Pong C. Yuen
[
pdf
] [
arXiv
]
Spatiotemporal Modeling for Crowd Counting in Videos
Feng Xiong, Xingjian Shi, Dit-Yan Yeung
[
pdf
] [
supp
] [
arXiv
]
Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning
Tae-Hyun Oh, Kyungdon Joo, Neel Joshi, Baoyuan Wang, In So Kweon, Sing Bing Kang
[
pdf
] [
supp
]
What Is Around the Camera?
Stamatios Georgoulis, Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Tinne Tuytelaars, Luc Van Gool
[
pdf
] [
arXiv
]
Weakly-Supervised Learning of Visual Relations
Julia Peyre, Josef Sivic, Ivan Laptev, Cordelia Schmid
[
pdf
] [
arXiv
]
BIER - Boosting Independent Embeddings Robustly
Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof
[
pdf
] [
supp
]
3D Graph Neural Networks for RGBD Semantic Segmentation
Xiaojuan Qi, Renjie Liao, Jiaya Jia, Sanja Fidler, Raquel Urtasun
[
pdf
]
Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition
Heliang Zheng, Jianlong Fu, Tao Mei, Jiebo Luo
[
pdf
]
Learning 3D Object Categories by Looking Around Them
David Novotny, Diane Larlus, Andrea Vedaldi
[
pdf
] [
supp
] [
arXiv
]
Quantitative Evaluation of Confidence Measures in a Machine Learning World
Matteo Poggi, Fabio Tosi, Stefano Mattoccia
[
pdf
] [
supp
]
Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks
Hui Li, Peng Wang, Chunhua Shen
[
pdf
]
DeepSetNet: Predicting Sets With Deep Neural Networks
S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, Ian Reid
[
pdf
] [
supp
] [
arXiv
]
Learning From Video and Text via Large-Scale Discriminative Clustering
Antoine Miech, Jean-Baptiste Alayrac, Piotr Bojanowski, Ivan Laptev, Josef Sivic
[
pdf
] [
arXiv
]
TALL: Temporal Activity Localization via Language Query
Jiyang Gao, Chen Sun, Zhenheng Yang, Ram Nevatia
[
pdf
] [
supp
] [
arXiv
]
End-To-End Face Detection and Cast Grouping in Movies Using Erdos-Renyi Clustering
SouYoung Jin, Hang Su, Chris Stauffer, Erik Learned-Miller
[
pdf
] [
supp
]
Active Decision Boundary Annotation With Deep Generative Models
Miriam Huijser, Jan C. van Gemert
[
pdf
] [
supp
] [
arXiv
]
Convolutional Dictionary Learning via Local Processing
Vardan Papyan, Yaniv Romano, Jeremias Sulam, Michael Elad
[
pdf
] [
supp
] [
arXiv
]
Editable Parametric Dense Foliage From 3D Capture
Gaurav Chaurasia, Paul Beardsley
[
pdf
]
Refractive Structure-From-Motion Through a Flat Refractive Interface
Francois Chadebecq, Francisco Vasconcelos, George Dwyer, Rene Lacher, Sebastien Ourselin, Tom Vercauteren, Danail Stoyanov
[
pdf
]
Submodular Trajectory Optimization for Aerial 3D Scanning
Mike Roberts, Debadeepta Dey, Anh Truong, Sudipta Sinha, Shital Shah, Ashish Kapoor, Pat Hanrahan, Neel Joshi
[
pdf
] [
supp
] [
arXiv
]
Camera Calibration by Global Constraints on the Motion of Silhouettes
Gil Ben-Artzi
[
pdf
] [
arXiv
]
Deltille Grids for Geometric Camera Calibration
Hyowon Ha, Michal Perdoch, Hatem Alismail, In So Kweon, Yaser Sheikh
[
pdf
]
A Lightweight Single-Camera Polarization Compass With Covariance Estimation
Wolfgang Sturzl
[
pdf
] [
supp
]
Reflectance Capture Using Univariate Sampling of BRDFs
Zhuo Hui, Kalyan Sunkavalli, Joon-Young Lee, Sunil Hadap, Jian Wang, Aswin C. Sankaranarayanan
[
pdf
] [
supp
]
Estimating Defocus Blur via Rank of Local Patches
Guodong Xu, Yuhui Quan, Hui Ji
[
pdf
] [
supp
]
RGB-Infrared Cross-Modality Person Re-Identification
Ancong Wu, Wei-Shi Zheng, Hong-Xing Yu, Shaogang Gong, Jianhuang Lai
[
pdf
] [
supp
]
Intrinsic 3D Dynamic Surface Tracking Based on Dynamic Ricci Flow and Teichmuller Map
Xiaokang Yu, Na Lei, Yalin Wang, Xianfeng Gu
[
pdf
]
Multi-Scale Deep Learning Architectures for Person Re-Identification
Xuelin Qian, Yanwei Fu, Yu-Gang Jiang, Tao Xiang, Xiangyang Xue
[
pdf
] [
supp
] [
arXiv
]
Range Loss for Deep Face Recognition With Long-Tailed Training Data
Xiao Zhang, Zhiyuan Fang, Yandong Wen, Zhifeng Li, Yu Qiao
[
pdf
]
Face Sketch Matching via Coupled Deep Transform Learning
Shruti Nagpal, Maneet Singh, Richa Singh, Mayank Vatsa, Afzel Noore, Angshul Majumdar
[
pdf
]
Realistic Dynamic Facial Textures From a Single Image Using GANs
Kyle Olszewski, Zimo Li, Chao Yang, Yi Zhou, Ronald Yu, Zeng Huang, Sitao Xiang, Shunsuke Saito, Pushmeet Kohli, Hao Li
[
pdf
] [
supp
]
Pixel Recursive Super Resolution
Ryan Dahl, Mohammad Norouzi, Jonathon Shlens
[
pdf
] [
supp
] [
arXiv
]
PanNet: A Deep Network Architecture for Pan-Sharpening
Junfeng Yang, Xueyang Fu, Yuwen Hu, Yue Huang, Xinghao Ding, John Paisley
[
pdf
]
Recurrent Color Constancy
Yanlin Qian, Ke Chen, Jarno Nikkanen, Joni-Kristian Kamarainen, Jiri Matas
[
pdf
] [
supp
]
Saliency Pattern Detection by Ranking Structured Trees
Lei Zhu, Haibin Ling, Jin Wu, Huiping Deng, Jin Liu
[
pdf
]
Monocular Video-Based Trailer Coupler Detection Using Multiplexer Convolutional Neural Network
Yousef Atoum, Joseph Roth, Michael Bliss, Wende Zhang, Xiaoming Liu
[
pdf
]
Parallel Tracking and Verifying: A Framework for Real-Time and High Accuracy Visual Tracking
Heng Fan, Haibin Ling
[
pdf
] [
supp
] [
arXiv
]
Non-Rigid Object Tracking via Deformable Patches Using Shape-Preserved KCF and Level Sets
Xin Sun, Ngai-Man Cheung, Hongxun Yao, Yiluan Guo
[
pdf
]
A Discriminative View of MRF Pre-Processing Algorithms
Chen Wang, Charles Herrmann, Ramin Zabih
[
pdf
] [
supp
] [
arXiv
]
Offline Handwritten Signature Modeling and Verification Based on Archetypal Analysis
Elias N. Zois, Ilias Theodorakopoulos, George Economou
[
pdf
]
Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization
Huseyin Coskun, Felix Achilles, Robert DiPietro, Nassir Navab, Federico Tombari
[
pdf
]
Learning Spatio-Temporal Representation With Pseudo-3D Residual Networks
Zhaofan Qiu, Ting Yao, Tao Mei
[
pdf
]
Deeper, Broader and Artier Domain Generalization
Da Li, Yongxin Yang, Yi-Zhe Song, Timothy M. Hospedales
[
pdf
]
Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval
Jifei Song, Qian Yu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales
[
pdf
] [
supp
]
Soft-NMS -- Improving Object Detection With One Line of Code
Navaneeth Bodla, Bharat Singh, Rama Chellappa, Larry S. Davis
[
pdf
] [
arXiv
]
Semantic Jitter: Dense Supervision for Visual Comparisons via Synthetic Images
Aron Yu, Kristen Grauman
[
pdf
] [
supp
] [
arXiv
]
Video Scene Parsing With Predictive Feature Learning
Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan
[
pdf
] [
supp
] [
arXiv
]
Understanding and Mapping Natural Beauty
Scott Workman, Richard Souvenir, Nathan Jacobs
[
pdf
] [
supp
]
Human Pose Estimation Using Global and Local Normalization
Ke Sun, Cuiling Lan, Junliang Xing, Wenjun Zeng, Dong Liu, Jingdong Wang
[
pdf
] [
supp
] [
arXiv
]
HashNet: Deep Learning to Hash by Continuation
Zhangjie Cao, Mingsheng Long, Jianmin Wang, Philip S. Yu
[
pdf
] [
supp
] [
arXiv
]
Scaling the Scattering Transform: Deep Hybrid Networks
Edouard Oyallon, Eugene Belilovsky, Sergey Zagoruyko
[
pdf
] [
supp
] [
arXiv
]
Flip-Invariant Motion Representation
Takumi Kobayashi
[
pdf
] [
supp
]
Scene Categorization With Spectral Features
Salman H. Khan, Munawar Hayat, Fatih Porikli
[
pdf
]
Image2song: Song Retrieval via Bridging Image Content and Lyric Words
Xuelong Li, Di Hu, Xiaoqiang Lu
[
pdf
] [
supp
] [
arXiv
]
Deep Functional Maps: Structured Prediction for Dense Shape Correspondence
Or Litany, Tal Remez, Emanuele Rodola, Alex Bronstein, Michael Bronstein
[
pdf
] [
arXiv
]
Training Deep Networks to Be Spatially Sensitive
Nicholas Kolkin, Eli Shechtman, Gregory Shakhnarovich
[
pdf
] [
arXiv
]
3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds
Fangyu Liu, Shuaipeng Li, Liqiang Zhang, Chenghu Zhou, Rongtian Ye, Yuebin Wang, Jiwen Lu
[
pdf
] [
supp
] [
arXiv
]
Semi Supervised Semantic Segmentation Using Generative Adversarial Network
Nasim Souly, Concetto Spampinato, Mubarak Shah
[
pdf
]
Efficient Low Rank Tensor Ring Completion
Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron
[
pdf
] [
supp
] [
arXiv
]
Semantic Image Synthesis via Adversarial Learning
Hao Dong, Simiao Yu, Chao Wu, Yike Guo
[
pdf
] [
supp
] [
arXiv
]
Unified Deep Supervised Domain Adaptation and Generalization
Saeid Motiian, Marco Piccirilli, Donald A. Adjeroh, Gianfranco Doretto
[
pdf
] [
arXiv
]
Interpretable Transformations With Encoder-Decoder Networks
Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow
[
pdf
]
Deep Clustering via Joint Convolutional Autoencoder Embedding and Relative Entropy Minimization
Kamran Ghasedi Dizaji, Amirhossein Herandi, Cheng Deng, Weidong Cai, Heng Huang
[
pdf
] [
arXiv
]
Deep Scene Image Classification With the MFAFVNet
Yunsheng Li, Mandar Dixit, Nuno Vasconcelos
[
pdf
]
Learning Bag-Of-Features Pooling for Deep Convolutional Neural Networks
Nikolaos Passalis, Anastasios Tefas
[
pdf
]
Adversarial Examples Detection in Deep Networks With Convolutional Filter Statistics
Xin Li, Fuxin Li
[
pdf
] [
arXiv
]
Joint Prediction of Activity Labels and Starting Times in Untrimmed Videos
Tahmida Mahmud, Mahmudul Hasan, Amit K. Roy-Chowdhury
[
pdf
] [
supp
]
R-C3D: Region Convolutional 3D Network for Temporal Activity Detection
Huijuan Xu, Abir Das, Kate Saenko
[
pdf
] [
supp
] [
arXiv
]
Temporal Context Network for Activity Localization in Videos
Xiyang Dai, Bharat Singh, Guyue Zhang, Larry S. Davis, Yan Qiu Chen
[
pdf
] [
arXiv
]
Localizing Moments in Video With Natural Language
Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, Trevor Darrell, Bryan Russell
[
pdf
] [
supp
] [
arXiv
]
TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal
Hongyuan Zhu, Romain Vial, Shijian Lu
[
pdf
]
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos
Rui Hou, Chen Chen, Mubarak Shah
[
pdf
]
Learning Action Recognition Model From Depth and Skeleton Videos
Hossein Rahmani, Mohammed Bennamoun
[
pdf
]
The "Something Something" Video Database for Learning and Evaluating Visual Common Sense
Raghav Goyal, Samira Ebrahimi Kahou, Vincent Michalski, Joanna Materzynska, Susanne Westphal, Heuna Kim, Valentin Haenel, Ingo Fruend, Peter Yianilos, Moritz Mueller-Freitag, Florian Hoppe, Christian Thurau, Ingo Bax, Roland Memisevic
[
pdf
] [
supp
] [
arXiv
]
GPLAC: Generalizing Vision-Based Robotic Skills Using Weakly Labeled Images
Avi Singh, Larry Yang, Sergey Levine
[
pdf
] [
supp
] [
arXiv
]
Semi-Global Weighted Least Squares in Image Filtering
Wei Liu, Xiaogang Chen, Chuanhua Shen, Zhi Liu, Jie Yang
[
pdf
] [
arXiv
]
Scale Recovery for Monocular Visual Odometry Using Depth Estimated With Deep Convolutional Neural Fields
Xiaochuan Yin, Xiangwei Wang, Xiaoguo Du, Qijun Chen
[
pdf
]
Deep Adaptive Image Clustering
Jianlong Chang, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, Chunhong Pan
[
pdf
] [
supp
]
One Network to Solve Them All -- Solving Linear Inverse Problems Using Deep Projection Models
J. H. Rick Chang, Chun-Liang Li, Barnabas Poczos, B. V. K. Vijaya Kumar, Aswin C. Sankaranarayanan
[
pdf
] [
supp
]
Representation Learning by Learning to Count
Mehdi Noroozi, Hamed Pirsiavash, Paolo Favaro
[
pdf
] [
arXiv
]
StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks
Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris N. Metaxas
[
pdf
] [
arXiv
]
Unsupervised Learning of Object Landmarks by Factorized Spatial Embeddings
James Thewlis, Hakan Bilen, Andrea Vedaldi
[
pdf
] [
supp
]
展开全文
点赞
0
阅读
6+
评论
0+
评论
相关主题
ICCV
Top
提示
微信扫码
咨询专知VIP会员与技术项目合作
(加微信请备注: "专知")
微信扫码咨询专知VIP会员
Top