小型数据集多域多定义多定义地标本地化 (Multi-Domain Multi-Definition Landmark Localization for Small Datasets)

We present a novel method for multi image domain and multi-landmark definition learning for small dataset facial localization. Training a small dataset alongside a large(r) dataset helps with robust learning for the former, and provides a universal mechanism for facial landmark localization for new and/or smaller standard datasets. To this end, we propose a Vision Transformer encoder with a novel decoder with a definition agnostic shared landmark semantic group structured prior, that is learnt, as we train on more than one dataset concurrently. Due to our novel definition agnostic group prior the datasets may vary in landmark definitions and domains. During the decoder stage we use cross- and self-attention, whose output is later fed into domain/definition specific heads that minimize a Laplacian-log-likelihood loss. We achieve state-of-the-art performance on standard landmark localization datasets such as COFW and WFLW, when trained with a bigger dataset. We also show state-of-the-art performance on several varied image domain small datasets for animals, caricatures, and facial portrait paintings. Further, we contribute a small dataset (150 images) of pareidolias to show efficacy of our method. Finally, we provide several analysis and ablation studies to justify our claims.

翻译：我们为小型数据集的面部本地化提供了一个多图像域和多陆标记定义学习的新方法。在大型(r)数据集的同时培训一个小型数据集,有助于对前者进行强有力的学习,并为新的和(或)较小的标准数据集提供一个通用的面部里程碑定位机制。为此,我们提议了一个具有新颖解码器的愿景变异器编码器,该编码器具有定义性共同标志性共同标志性语义组结构,在我们同时培训不止一个数据集时学习。由于我们的新定义,在数据组之前的不可知性组可能会在里程碑定义和域上出现差异。在解码阶段,我们使用交叉和自我目的,其输出随后被输入到域/定义特定头,以最大限度地减少拉普拉西亚-loglishing损失。我们在接受更大数据集培训时,在标准地标化数据集(如COFW和WLFW)上取得了最先进的表现。我们还展示了几个不同图像域域域域域域的状态性表现,用于动物、刻度、卡利卡仪、图像的最终分析。我们提供了几种数据分析方法。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日