时间: 野生的模拟时钟阅读 (It's About Time: Analog Clock Reading in the Wild) - 专知论文

会员服务 ·

0

可约的 · 伪标记 · Sim2Real · 未标记 · 模型评估 ·

2021 年 12 月 6 日

It's About Time: Analog Clock Reading in the Wild

翻译：时间: 野生的模拟时钟阅读

Charig Yang,Weidi Xie,Andrew Zisserman

from arxiv, Project page: https://charigyang.github.io/abouttime/

In this paper, we present a framework for reading analog clocks in natural images or videos. Specifically, we make the following contributions: First, we create a scalable pipeline for generating synthetic clocks, significantly reducing the requirements for the labour-intensive annotations; Second, we introduce a clock recognition architecture based on spatial transformer networks (STN), which is trained end-to-end for clock alignment and recognition. We show that the model trained on the proposed synthetic dataset generalises towards real clocks with good accuracy, advocating a Sim2Real training regime; Third, to further reduce the gap between simulation and real data, we leverage the special property of time, i.e. uniformity, to generate reliable pseudo-labels on real unlabelled clock videos, and show that training on these videos offers further improvements while still requiring zero manual annotations. Lastly, we introduce three benchmark datasets based on COCO, Open Images, and The Clock movie, totalling 4,472 images with clocks, with full annotations for time, accurate to the minute.

翻译：在本文中,我们提出了一个在自然图像或视频中读取模拟时钟的框架。具体来说,我们做出了以下贡献:第一,我们为制作合成时钟创建了可缩放的管道,大大减少了对劳动密集型说明的需求;第二,我们引入了基于空间变压器网络的时钟识别结构(STN),该结构经过了对时钟对齐和识别的训练,它端对端对端对端。我们展示了在拟议合成数据集中经过培训的模型,该模型以精确的方式向真实时钟提供精确的缩写,倡导Sim2Real培训制度;第三,为了进一步缩小模拟与真实数据之间的差距,我们利用时间的特殊属性,即一致性,在真实的无标签时钟视频上生成可靠的假标签,并展示这些视频的培训可以进一步改进,同时仍然需要零手动说明。最后,我们引入了三个基于CO、开放图像和Clock电影的基准数据集,共4 472张带有时钟的图像,并附有时间的完整说明,准确到分钟。

0

相关内容

可约的

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

专知会员服务

34+阅读 · 2021年11月30日

【干货书】计算成像，483页pdf，Computational Imaging Book, MIT 出版社

专知会员服务

65+阅读 · 2021年9月12日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

专知会员服务

24+阅读 · 2019年12月15日

【ACL 2019 Tutorials】论据挖掘研究进展（Advances in Argument Mining）

【ACL 2019 Tutorials】论据挖掘研究进展（Advances in Argument Mining）

专知会员服务

16+阅读 · 2019年11月18日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

已删除

将门创投

3+阅读 · 2019年4月12日

SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation

Arxiv

0+阅读 · 2022年2月10日

Simplify the Usage of Lexicon in Chinese NER

Arxiv

5+阅读 · 2020年10月14日

Scene-based Factored Attention for Image Captioning

Arxiv

4+阅读 · 2019年8月7日

SFA: Small Faces Attention Face Detector

Arxiv

4+阅读 · 2018年12月20日

OMNIA Faster R-CNN: Detection in the wild through dataset merging and soft distillation

OMNIA Faster R-CNN: Detection in the wild through dataset merging and soft distillation

Arxiv

6+阅读 · 2018年12月6日

Generating Fine-Grained Open Vocabulary Entity Type Descriptions

Arxiv

4+阅读 · 2018年5月27日

Domain Adaptive Faster R-CNN for Object Detection in the Wild

Arxiv

10+阅读 · 2018年3月8日

Few-Example Object Detection with Model Communication

Arxiv

7+阅读 · 2018年2月14日

DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild

Arxiv

3+阅读 · 2018年2月1日

Detecting Curve Text in the Wild: New Dataset and New Solution

Arxiv

4+阅读 · 2017年12月6日

VIP会员

文章信息

相关主题

相关VIP内容

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

专知会员服务

34+阅读 · 2021年11月30日

【干货书】计算成像，483页pdf，Computational Imaging Book, MIT 出版社

专知会员服务

65+阅读 · 2021年9月12日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

【论文推荐】小样本视频合成，Few-shot Video-to-Video Synthesis

专知会员服务

24+阅读 · 2019年12月15日

【ACL 2019 Tutorials】论据挖掘研究进展（Advances in Argument Mining）

【ACL 2019 Tutorials】论据挖掘研究进展（Advances in Argument Mining）

专知会员服务

16+阅读 · 2019年11月18日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《多智能体不确定环境追逃博弈研究》216页

美智库最新发布《解放军"人机编组协同作战"发展路径：理论与实践》53页

现代战争"杀伤区"理论：空间尺度与结构特征、控制手段与毁伤机制、生存策略与战线转移

《俄军无人机创新技术或已在乌克兰达成"战场空中封锁"作战效果》最新18页报告

相关资讯

已删除

将门创投

3+阅读 · 2019年4月12日

相关论文

SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation

Arxiv

0+阅读 · 2022年2月10日

Simplify the Usage of Lexicon in Chinese NER

Arxiv

5+阅读 · 2020年10月14日

Scene-based Factored Attention for Image Captioning

Arxiv

4+阅读 · 2019年8月7日

SFA: Small Faces Attention Face Detector

Arxiv

4+阅读 · 2018年12月20日

OMNIA Faster R-CNN: Detection in the wild through dataset merging and soft distillation

OMNIA Faster R-CNN: Detection in the wild through dataset merging and soft distillation

Arxiv

6+阅读 · 2018年12月6日

Generating Fine-Grained Open Vocabulary Entity Type Descriptions

Arxiv

4+阅读 · 2018年5月27日

Domain Adaptive Faster R-CNN for Object Detection in the Wild

Arxiv

10+阅读 · 2018年3月8日

Few-Example Object Detection with Model Communication

Arxiv

7+阅读 · 2018年2月14日

DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild

Arxiv

3+阅读 · 2018年2月1日

Detecting Curve Text in the Wild: New Dataset and New Solution

Arxiv

4+阅读 · 2017年12月6日

微信扫码咨询专知VIP会员