校准谷歌趋势时间序列 (Calibration of Google Trends Time Series) - 专知论文

会员服务 ·

0

Google Trends · ReQuEST · Google · anchor · 缩放 ·

2021 年 2 月 4 日

Calibration of Google Trends Time Series

翻译：校准谷歌趋势时间序列

from arxiv, Proceedings of the 29th ACM Conference on Information and Knowledge Management (CIKM), 2020

Google Trends is a tool that allows researchers to analyze the popularity of Google search queries across time and space. In a single request, users can obtain time series for up to 5 queries on a common scale, normalized to the range from 0 to 100 and rounded to integer precision. Despite the overall value of Google Trends, rounding causes major problems, to the extent that entirely uninformative, all-zero time series may be returned for unpopular queries when requested together with more popular queries. We address this issue by proposing Google Trends Anchor Bank (G-TAB), an efficient solution for the calibration of Google Trends data. Our method expresses the popularity of an arbitrary number of queries on a common scale without being compromised by rounding errors. The method proceeds in two phases. In the offline preprocessing phase, an "anchor bank" is constructed, a set of queries spanning the full spectrum of popularity, all calibrated against a common reference query by carefully chaining together multiple Google Trends requests. In the online deployment phase, any given search query is calibrated by performing an efficient binary search in the anchor bank. Each search step requires one Google Trends request, but few steps suffice, as we demonstrate in an empirical evaluation. We make our code publicly available as an easy-to-use library at https://github.com/epfl-dlab/GoogleTrendsAnchorBank.

翻译：谷歌趋势是一个工具,使研究人员能够分析谷歌搜索查询在时间和空间间广度的普及程度。在一个单一的请求中,用户可以获得最多5个通用查询的时间序列, 普通化为0至100, 整整整精确度不等。尽管谷歌趋势的总体价值是0至100之间的, 四舍五入为整数的精确度。尽管“ 谷歌趋势” 的总值, 四舍五入造成了重大问题, 以致于在完全没有信息规范的情况下, 将所有零时间序列都返回到不受欢迎的查询中, 以及更受欢迎的查询。我们提出谷歌趋势数据库( G- TAB) 的高效校准数据校准。我们的方法是, 在主机库中进行高效的二进制搜索, 我们的搜索步骤需要一份Google States的任意数量的查询, 而在离线前处理阶段, 建立一个“ 锁定银行” 库, 覆盖全域的查询, 与共同的查询相校准。在在线部署阶段, 任何给搜索查询的校准是通过在主机库中进行高效的二进搜索。每个搜索步骤都要求一份Google Streal- bas- a exliglemental as a as as pilling as eximliver as as as as as as as as as as as as.

0

相关内容

Google Trends

多标签学习的新趋势（2020 Survey）

多标签学习的新趋势（2020 Survey）

专知会员服务

44+阅读 · 2020年12月6日

【NeurIPS2020】可靠图神经网络鲁棒聚合

【NeurIPS2020】可靠图神经网络鲁棒聚合

专知会员服务

20+阅读 · 2020年11月6日

【清华大学】图随机神经网络，Graph Random Neural Networks

【清华大学】图随机神经网络，Graph Random Neural Networks

专知会员服务

156+阅读 · 2020年5月26日

【ACL2020-Google】BLEURT:一种基于迁移学习的自然语言生成度量

【ACL2020-Google】BLEURT:一种基于迁移学习的自然语言生成度量

专知会员服务

20+阅读 · 2020年5月12日

【Google AI】开源NoisyStudent：自监督图像分类

【Google AI】开源NoisyStudent：自监督图像分类

专知会员服务

55+阅读 · 2020年2月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Google Research Football (scenario 2) 实验

Google Research Football (scenario 2) 实验

CreateAMind

8+阅读 · 2019年8月29日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【大数据】StreamSets：一个大数据采集工具

【大数据】StreamSets：一个大数据采集工具

产业智能官

40+阅读 · 2018年12月5日

【泡泡一分钟】一种实用且高效的多视图匹配方法

【泡泡一分钟】一种实用且高效的多视图匹配方法

泡泡机器人SLAM

6+阅读 · 2018年11月19日

【泡泡一分钟】基于均值偏移聚类方法的3D点云配准算法（3dv-49）

【泡泡一分钟】基于均值偏移聚类方法的3D点云配准算法（3dv-49）

泡泡机器人SLAM

6+阅读 · 2018年2月28日

Python机器学习教程资料/代码

Python机器学习教程资料/代码

机器学习研究会

8+阅读 · 2018年2月22日

【推荐】树莓派/OpenCV/dlib人脸定位/瞌睡检测

【推荐】树莓派/OpenCV/dlib人脸定位/瞌睡检测

机器学习研究会

9+阅读 · 2017年10月24日

【回顾】用于目标检测的DSOD模型（ICCV 2017）

【回顾】用于目标检测的DSOD模型（ICCV 2017）

AI研习社

3+阅读 · 2017年10月16日

Integração e Entrega Contínua para aplicações móveis desenvolvidas em React Native

Integração e Entrega Contínua para aplicações móveis desenvolvidas em React Native

Arxiv

0+阅读 · 2021年3月30日

Strong Optimal Classification Trees

Arxiv

0+阅读 · 2021年3月29日

Improving Unsupervised Image Clustering With Robust Learning

Improving Unsupervised Image Clustering With Robust Learning

Arxiv

0+阅读 · 2021年3月29日

Monte Carlo algorithm for the extrema of tempered stable processes

Arxiv

0+阅读 · 2021年3月29日

Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks

Arxiv

7+阅读 · 2020年12月15日

Products of Euclidean metrics and applications to proximity questions among curves

Arxiv

3+阅读 · 2020年4月13日

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Arxiv

18+阅读 · 2019年9月25日

Reinforcement Learning with Perturbed Rewards

Arxiv

4+阅读 · 2018年10月5日

Geometry-Based Multiple Camera Head Detection in Dense Crowds

Geometry-Based Multiple Camera Head Detection in Dense Crowds

Arxiv

3+阅读 · 2018年8月2日

Road surface 3d reconstruction based on dense subpixel disparity map estimation

Arxiv

3+阅读 · 2018年7月5日

VIP会员

文章信息

相关主题

相关VIP内容

多标签学习的新趋势（2020 Survey）

多标签学习的新趋势（2020 Survey）

专知会员服务

44+阅读 · 2020年12月6日

【NeurIPS2020】可靠图神经网络鲁棒聚合

【NeurIPS2020】可靠图神经网络鲁棒聚合

专知会员服务

20+阅读 · 2020年11月6日

【清华大学】图随机神经网络，Graph Random Neural Networks

【清华大学】图随机神经网络，Graph Random Neural Networks

专知会员服务

156+阅读 · 2020年5月26日

【ACL2020-Google】BLEURT:一种基于迁移学习的自然语言生成度量

【ACL2020-Google】BLEURT:一种基于迁移学习的自然语言生成度量

专知会员服务

20+阅读 · 2020年5月12日

【Google AI】开源NoisyStudent：自监督图像分类

【Google AI】开源NoisyStudent：自监督图像分类

专知会员服务

55+阅读 · 2020年2月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人机协同时代的军事指挥控制演进

《英国智库：瓦解俄罗斯防空系统生产，夺回制空权》最新报告

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

《战术突击工具包：军队的“边缘”操作系统》报告

相关资讯

Google Research Football (scenario 2) 实验

Google Research Football (scenario 2) 实验

CreateAMind

8+阅读 · 2019年8月29日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【大数据】StreamSets：一个大数据采集工具

【大数据】StreamSets：一个大数据采集工具

产业智能官

40+阅读 · 2018年12月5日

【泡泡一分钟】一种实用且高效的多视图匹配方法

【泡泡一分钟】一种实用且高效的多视图匹配方法

泡泡机器人SLAM

6+阅读 · 2018年11月19日

【泡泡一分钟】基于均值偏移聚类方法的3D点云配准算法（3dv-49）

【泡泡一分钟】基于均值偏移聚类方法的3D点云配准算法（3dv-49）

泡泡机器人SLAM

6+阅读 · 2018年2月28日

Python机器学习教程资料/代码

Python机器学习教程资料/代码

机器学习研究会

8+阅读 · 2018年2月22日

【推荐】树莓派/OpenCV/dlib人脸定位/瞌睡检测

【推荐】树莓派/OpenCV/dlib人脸定位/瞌睡检测

机器学习研究会

9+阅读 · 2017年10月24日

【回顾】用于目标检测的DSOD模型（ICCV 2017）

【回顾】用于目标检测的DSOD模型（ICCV 2017）

AI研习社

3+阅读 · 2017年10月16日

相关论文

Integração e Entrega Contínua para aplicações móveis desenvolvidas em React Native

Integração e Entrega Contínua para aplicações móveis desenvolvidas em React Native

Arxiv

0+阅读 · 2021年3月30日

Strong Optimal Classification Trees

Arxiv

0+阅读 · 2021年3月29日

Improving Unsupervised Image Clustering With Robust Learning

Improving Unsupervised Image Clustering With Robust Learning

Arxiv

0+阅读 · 2021年3月29日

Monte Carlo algorithm for the extrema of tempered stable processes

Arxiv

0+阅读 · 2021年3月29日

Fitting the Search Space of Weight-sharing NAS with Graph Convolutional Networks

Arxiv

7+阅读 · 2020年12月15日

Products of Euclidean metrics and applications to proximity questions among curves

Arxiv

3+阅读 · 2020年4月13日

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Arxiv

18+阅读 · 2019年9月25日

Reinforcement Learning with Perturbed Rewards

Arxiv

4+阅读 · 2018年10月5日

Geometry-Based Multiple Camera Head Detection in Dense Crowds

Geometry-Based Multiple Camera Head Detection in Dense Crowds

Arxiv

3+阅读 · 2018年8月2日

Road surface 3d reconstruction based on dense subpixel disparity map estimation

Arxiv

3+阅读 · 2018年7月5日

微信扫码咨询专知VIP会员