快速数学函数, 声音 (Faster Math Functions, Soundly) - 专知论文

会员服务 ·

0

泛函 · 容差 · 模型评估 · 整数线性规划 · 优化器 ·

2021 年 7 月 12 日

Faster Math Functions, Soundly

翻译：快速数学函数, 声音

Ian Briggs,Pavel Panchekha

Standard library implementations of functions like sin and exp optimize for accuracy, not speed, because they are intended for general-purpose use. But applications tolerate inaccuracy from cancellation, rounding error, and singularities-sometimes even very high error-and many application could tolerate error in function implementations as well. This raises an intriguing possibility: speeding up numerical code by tuning standard function implementations. This paper thus introduces OpTuner, an automatic method for selecting the best implementation of mathematical functions at each use site. OpTuner assembles dozens of implementations for the standard mathematical functions from across the speed-accuracy spectrum. OpTuner then uses error Taylor series and integer linear programming to compute optimal assignments of function implementation to use site and presents the user with a speed-accuracy Pareto curve they can use to speed up their code. In a case study on the POV-Ray ray tracer, OpTuner speeds up a critical computation, leading to a whole program speedup of 9% with no change in the program output (whereas human efforts result in slower code and lower-quality output). On a broader study of 37 standard benchmarks, OpTuner matches 216 implementations to 89 use sites and demonstrates speed-ups of 107% for negligible decreases in accuracy and of up to 438% for error-tolerant applications.

翻译：标准图书馆执行功能, 如罪与表达的精度, 而不是速度, 因为它们是通用的。但应用程序容忍取消、圆差和奇数的不准确性, 有时甚至非常高的错误和许多应用程序可能容忍功能执行中的错误。这提出了一种令人感兴趣的可能性: 通过调整标准功能执行来加快数字代码。本文由此介绍了 OpTuner, 这是在每个使用网站选择数学函数最佳执行的自动方法 OpTuner 。 OpTuner 收集了几十个标准数学函数的全程序执行, 并且没有变化。 OpTuner 使用泰勒序列和整线性线性编程来计算功能执行的最佳任务, 以便使用网站, 并给用户提供一个速度精确的Pareto 曲线, 以加速执行其代码。在对 POV- Ray 射线追踪器的案例研究中, OpTuner 加快了关键计算, 导致整个程序加速了9%, 程序输出没有变化( 人类努力的结果是更慢的代码和低质量的输出 ) 。在更宽的进度点上, 展示了107- 标准点的精确性 4, 将执行速度比 10 的进度比的进度点的进度比 4 。

0

相关内容

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

最新《非光滑优化》十讲硬核课程，剑桥大学梁经纬博士主讲

最新《非光滑优化》十讲硬核课程，剑桥大学梁经纬博士主讲

专知会员服务

34+阅读 · 2020年8月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

162+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

一文读懂Faster RCNN

一文读懂Faster RCNN

极市平台

5+阅读 · 2020年1月6日

意识是一种数学模式

意识是一种数学模式

CreateAMind

3+阅读 · 2019年6月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Implicit Regularization Effects of the Sobolev Norms in Image Processing

Arxiv

0+阅读 · 2021年9月13日

Adversarial Examples for Evaluating Math Word Problem Solvers

Arxiv

0+阅读 · 2021年9月13日

DEF: Deep Estimation of Sharp Geometric Features in 3D Shapes

Arxiv

0+阅读 · 2021年9月10日

Adjoint Differentiation for generic matrix functions

Arxiv

0+阅读 · 2021年9月10日

Mesh convolutional neural networks for wall shear stress estimation in 3D artery models

Arxiv

0+阅读 · 2021年9月10日

On a Numerical Solution of Structural model for the Probability of Defaultunder a Regime-Switching Synchronous-Jump Tempered Stable LévyModel with Desingularized Meshfree Collocation method

Arxiv

0+阅读 · 2021年9月10日

FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging

Arxiv

0+阅读 · 2021年9月9日

Testing Matrix Rank, Optimally

Arxiv

3+阅读 · 2018年10月18日

Spectral Network Embedding: A Fast and Scalable Method via Sparsity

Arxiv

3+阅读 · 2018年6月7日

Being Robust (in High Dimensions) Can Be Practical

Arxiv

3+阅读 · 2017年12月14日

VIP会员

文章信息

相关主题

整数线性规划

相关VIP内容

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

最新《非光滑优化》十讲硬核课程，剑桥大学梁经纬博士主讲

最新《非光滑优化》十讲硬核课程，剑桥大学梁经纬博士主讲

专知会员服务

34+阅读 · 2020年8月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

162+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

自动驾驶轨迹规划中的基础模型：进展综述与开放挑战

《用于提升多域战备的大型语言模型辅助场景生成器》报告

【斯坦福博士论文】为人类使用优化 AI 模型

国防领域人工智能规模化应用的理论与实践

相关资讯

一文读懂Faster RCNN

一文读懂Faster RCNN

极市平台

5+阅读 · 2020年1月6日

意识是一种数学模式

意识是一种数学模式

CreateAMind

3+阅读 · 2019年6月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

分布式TensorFlow入门指南

分布式TensorFlow入门指南

机器学习研究会

4+阅读 · 2017年11月28日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

【推荐】SVM实例教程

【推荐】SVM实例教程

机器学习研究会

17+阅读 · 2017年8月26日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Implicit Regularization Effects of the Sobolev Norms in Image Processing

Arxiv

0+阅读 · 2021年9月13日

Adversarial Examples for Evaluating Math Word Problem Solvers

Arxiv

0+阅读 · 2021年9月13日

DEF: Deep Estimation of Sharp Geometric Features in 3D Shapes

Arxiv

0+阅读 · 2021年9月10日

Adjoint Differentiation for generic matrix functions

Arxiv

0+阅读 · 2021年9月10日

Mesh convolutional neural networks for wall shear stress estimation in 3D artery models

Arxiv

0+阅读 · 2021年9月10日

On a Numerical Solution of Structural model for the Probability of Defaultunder a Regime-Switching Synchronous-Jump Tempered Stable LévyModel with Desingularized Meshfree Collocation method

Arxiv

0+阅读 · 2021年9月10日

FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging

Arxiv

0+阅读 · 2021年9月9日

Testing Matrix Rank, Optimally

Arxiv

3+阅读 · 2018年10月18日

Spectral Network Embedding: A Fast and Scalable Method via Sparsity

Arxiv

3+阅读 · 2018年6月7日

Being Robust (in High Dimensions) Can Be Practical

Arxiv

3+阅读 · 2017年12月14日

微信扫码咨询专知VIP会员