Linear algebra expressions, which play a central role in countless scientific computations, are often evaluated via a sequence of calls to existing libraries of building blocks (such as those provided by BLAS and LAPACK). A sequence identifies a computing strategy, i.e., an algorithm, and for one linear algebra expression many alternative algorithms normally exist. Although mathematically equivalent, those algorithms can exhibit significant differences in performance. Several high-level languages and tools for matrix computations, such as Julia, Armadillo, and Linnea, make algorithmic choices by minimizing the number of Floating Point Operations (FLOPs). However, several algorithms may share the same (or nearly identical) number of FLOPs; in many cases, these algorithms exhibit execution times that are statistically equivalent, and one could arbitrarily select any of them as the best algorithm. It is, however, not unlikely to find cases where the execution times differ significantly from one another (despite the FLOP counts being almost the same). It is also possible that the algorithm that minimizes FLOPs is not the one that minimizes execution time. In this work, we develop a methodology to test the reliability of FLOPs as a discriminant for linear algebra algorithms. Given a set of algorithms (for an instance of a linear algebra expression) as input, the methodology ranks them into performance classes; algorithms in the same class are statistically equivalent in performance. To this end, we measure the algorithms iteratively until the changes in the ranks converge to a value close to zero. FLOPs are a valid discriminant for an instance if all the algorithms with minimum FLOPs are assigned the best rank; otherwise, the instance is regarded as an anomaly, which can then be used to investigate the root cause of performance differences.
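To make the final decision rule concrete, the following is a minimal sketch (not the authors' implementation) of the FLOPs-as-discriminant check: given hypothetical per-algorithm FLOP counts and the performance-class ranks produced by the iterative measurement procedure, it decides whether FLOPs are a valid discriminant for the instance. The function name, the input dictionaries, and the tolerance for "nearly identical" FLOP counts are all illustrative assumptions.

```python
def flops_are_valid_discriminant(flop_counts, ranks, rel_tol=1e-2):
    """flop_counts and ranks are dicts keyed by algorithm name.

    rel_tol treats FLOP counts within 1% of the minimum as 'minimum FLOPs'
    (an assumed threshold; the text only speaks of 'nearly identical' counts).
    Rank 1 denotes the best (fastest) performance class.
    """
    min_flops = min(flop_counts.values())
    best_rank = min(ranks.values())
    min_flop_algs = [a for a, f in flop_counts.items()
                     if f <= min_flops * (1 + rel_tol)]
    # FLOPs discriminate correctly only if every minimum-FLOP algorithm
    # lands in the best performance class; otherwise the instance is an anomaly.
    return all(ranks[a] == best_rank for a in min_flop_algs)


if __name__ == "__main__":
    flops = {"alg1": 2.0e9, "alg2": 2.0e9, "alg3": 3.1e9}  # hypothetical FLOP counts
    ranks = {"alg1": 1, "alg2": 2, "alg3": 1}              # hypothetical rank classes
    print(flops_are_valid_discriminant(flops, ranks))      # False -> anomaly
```

In this toy example, alg2 minimizes FLOPs but does not fall in the best performance class, so the instance would be flagged as an anomaly.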