自定义 8 位浮动点值格式, 用于在近邻搜索中减少共享内存银行冲突 (Custom 8-bit floating point value format for reducing shared memory bank conflict in approximate nearest neighbor search) - 专知论文

会员服务 ·

0

可约的 · 近邻 · 近似 · INFORMS · 查全率/召回率 ·

2023 年 1 月 17 日

Custom 8-bit floating point value format for reducing shared memory bank conflict in approximate nearest neighbor search

翻译：自定义 8 位浮动点值格式, 用于在近邻搜索中减少共享内存银行冲突

Hiroyuki Ootomo,Akira Naruse

from arxiv, Extended "extended abstract of the SC22 research poster"

The k-nearest neighbor search is used in various applications such as machine learning, computer vision, database search, and information retrieval. While the computational cost of the exact nearest neighbor search is enormous, an approximate nearest neighbor search (ANNS) has been attracting much attention. IVFPQ is one of the ANNS methods. Although we can leverage the high bandwidth and low latency of shared memory to compute the search phase of the IVFPQ on NVIDIA GPUs, the throughput can degrade due to shared memory bank conflict. To reduce the bank conflict and improve the search throughput, we propose a custom 8-bit floating point value format. This format doesn't have a sign bit and can be converted from/to FP32 with a few instructions. We use this format for IVFPQ on GPUs and achieved better performance without significant recall loss compared to FP32 and FP16.

翻译：k 最近的邻居搜索被用于机器学习、计算机视觉、数据库搜索和信息检索等各种应用程序。虽然近邻搜索的计算成本非常巨大, 但近邻搜索( ANNS) 的计算成本非常高, 吸引了人们的注意。 IVFPQ 是 ANNS 方法之一。尽管我们可以利用高带宽和低延迟的共享内存来计算 NVIDIA GPUs 上的IVFPQ 搜索阶段, 但是由于共享的内存银行冲突, 吞吐量可以降解。为了减少银行冲突, 改进搜索量, 我们建议了一种定制的 8- 位浮动点值格式。这个格式没有符号, 并且可以转换为 FP32, 并有几处指示。我们用这个格式来计算 GPUs 上的 IVFPQ, 并在不与 FP32 和 FP16 相比出现重大回扣损失的情况下实现更好的业绩。

0

相关内容

可约的

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

在化合物GL-V9 诱导肝癌细胞凋亡中，蛋白E2F-1调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

高光谱光学近场显微成像方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

哺乳动物体细胞克隆胚胎抗氧化机理的研究

国家自然科学基金

0+阅读 · 2012年12月31日

多酸基生物质碳纳米管固体催化剂的制备及催化性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

MICU1调控线粒体钙离子摄取的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

固液界面体系下硫化矿物表面结构、性质及药剂吸附的密度泛函研究

国家自然科学基金

0+阅读 · 2011年12月31日

肾康丸对糖尿病肾病大鼠miR-192介导通路的影响

国家自然科学基金

1+阅读 · 2009年12月31日

碳纳米管和富勒烯诱导高分子结晶微观机理的分子动力学研究

国家自然科学基金

0+阅读 · 2008年12月31日

QVRF: A Quantization-error-aware Variable Rate Framework for Learned Image Compression

Arxiv

0+阅读 · 2023年3月10日

Machine Learning-based Framework for Optimally Solving the Analytical Inverse Kinematics for Redundant Manipulators

Arxiv

0+阅读 · 2023年3月9日

Greener yet Powerful: Taming Large Code Generation Models with Quantization

Arxiv

0+阅读 · 2023年3月9日

Cones: Concept Neurons in Diffusion Models for Customized Generation

Arxiv

0+阅读 · 2023年3月9日

Inequality Restricted Estimator for Gamma Regression: Bayesian approach as a solution to the Multicollinearity

Arxiv

0+阅读 · 2023年3月9日

Improved Trajectory Reconstruction for Markerless Pose Estimation

Arxiv

0+阅读 · 2023年3月8日

Exploring Adversarial Attacks on Neural Networks: An Explainable Approach

Arxiv

0+阅读 · 2023年3月8日

Principal Component Analysis of Two-dimensional Functional Data with Serial Correlation

Arxiv

0+阅读 · 2023年3月8日

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Arxiv

18+阅读 · 2019年9月25日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

VIP会员

文章信息

相关主题

查全率/召回率

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《面向小型无人机或无人飞行器的创新雷达探测与人工智能分类技术》263页

在无标注条件下适配视觉—语言模型：全面综述

《美空军条令出版物：战略打击》最新条令

《高能激光武器》22页slides

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

QVRF: A Quantization-error-aware Variable Rate Framework for Learned Image Compression

Arxiv

0+阅读 · 2023年3月10日

Machine Learning-based Framework for Optimally Solving the Analytical Inverse Kinematics for Redundant Manipulators

Arxiv

0+阅读 · 2023年3月9日

Greener yet Powerful: Taming Large Code Generation Models with Quantization

Arxiv

0+阅读 · 2023年3月9日

Cones: Concept Neurons in Diffusion Models for Customized Generation

Arxiv

0+阅读 · 2023年3月9日

Inequality Restricted Estimator for Gamma Regression: Bayesian approach as a solution to the Multicollinearity

Arxiv

0+阅读 · 2023年3月9日

Improved Trajectory Reconstruction for Markerless Pose Estimation

Arxiv

0+阅读 · 2023年3月8日

Exploring Adversarial Attacks on Neural Networks: An Explainable Approach

Arxiv

0+阅读 · 2023年3月8日

Principal Component Analysis of Two-dimensional Functional Data with Serial Correlation

Arxiv

0+阅读 · 2023年3月8日

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Extreme Language Model Compression with Optimal Subwords and Shared Projections

Arxiv

18+阅读 · 2019年9月25日

3D Hand Shape and Pose Estimation from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

Arxiv

17+阅读 · 2019年3月3日

相关基金

在化合物GL-V9 诱导肝癌细胞凋亡中，蛋白E2F-1调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

高光谱光学近场显微成像方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

哺乳动物体细胞克隆胚胎抗氧化机理的研究

国家自然科学基金

0+阅读 · 2012年12月31日

多酸基生物质碳纳米管固体催化剂的制备及催化性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

MICU1调控线粒体钙离子摄取的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

固液界面体系下硫化矿物表面结构、性质及药剂吸附的密度泛函研究

国家自然科学基金

0+阅读 · 2011年12月31日

肾康丸对糖尿病肾病大鼠miR-192介导通路的影响

国家自然科学基金

1+阅读 · 2009年12月31日

碳纳米管和富勒烯诱导高分子结晶微观机理的分子动力学研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员