Honeycomb: 基于 FPGA 的 SmartNIC 上有序键值存储加速 (Honeycomb: ordered key-value store acceleration on an FPGA-based SmartNIC) - 专知论文

会员服务 ·

0

键值存储 · 有序 · FPGA · 存储 · 负载 ·

2023 年 3 月 24 日

Honeycomb: ordered key-value store acceleration on an FPGA-based SmartNIC

翻译：Honeycomb: 基于 FPGA 的 SmartNIC 上有序键值存储加速

Junyi Liu,Aleksandar Dragojevic,Shane Flemming,Antonios Katsarakis,Dario Korolija,Igor Zablotchi,Ho-cheung Ng,Anuj Kalia,Miguel Castro

In-memory ordered key-value stores are an important building block in modern distributed applications. We present Honeycomb, a hybrid software-hardware system for accelerating read-dominated workloads on ordered key-value stores that provides linearizability for all operations including scans. Honeycomb stores a B-Tree in host memory, and executes SCAN and GET on an FPGA-based SmartNIC, and PUT, UPDATE and DELETE on the CPU. This approach enables large stores and simplifies the FPGA implementation but raises the challenge of data access and synchronization across the slow PCIe bus. We describe how Honeycomb overcomes this challenge with careful data structure design, caching, request parallelism with out-of-order request execution, wait-free read operations, and batching synchronization between the CPU and the FPGA. For read-heavy YCSB workloads, Honeycomb improves the throughput of a state-of-the-art ordered key-value store by at least 1.8x. For scan-heavy workloads inspired by cloud storage, Honeycomb improves throughput by more than 2x. The cost-performance, which is more important for large-scale deployments, is improved by at least 1.5x on these workloads.

翻译：内存中的有序键值存储是现代分布式应用程序的重要构建块。我们提出了 Honeycomb，一种混合软件和硬件系统，用于加速面向读取的有序键值存储上的工作负载，为包括扫描在内的所有操作提供了线性化。Honeycomb 在主机内存中存储 B-Tree，并在基于 FPGA 的 SmartNIC 上执行 SCAN 和 GET，在 CPU 上执行 PUT、UPDATE 和 DELETE。这种方法可以实现大型存储并简化 FPGA 实现，但也带来了跨慢速 PCIe 总线的数据访问和同步挑战。我们描述了 Honeycomb 如何通过仔细的数据结构设计、缓存、带有乱序请求执行的请求并行性、无等待读操作以及 CPU 和 FPGA 之间的批量同步来克服这个挑战。对于面向读取的 YCSB 工作负载，Honeycomb 将一种最先进的有序键值存储的吞吐量提高了至少 1.8 倍。对于受云存储启发的扫描重负载，Honeycomb 将吞吐量提高了 2 倍以上。对于这些工作负载，成本性能提高了至少 1.5 倍，这对于大规模部署更为重要。

0

相关内容

键值存储

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

专知会员服务

19+阅读 · 2022年3月13日

【2021新书】ApachePulsar 实战，402页pdf

【2021新书】ApachePulsar 实战，402页pdf

专知会员服务

72+阅读 · 2021年12月29日

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

专知会员服务

105+阅读 · 2021年10月30日

【Manning新书】C++并行实战，592页pdf，C++ Concurrency in Action

【Manning新书】C++并行实战，592页pdf，C++ Concurrency in Action

专知会员服务

63+阅读 · 2021年1月16日

【2020新书】Kafka权威指南，322页pdf，Apache Kafka: The Definitive Guide

【2020新书】Kafka权威指南，322页pdf，Apache Kafka: The Definitive Guide

专知会员服务

21+阅读 · 2020年10月26日

【阿里巴巴达摩院】TResNet: 高性能的GPU专用架构，GPU-Dedicated Architecture

【阿里巴巴达摩院】TResNet: 高性能的GPU专用架构，GPU-Dedicated Architecture

专知会员服务

33+阅读 · 2020年4月1日

【DeepMind-ICLR2020】MEMO-情景记忆的灵活组合的深层网络，A DEEP NETWORK FOR FLEXIBLE COMBINATION OF EPISODIC MEMORIES

【DeepMind-ICLR2020】MEMO-情景记忆的灵活组合的深层网络，A DEEP NETWORK FOR FLEXIBLE COMBINATION OF EPISODIC MEMORIES

专知会员服务

18+阅读 · 2020年2月2日

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

专知会员服务

61+阅读 · 2019年12月29日

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

专知会员服务

28+阅读 · 2019年11月14日

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

不再让CPU和总线拖后腿：Exafunction让GPU跑的更快！

不再让CPU和总线拖后腿：Exafunction让GPU跑的更快！

机器之心

0+阅读 · 2022年10月7日

Istio 宣布引入全新的无 sidecar 数据平面模式，sidecar 模式依然保留

Istio 宣布引入全新的无 sidecar 数据平面模式，sidecar 模式依然保留

InfoQ

0+阅读 · 2022年9月11日

Presto on Apache Kafka 在 Uber的大规模应用

Presto on Apache Kafka 在 Uber的大规模应用

AI前线

0+阅读 · 2022年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

TensorFlow 2.0新特性之Ragged Tensor

TensorFlow 2.0新特性之Ragged Tensor

深度学习每日摘要

18+阅读 · 2019年4月5日

Github项目推荐 | 推荐系统实例与最佳实践 by 微软

Github项目推荐 | 推荐系统实例与最佳实践 by 微软

AI研习社

20+阅读 · 2019年1月2日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

基于非易失内存设备的数据读写性能优化方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

千核级通用微处理器共享存储体系结构研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

自旋轨道耦合玻色-爱因斯坦凝聚体中无序效应的研究

国家自然科学基金

0+阅读 · 2013年12月31日

桌面网格平台上的BESIII离线物理软件和调度策略研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向高精度计算领域动态可配置加速器体系结构研究

国家自然科学基金

0+阅读 · 2013年12月31日

番茄抗病膜蛋白TARK1稳定性的调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Steger-Warming FVS 的长管道气液两相瞬变流计算及其水锤的气阀防护研究

国家自然科学基金

0+阅读 · 2012年12月31日

CUDA、OpenMP和MPI混合加速的隐式粒子模拟算法与框架研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于链接权重化的链接预测研究

国家自然科学基金

1+阅读 · 2011年12月31日

Human Gait Database for Normal Walk Collected by Smartphone Accelerometer

Arxiv

0+阅读 · 2023年5月16日

A Survey on Automated Program Repair Techniques

Arxiv

0+阅读 · 2023年5月13日

Research Focused Software Development Kits and Wearable Devices in Physical Activity Research

Arxiv

0+阅读 · 2023年5月12日

PreNAS: Preferred One-Shot Learning Towards Efficient Neural Architecture Search

Arxiv

0+阅读 · 2023年5月12日

SigRec: Automatic Recovery of Function Signatures in Smart Contracts

Arxiv

0+阅读 · 2023年5月11日

DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network

Arxiv

11+阅读 · 2023年3月5日

Neural Architecture Search without Training

Neural Architecture Search without Training

Arxiv

10+阅读 · 2021年6月11日

ReNAS:Relativistic Evaluation of Neural Architecture Search

Arxiv

11+阅读 · 2021年3月10日

A Survey of Machine Learning for Computer Architecture and Systems

Arxiv

18+阅读 · 2021年2月16日

A Survey on Deep Transfer Learning

A Survey on Deep Transfer Learning

Arxiv

11+阅读 · 2018年8月6日

VIP会员

文章信息

相关主题

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

如何使用TensorFlow 排序构建推荐系统? How to build a recommendation system using TensorFlow Ranking?

专知会员服务

19+阅读 · 2022年3月13日

【2021新书】ApachePulsar 实战，402页pdf

【2021新书】ApachePulsar 实战，402页pdf

专知会员服务

72+阅读 · 2021年12月29日

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

【2021新书】并行高性能计算，705页pdf，Parallel and High Performance Computing

专知会员服务

105+阅读 · 2021年10月30日

【Manning新书】C++并行实战，592页pdf，C++ Concurrency in Action

【Manning新书】C++并行实战，592页pdf，C++ Concurrency in Action

专知会员服务

63+阅读 · 2021年1月16日

【2020新书】Kafka权威指南，322页pdf，Apache Kafka: The Definitive Guide

【2020新书】Kafka权威指南，322页pdf，Apache Kafka: The Definitive Guide

专知会员服务

21+阅读 · 2020年10月26日

【阿里巴巴达摩院】TResNet: 高性能的GPU专用架构，GPU-Dedicated Architecture

【阿里巴巴达摩院】TResNet: 高性能的GPU专用架构，GPU-Dedicated Architecture

专知会员服务

33+阅读 · 2020年4月1日

【DeepMind-ICLR2020】MEMO-情景记忆的灵活组合的深层网络，A DEEP NETWORK FOR FLEXIBLE COMBINATION OF EPISODIC MEMORIES

【DeepMind-ICLR2020】MEMO-情景记忆的灵活组合的深层网络，A DEEP NETWORK FOR FLEXIBLE COMBINATION OF EPISODIC MEMORIES

专知会员服务

18+阅读 · 2020年2月2日

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

专知会员服务

61+阅读 · 2019年12月29日

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

专知会员服务

28+阅读 · 2019年11月14日

热门VIP内容

开通专知VIP会员享更多权益服务

《人工智能绝不能完全自主》

《人工智能的法律与伦理：军事自主机器独特挑战的深度剖析》316页

从数据到主导：AI与兵棋推演构筑决策优势

《特洛伊木马货柜：武器化集装箱的战略威胁》最新报告

相关资讯

GNN 新基准！Long Range Graph Benchmark

GNN 新基准！Long Range Graph Benchmark

图与推荐

0+阅读 · 2022年10月18日

不再让CPU和总线拖后腿：Exafunction让GPU跑的更快！

不再让CPU和总线拖后腿：Exafunction让GPU跑的更快！

机器之心

0+阅读 · 2022年10月7日

Istio 宣布引入全新的无 sidecar 数据平面模式，sidecar 模式依然保留

Istio 宣布引入全新的无 sidecar 数据平面模式，sidecar 模式依然保留

InfoQ

0+阅读 · 2022年9月11日

Presto on Apache Kafka 在 Uber的大规模应用

Presto on Apache Kafka 在 Uber的大规模应用

AI前线

0+阅读 · 2022年6月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

TensorFlow 2.0新特性之Ragged Tensor

TensorFlow 2.0新特性之Ragged Tensor

深度学习每日摘要

18+阅读 · 2019年4月5日

Github项目推荐 | 推荐系统实例与最佳实践 by 微软

Github项目推荐 | 推荐系统实例与最佳实践 by 微软

AI研习社

20+阅读 · 2019年1月2日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

相关论文

Human Gait Database for Normal Walk Collected by Smartphone Accelerometer

Arxiv

0+阅读 · 2023年5月16日

A Survey on Automated Program Repair Techniques

Arxiv

0+阅读 · 2023年5月13日

Research Focused Software Development Kits and Wearable Devices in Physical Activity Research

Arxiv

0+阅读 · 2023年5月12日

PreNAS: Preferred One-Shot Learning Towards Efficient Neural Architecture Search

Arxiv

0+阅读 · 2023年5月12日

SigRec: Automatic Recovery of Function Signatures in Smart Contracts

Arxiv

0+阅读 · 2023年5月11日

DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network

Arxiv

11+阅读 · 2023年3月5日

Neural Architecture Search without Training

Neural Architecture Search without Training

Arxiv

10+阅读 · 2021年6月11日

ReNAS:Relativistic Evaluation of Neural Architecture Search

Arxiv

11+阅读 · 2021年3月10日

A Survey of Machine Learning for Computer Architecture and Systems

Arxiv

18+阅读 · 2021年2月16日

A Survey on Deep Transfer Learning

A Survey on Deep Transfer Learning

Arxiv

11+阅读 · 2018年8月6日

相关基金

基于非易失内存设备的数据读写性能优化方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

千核级通用微处理器共享存储体系结构研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

自旋轨道耦合玻色-爱因斯坦凝聚体中无序效应的研究

国家自然科学基金

0+阅读 · 2013年12月31日

桌面网格平台上的BESIII离线物理软件和调度策略研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向高精度计算领域动态可配置加速器体系结构研究

国家自然科学基金

0+阅读 · 2013年12月31日

番茄抗病膜蛋白TARK1稳定性的调控机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Steger-Warming FVS 的长管道气液两相瞬变流计算及其水锤的气阀防护研究

国家自然科学基金

0+阅读 · 2012年12月31日

CUDA、OpenMP和MPI混合加速的隐式粒子模拟算法与框架研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于链接权重化的链接预测研究

国家自然科学基金

1+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员