As various forms of fraud proliferate on Ethereum, it is imperative to safeguard against these malicious activities and protect susceptible users from being victimized. While current studies rely solely on graph-based fraud detection approaches, we argue that they may not be well-suited for handling highly repetitive, skew-distributed, and heterogeneous Ethereum transactions. To address these challenges, we propose BERT4ETH, a universal pre-trained Transformer encoder that serves as an account representation extractor for detecting various fraud behaviors on Ethereum. BERT4ETH leverages the superior modeling capability of the Transformer to capture the dynamic sequential patterns inherent in Ethereum transactions, and addresses the challenges of pre-training a BERT model for Ethereum with three practical and effective strategies: repetitiveness reduction, skew alleviation, and heterogeneity modeling. Our empirical evaluation demonstrates that BERT4ETH significantly outperforms state-of-the-art methods on the phishing account detection and de-anonymization tasks. The code for BERT4ETH is available at: https://github.com/git-disl/BERT4ETH.