Financial fraud cases are on the rise even with the current technological advancements. Due to the lack of inter-organization synergy and because of privacy concerns, authentic financial transaction data is rarely available. On the other hand, data-driven technologies like machine learning need authentic data to perform precisely in real-world systems. This study proposes a blockchain and smart contract-based approach to achieve robust Machine Learning (ML) algorithm for e-commerce fraud detection by facilitating inter-organizational collaboration. The proposed method uses blockchain to secure the privacy of the data. Smart contract deployed inside the network fully automates the system. An ML model is incrementally upgraded from collaborative data provided by the organizations connected to the blockchain. To incentivize the organizations, we have introduced an incentive mechanism that is adaptive to the difficulty level in updating a model. The organizations receive incentives based on the difficulty faced in updating the ML model. A mining criterion has been proposed to mine the block efficiently. And finally, the blockchain network istested under different difficulty levels and under different volumes of data to test its efficiency. The model achieved 98.93% testing accuracy and 98.22% Fbeta score (recall-biased f measure) over eight incremental updates. Our experiment shows that both data volume and difficulty level of blockchain impacts the mining time. For difficulty level less than five, mining time and difficulty level has a positive correlation. For difficulty level two and three, less than a second is required to mine a block in our system. Difficulty level five poses much more difficulties to mine the blocks.
翻译:金融欺诈案件正在上升,即使随着当前的技术进步,目前金融欺诈案件也在增加。由于缺乏组织间协同作用,也由于隐私问题,真正的金融交易数据很少。另一方面,由数据驱动的技术,如机器学习需要真实的数据,才能在现实世界系统中准确运行。本研究报告提出采用一个链链和智能合同型方法,通过促进组织间合作,实现强有力的电子商务欺诈检测机械学习算法。拟议方法使用链块来保障数据的隐私。在网络内部署的智能合同完全自动化了系统。ML模型从与链链相连的组织提供的合作数据中逐步升级。为了激励各组织,我们引入了一种奖励机制,在更新模型时需要适应困难程度。各组织根据在更新ML模型时面临的困难而获得奖励。已经提议了一个采矿标准,通过促进组织间合作,高效率地进行采矿。在不同的困难程度和不同数量的数据中测试了块链网络,在98.93%中测试准确度和98.22%的Fetta模型,为了激励各组织,在更新模型时差的难度程度上,比我们连续5级的难度要低。在8级和递增的采矿难度水平上,一个时间级数据水平比我们之间的难度要低。