Bitcoin (BTC) is probably the most transparent payment network in the world, thanks to the full history of transactions available to the public. Though, Bitcoin is not a fully anonymous environment, rather a pseudonymous one, accounting for a number of attempts to beat its pseudonimity using clustering techniques. There is, however, a recurring assumption in all the cited deanonymization techniques: that each transaction output has an address attached to it. That assumption is false. An evidence is that, as of block height 591,872, there are several millions transactions with at least one output for which the Bitcoin Core client cannot infer an address. In this paper, we present a novel approach based on sound graph theory for identifying transaction inputs and outputs. Our solution implements two simple yet innovative features: it does not rely on BTC addresses and explores all the transactions stored in the blockchain. All the other existing solutions fail with respect to one or both of the cited features. In detail, we first introduce the concept of Unknown Transaction and provide a new framework to parse the Bitcoin blockchain by taking them into account. Then, we introduce a theoretical model to detect, study, and classify -- for the first time in the literature -- unknown transaction patterns in the user network. Further, in an extensive experimental campaign, we apply our model to the Bitcoin network to uncover hidden transaction patterns within the Bitcoin user network. Results are striking: we discovered more than 30,000 unknown transaction DAGs, with a few of them exhibiting a complex yet ordered topology and potentially connected to automated payment services. To the best of our knowledge, the proposed framework is the only one that enables a complete study of the unknown transaction patterns, hence enabling further research in the fields -- for which we provide some directions.
翻译:Bitcoin (BTC) 可能是世界上最透明的支付网络。 由于向公众开放的交易历史悠久, Bitcoin (BTC) 可能是世界上最透明的支付网络。 虽然Bitcoin 并不是完全匿名的环境, 却是一个假名化的环境, 说明一些试图利用集群技术击败其假名的尝试。 然而, 在所有引用的匿名技术中, 都有一个反复的假设: 每个交易输出都附着一个地址。 这个假设是错误的。 证据是, 在块高度591, 872上, 至少有一百万个至少连接了比特币核心客户无法推断地址的复杂产出的交易。 在本文中, 我们展示了一种基于健全的图表理论理论的办法来识别交易投入和产出。 我们的解决方案采用了两种简单而创新的模式: 它不依赖于 BTC 地址, 并探索了所有存储在铁链中的所有交易。 所有其他现有的解决方案都无法与所引用的某个或两个特征相匹配。 详细地说, 我们首先引入了未知的交易交易的理念概念概念, 并且提供了一个新的框架, 通过将比特币链链链链链接的链路输入了它们。 然后, 我们将一个未知的模型中, 将一个未知的模型用于一个未知的网络 。