Over the years, web content has evolved from simple text and static images hosted on a single server to a complex, interactive and multimedia-rich content hosted on different servers. As a result, a modern website during its loading time fetches content not only from its owner's domain but also from a range of third-party domains providing additional functionalities and services. Here we infer the network of the third-party domains by observing the domains' interactions within users' browsers from all over the globe. We find that this network possesses structural properties commonly found in other complex networks in nature and society, such as power-law degree distribution, strong clustering, and the small-world property. These properties imply that a hyperbolic geometry underlies the ecosystem's topology and we use statistical inference methods to find the domains' coordinates in this geometry, which abstract how popular and similar the domains are. The hyperbolic map we obtain is meaningful, revealing collaborations between controversial services and social networks that have not been previously revealed. Furthermore, the map can facilitate applications, such as the prediction of third-party domains co-hosting on the same physical machine, and merging in terms of company acquisition. Such predictions cannot be made by just observing the domains' interactions within the users' browsers.
翻译:多年来, 网络内容已经从一个服务器上的简单文本和静态图像演变成一个复杂、互动和多媒体内容的服务器。 因此, 一个现代网站在装货时间里不仅从所有者领域获取内容,而且从提供额外功能和服务的一系列第三方领域获取内容。 我们在这里通过观察全球各地用户浏览器内域际互动来推断第三方领域的网络。 我们发现, 这个网络拥有自然和社会上其他复杂网络中常见的结构属性, 如权力法度分布、强大的集群和小世界属性。 这些属性意味着, 生态系统的表层和我们使用统计推论方法来查找地理坐标, 这个地理测量方法抽象地说明了域有多受欢迎和类似的领域。 我们获得的双向地图是有意义的, 揭示了以前未曾披露过的有争议的服务和社会网络之间的协作。 此外, 地图可以促进应用, 比如预测第三方域域内在同一物理机器上的共同托管, 以及将公司用户合并到服务器中。 这样的预测不能通过服务器来观测。