A large number of Data Marketplaces (DMs) have appeared in the last few years to help owners monetise their data, and data buyers fuel their marketing process, train their ML models, and perform other data-driven decision processes. In this paper, we present a first of its kind measurement study of the growing DM ecosystem and shed light on several totally unknown facts about it. For example, we show that the median price of live data products sold under a subscription model is around US\$1,400 per month. For one-off purchases of static data, the median price is around US\$2,200. We analyse the prices of different categories of data and show that products about telecommunications, manufacturing, automotive, and gaming command the highest prices. We also develop classifiers for comparing prices across different DMs as well as a regression analysis for revealing features that correlate with data product prices.
翻译:在过去几年里,出现了大量数据市场,以帮助所有者将其数据货币化,数据购买者为其营销过程提供燃料,培训其ML模型,并开展其他数据驱动的决策过程。在本文中,我们首次对DM生态系统的发展进行了实物计量研究,并揭示了这方面的若干完全未知的事实。例如,我们表明,在订阅模式下销售的活数据产品的中位价格约为每月1 400美元左右。对于一次性购买静态数据,中位价格约为2 200美元。我们分析了不同类别数据的价格,并表明有关电信、制造、汽车和赌博产品的价格最高。我们还开发了分类师,以比较不同DMS的价格,并进行了回归分析,以揭示与数据产品价格相关的特征。