A Web API (WAPI) is a type of API whose interaction with its consumers is done through the Internet. While being accessed through the Internet can be challenging, mostly when WAPIs evolve, it gives providers the possibility to monitor their usage, and understand and analyze consumers' behavior. Currently, WAPI usage is mostly logged for traffic monitoring and troubleshooting. Even though they contain invaluable information \rev{regarding consumers' behavior}, they are not sufficiently used by providers. In this paper, \rev{we first consider two phases of the application development lifecycle, and based on them we distinguish two different types of usage logs, namely development logs and production logs. For each of them we show the potential analyses (e.g., WAPI usability evaluation, consumers' needs identification)} that can be performed, as well as the main impediments, that may be caused by the unsuitable log format. We then conduct a case study using logs of the same WAPI from different deployments and different formats, to demonstrate the occurrence of these impediments and at the same time the importance of a proper log format. Next, based on the case study results, we present the main quality issues of WAPI log data and explain their impact on data analyses. For each of them, we give some practical suggestions on how to deal with them, as well as mitigating their root cause.
翻译:网络 API (WAPI) 是一种通过互联网与消费者互动的 API (WAPI) 。 虽然通过互联网访问可能具有挑战性, 主要是在WAPI 演变时, 它让供应商有机会监测其使用情况, 并理解和分析消费者的行为。 目前, WAPI 的使用大多记录用于交通监测和排除麻烦。 尽管它包含宝贵的信息\ rev{rev{reme the Control actual }, 但它们并没有被提供者充分使用。 在本文中, 我们首先考虑应用开发生命周期的两个阶段, 并在此基础上我们区分两种不同的使用记录, 即开发日志和生产日志。 对于其中每一种, 我们展示了这些潜在分析(例如WAPI 使用率评估、消费者需求识别) 。 目前, 以及主要数据协议格式可能带来的主要障碍。 我们用不同部署和不同格式的相同的WAPIPI 进行案例研究, 来展示这些障碍的发生过程, 以及同时我们区分了两种不同的使用日志记录格式的重要性, 。 对于其中每一种记录格式, 我们展示了它们的主要分析结果。