Third-party libraries (TPLs) have become a significant part of the Android ecosystem. Developers can employ various TPLs to facilitate their app development. Unfortunately, the popularity of TPLs also brings new security issues. For example, TPLs may carry malicious or vulnerable code, which can infect popular apps to pose threats to mobile users. Furthermore, TPL detection is essential for downstream tasks, such as vulnerabilities and malware detection. Thus, various tools have been developed to identify TPLs. However, no existing work has studied these TPL detection tools in detail, and different tools focus on different applications and techniques with performance differences. A comprehensive understanding of these tools will help us make better use of them. To this end, we conduct a comprehensive empirical study to fill the gap by evaluating and comparing all publicly available TPL detection tools based on six criteria: accuracy of TPL construction, effectiveness, efficiency, accuracy of version identification, resiliency to code obfuscation, and ease of use. Besides, we enhance these open-source tools by fixing their limitations, to improve their detection ability. Finally, we build an extensible framework that integrates all existing available TPL detection tools, providing an online service for the research community. We release the evaluation dataset and enhanced tools. According to our study, we also present the essential findings and discuss promising implications to the community. We believe our work provides a clear picture of existing TPL detection techniques and also gives a roadmap for future research.
翻译:第三方图书馆(TPL)已成为Android生态系统的一个重要部分。 开发者可以使用各种TPL来推动其应用程序开发。 不幸的是,TPL的普及也带来了新的安全问题。例如,TPL可能携带恶意或脆弱的代码,从而影响公众应用,从而对移动用户构成威胁。此外,TPL的检测对于下游任务至关重要,例如脆弱性和恶意软件检测等。因此,开发了各种工具来识别TPL。然而,没有一项现有工作详细研究这些TPL的检测工具,以及侧重于不同应用和技术的不同工具,其性能有差异。对这些工具的全面理解将有助于我们更好地利用这些工具。为此,我们开展了一项全面的实证研究,通过评估和比较所有公开提供的TPL检测工具对移动用户构成威胁。此外,我们还根据以下六项标准,评估TPL的准确性、有效性、效率、版本识别的准确性能、对代码模糊性能的弹性,以及使用方便性。此外,我们通过确定这些公开源工具的局限性,加强这些工具的检测能力。最后,我们对这些工具的全面理解将有助于我们更好地利用这些工具进行深入的研究,我们为现有的探测和发行提供在线评估提供我们现有工具。