We carry out the first in-depth characterization of residential proxies (RESIPs) in China, for which little is studied in previous works. Our study is made possible through a semantic-based classifier to automatically capture RESIP services. In addition to the classifier, new techniques have also been identified to capture RESIPs without interacting with and relaying traffic through RESIP services, which can significantly lower the cost and thus allow a continuous monitoring of RESIPs. Our RESIP service classifier has achieved a good performance with a recall of 99.7% and a precision of 97.6% in 10-fold cross validation. Applying the classifier has identified 399 RESIP services, a much larger set compared to 38 RESIP services collected in all previous works. Our effort of RESIP capturing lead to a collection of 9,077,278 RESIP IPs (51.36% are located in China), 96.70% of which are not covered in publicly available RESIP datasets. An extensive measurement on RESIPs and their services has uncovered a set of interesting findings as well as several security implications. Especially, 80.05% RESIP IPs located in China have sourced at least one malicious traffic flows during 2021, resulting in 52-million malicious traffic flows in total. And RESIPs have also been observed in corporation networks of 559 sensitive organizations including government agencies, education institutions and enterprises. Also, 3,232,698 China RESIP IPs have opened at least one TCP/UDP ports for accepting relaying requests, which incurs non-negligible security risks to the local network of RESIPs. Besides, 91% China RESIP IPs are of a lifetime less than 10 days while most China RESIP services show up a crest-trough pattern in terms of the daily active RESIPs across time.
翻译:我们首次对中国的住宅代理进行了深入的特征描述,而以前的工作对此研究很少。我们的研究是通过一个基于语义的分类器得以实现的。除了分类器之外,还发现一些新技术可以在不与RESIP服务进行互动和中继通信的情况下捕获RESIP服务,这样可以大大降低成本,从而能够持续监测RESIP。我们的RESIP服务分类器取得了良好的业绩,回收率为99.7%,精确率为97.6%,在10倍的交叉验证中进行。我们的研究通过一个基于语义的分类器确定了399 RESIP服务,比以往所有工作中收集的38 RESIP服务要大得多。除了分类器外,我们为捕获RESIP服务而没有与RESIP服务进行互动和通过RESIP服务进行中继通信,这可以大大降低成本,从而可以持续地监测RESIP的运行情况。 对RESIP服务进行了广泛的测量,在10-IP服务中,在20个中国的互联网网络中发现了一组最不令人感兴趣的发现问题,特别是一些安全影响。