Data insufficiency problems (i.e., data missing and label scarcity) caused by inadequate services and infrastructures or imbalanced development levels of cities have seriously affected the urban computing tasks in real scenarios. Prior transfer learning methods inspire an elegant solution to the data insufficiency, but are only concerned with one kind of insufficiency issue and fail to give consideration to both sides. In addition, most previous cross-city transfer methods overlook inter-city data privacy which is a public concern in practical applications. To address the above challenging problems, we propose a novel Cross-city Federated Transfer Learning framework (CcFTL) to cope with the data insufficiency and privacy problems. Concretely, CcFTL transfers the relational knowledge from multiple rich-data source cities to the target city. Besides, the model parameters specific to the target task are firstly trained on the source data and then fine-tuned to the target city by parameter transfer. With our adaptation of federated training and homomorphic encryption settings, CcFTL can effectively deal with the data privacy problem among cities. We take the urban region profiling as an application of smart cities and evaluate the proposed method with a real-world study. The experiments demonstrate the notable superiority of our framework over several competitive state-of-the-art methods.
翻译:由于服务和基础设施不足或城市发展水平不平衡,造成数据不足的问题(即数据缺失和标签缺乏),造成数据不足的问题(即数据缺失和标签缺乏),城市基础设施不足或发展水平不平衡,这些都严重影响了城市计算任务; 先前的转移学习方法促使对数据不足问题有一个优雅的解决办法,但只关心一种不足问题,而没有考虑到双方; 此外,大多数以往的跨城市转移方法忽略了在实际应用中公众关注的城市间数据隐私问题; 为解决上述具有挑战性的问题,我们提议建立一个新的跨城市联邦转移学习框架(CCFTL),以应对数据不足和隐私问题; 具体地说,CcFTL将多个丰富数据来源城市的关联知识转移给目标城市; 此外,目标任务的具体示范参数首先经过源数据培训,然后通过参数转移微调适应目标城市; 由于我们适应了联邦培训和同质加密环境,CcFTL可以有效地处理城市间的数据隐私问题。 我们把城市地区概况分析作为智能城市的一种应用,并用现实世界的竞争性框架评估拟议方法。