Data insufficiency problem (i.e., data missing and label scarcity issues) caused by inadequate services and infrastructures or unbalanced development levels of cities has seriously affected the urban computing tasks in real scenarios. Prior transfer learning methods inspire an elegant solution to the data insufficiency, but are only concerned with one kind of insufficiency issue and fail to give consideration to both sides. In addition, most previous cross-city transfer methods overlooks the inter-city data privacy which is a public concern in practical application. To address above challenging problems, we propose a novel Cross-city Federated Transfer Learning framework (CcFTL) to cope with the data insufficiency and privacy problems. Concretely, CcFTL transfers the relational knowledge from multiple rich-data source cities to the target city. Besides, the model parameters specific to the target task are firstly trained on the source data and then fine-tuned to the target city by parameter transfer. With our adaptation of federated training and homomorphic encryption settings, CcFTL can effectively deal with the data privacy problem among cities. We take the urban region profiling as an application of smart cities and evaluate the proposed method with a real-world study. The experiments demonstrate the notable superiority of our framework over several competitive state-of-the-art models.
翻译:由城市服务和基础设施不足或发展水平不平衡造成的数据不足问题(即数据缺失和标签短缺问题),因城市服务和基础设施不足或发展程度不平衡造成的数据不足问题(即数据缺失和标签短缺问题),严重影响了城市计算任务; 先前的转移学习方法在现实情景中严重影响了城市计算工作; 先前的转移学习方法为数据不足提供了优雅的解决方案,但只关注一种数据不足问题,而没有考虑到双方; 此外,大多数以往的跨城市转移方法忽略了在实际应用中公众关切的城市间数据隐私问题; 为了解决上述具有挑战性的问题,我们提议建立一个新的跨城市联邦转移学习框架(CCFTL),以应对数据不足和隐私问题; 具体地说,CCFTL将多个丰富数据来源城市的关联知识转移给目标城市; 此外,目标任务的具体模型参数首先经过源数据培训,然后通过参数转移对目标城市进行微调; 由于我们适应了联邦化培训和同质加密环境,CFTL可以有效地处理城市间数据隐私问题。 我们把城市概况分析作为智能城市的应用,并评估拟议的方法与现实世界竞争模型。