The effects of the so-called "refugee crisis" of 2015-16 continue to dominate the political agenda in Europe. Migration flows were sudden and unexpected, leaving governments unprepared and exposing significant shortcomings in the field of migration forecasting. Migration is a complex system typified by episodic variation, underpinned by causal factors that are interacting, highly context dependent and short-lived. Correspondingly, migration monitoring relies on scattered data, while approaches to forecasting focus on specific migration flows and often have inconsistent results that are difficult to generalise at the regional or global levels. Here we show that adaptive machine learning algorithms that integrate official statistics and non-traditional data sources at scale can effectively forecast asylum-related migration flows. We focus on asylum applications lodged in countries of the European Union (EU) by nationals of all countries of origin worldwide; the same approach can be applied in any context provided adequate migration or asylum data are available. We exploit three tiers of data - geolocated events and internet searches in countries of origin, detections of irregular crossings at the EU border, and asylum recognition rates in countries of destination - to effectively forecast individual asylum-migration flows up to four weeks ahead with high accuracy. Uniquely, our approach a) monitors potential drivers of migration in countries of origin to detect changes early onset; b) models individual country-to-country migration flows separately and on moving time windows; c) estimates the effects of individual drivers, including lagged effects; d) provides forecasts of asylum applications up to four weeks ahead; e) assesses how patterns of drivers shift over time to describe the functioning and change of migration systems.
翻译:移徙是一个复杂的系统,其特点是:由相互作用、高度背景依赖和寿命短的因果因素支撑,移徙是一个以偶发差异为特征的复杂系统;与此相关的是,移徙监测依赖于分散的数据,而预测特定移徙流动重点的方法和结果往往不一致,难以在区域或全球层面一概而论。 我们在这里表明,将官方统计数据和非传统数据来源纳入规模的适应性机器学习算法能够有效地预测与庇护有关的移徙流动情况。我们侧重于由全世界所有原籍国国民在欧洲联盟(欧盟)国家提出的庇护申请,其特点是出现偶发差异,其基础是相互作用、高度环境依赖和寿命短短。我们利用了三个层次的数据――在原籍国的地理定位事件和互联网搜索,在欧盟边境的不正常过境系统探测,以及目的地国的庇护识别率――有效地预测个人庇护移徙流动到提前四周的动态,包括提前四个星期向原籍国的移徙趋势;单独地监测各国移徙趋势的预测,其潜在趋势是:从头至头四个星期的移徙趋势;从头四个时期到头四个时期的移徙趋势。