We formulate the problem of induced domain adaptation (IDA) when the underlying distribution/domain shift is introduced by the model being deployed. Our formulation is motivated by applications where the deployed machine learning models interact with human agents, and will ultimately face responsive and interactive data distributions. We formalize the discussions of the transferability of learning in our IDA setting by studying how the model trained on the available source distribution (data) would translate to the performance on the induced domain. We provide both upper bounds for the performance gap due to the induced domain shift, as well as lower bound for the trade-offs a classifier has to suffer on either the source training distribution or the induced target distribution. We provide further instantiated analysis for two popular domain adaptation settings with covariate shift and label shift. We highlight some key properties of IDA, as well as computational and learning challenges.
翻译:当正在使用的模型引入基本分布/域变换时,我们就提出引引域适应(IDA)问题; 我们的配方是由已部署的机器学习模型与人体代理相互作用的应用驱动的,最终将面临反应迅速和互动的数据分配; 我们正式讨论在开发协会环境中学习的可转移性,方法是研究如何将关于现有源分配(数据)的训练模型转化为引域的性能表现; 我们为因引域变换造成的性能差距提供上限,以及分类者在源培训分配或诱导目标分配方面必须承受的取舍限制较低。 我们为两种流行的域适应环境提供进一步的即时分析,同时进行变量变换和标签变换。 我们强调开发协会的一些关键特性,以及计算和学习方面的挑战。