We propose a direct domain adaptation (DDA) approach to enrich the training of supervised neural networks on synthetic data by features from real-world data. The process involves a series of linear operations on the input features to the NN model, whether they are from the source or target domains, as follows: 1) A cross-correlation of the input data (i.e. images) with a randomly picked sample pixel (or pixels) of all images from that domain or the mean of all randomly picked sample pixel (or pixels) of all images. 2) The convolution of the resulting data with the mean of the autocorrelated input images from the other domain. In the training stage, as expected, the input images are from the source domain, and the mean of auto-correlated images are evaluated from the target domain. In the inference/application stage, the input images are from the target domain, and the mean of auto-correlated images are evaluated from the source domain. The proposed method only manipulates the data from the source and target domains and does not explicitly interfere with the training workflow and network architecture. An application that includes training a convolutional neural network on the MNIST dataset and testing the network on the MNIST-M dataset achieves a 70% accuracy on the test data. A principal component analysis (PCA), as well as t-SNE, show that the input features from the source and target domains, after the proposed direct transformations, share similar properties along with the principal components as compared to the original MNIST and MNIST-M input features.
翻译:我们建议一种直接域适应(DDAD)方法,通过真实世界数据的特点,丰富对受监管神经网络的合成数据培训。这一过程涉及对NN模型输入特性的一系列线性操作,无论输入特性来自源域还是目标域,具体如下:1)输入数据(即图像)的交叉对比(即图像),该输入数据(或图像)的随机采样像素(或像素)或所有图像的随机采样像素(或像素)的平均值。2 由此产生的数据与来自其他域的与自动相关输入图像的平均值的结合。在培训阶段,输入图像来自源域或目标域,如预期,输入图像来自源域或目标域,输入图像来自源域,输入图像图像来自源域,且不明显干扰培训目标域域域域域,在目标域域域域域域中,自动镜像图像来自目标域/应用,在测试数据元件网络后,将数据测试数据系统,将数据系统化为测试数据元件。