We are interested in in silico evaluation methodology for molecular optimization methods. Given a sample of molecules and their properties of our interest, we wish not only to train an agent that can find molecules optimized with respect to the target property but also to evaluate its performance. A common practice is to train a predictor of the target property on the sample and use it for both training and evaluating the agent. We show that this evaluator potentially suffers from two biases; one is due to misspecification of the predictor and the other to reusing the same sample for training and evaluation. We discuss bias reduction methods for each of the biases comprehensively, and empirically investigate their effectiveness.
翻译:我们感兴趣的是分子优化方法的硅评价方法。鉴于分子的样本及其我们感兴趣的特性,我们不仅希望培训一个能够找到目标属性方面最佳分子的代理物,而且还要评价其性能。一种常见的做法是对样本目标属性的预测者进行培训,并将其用于培训和评估该代理物。我们表明,该评估员可能存在两种偏差:一种是由于预测器的特性不当,另一种是因为在培训和评估中重复同样的样本。我们讨论每个偏差的减少偏差方法,并从经验上调查其有效性。