The "rare type match problem" is the situation in which the suspect's DNA profile, matching the DNA profile of the crime stain, is not in the database of reference. The evaluation of this match in the light of the two competing hypotheses (the crime stain has been left by the suspect or by another person) is based on the calculation of the likelihood ratio and depends on the population proportions of the DNA profiles, that are unknown. We propose a Bayesian nonparametric method that uses a two-parameter Poisson Dirichlet distribution as a prior over the ranked population proportions, and discards the information about the names of the different DNA profiles. This fits very well the data coming from European Y-STR DNA profiles, and the calculation of the likelihood ratio becomes quite simple thanks to a justified Empirical Bayes approach.
翻译:“极端类型匹配问题”是指嫌疑人的DNA特征与犯罪污点的DNA特征相匹配,不在参考数据库中的情况。根据两种相互竞争的假设(犯罪嫌疑人或他人留下的犯罪污点)对这一匹配的评估基于可能性比率的计算,并取决于DNA特征的人口比例,而这种比例并不为人所知。 我们建议采用巴耶斯非参数性非参数性方法,先使用Poisson Drichlet两参数分布,然后丢弃关于不同DNA特征名称的信息。 这非常符合欧洲Y-STRDNA特征的数据,而可能性比率的计算也非常简单,因为采用了合理的“经验贝耶斯”方法。