Positive and unlabelled learning is an important problem which arises naturally in many applications. The significant limitation of almost all existing methods lies in assuming that the propensity score function is constant (SCAR assumption), which is unrealistic in many practical situations. Avoiding this assumption, we consider parametric approach to the problem of joint estimation of posterior probability and propensity score functions. We show that under mild assumptions when both functions have the same parametric form (e.g. logistic with different parameters) the corresponding parameters are identifiable. Motivated by this, we propose two approaches to their estimation: joint maximum likelihood method and the second approach based on alternating maximization of two Fisher consistent expressions. Our experimental results show that the proposed methods are comparable or better than the existing methods based on Expectation-Maximisation scheme.
翻译:在许多应用中,积极和无标签学习是自然产生的一个重要问题。几乎所有现有方法的显著局限性在于假设倾向性评分函数是不变的(SCAR假设),在许多实际情况下是不切实际的。为了避免这一假设,我们考虑对联合估计后生概率和倾向性评分函数的问题采取参数法。我们表明,在两种功能具有相同的参数形式(例如,具有不同参数的后勤能力)的轻度假设下,相应的参数是可识别的。为此,我们提出了两种估算方法:共同最大可能性方法和第二种方法,其基础是交替地实现两种渔业一致性表达方式的最大化。我们的实验结果表明,拟议方法与基于预期-最大化计划的现有方法相似或更好。