This article presents GuideR, a user-guided rule induction algorithm, which overcomes the largest limitation of the existing methods-the lack of the possibility to introduce user's preferences or domain knowledge to the rule learning process. Automatic selection of attributes and attribute ranges often leads to the situation in which resulting rules do not contain interesting information. We propose an induction algorithm which takes into account user's requirements. Our method uses the sequential covering approach and is suitable for classification, regression, and survival analysis problems. The effectiveness of the algorithm in all these tasks has been verified experimentally, confirming guided rule induction to be a powerful data analysis tool.
翻译:本篇文章介绍指南R,这是用户指导的上岗演算法,它克服了现有方法的最大局限性,即没有可能将用户的偏好或域知识引入规则学习过程;自动选择属性和属性范围往往导致产生规则不包含有趣信息的情况;我们建议采用考虑到用户要求的上岗演算法;我们的方法采用顺序覆盖法,适合分类、回归和生存分析问题;所有这些任务中的算法的有效性都经过了试验性核查,确认有指导的上岗演算法是一个强有力的数据分析工具。