The accuracy of a classifier, when performing Pattern recognition, is mostly tied to the quality and representativeness of the input feature vector. Feature Selection is a process that allows for representing information properly and may increase the accuracy of a classifier. This process is responsible for finding the best possible features, thus allowing us to identify to which class a pattern belongs. Feature selection methods can be categorized as Filters, Wrappers, and Embed. This paper presents a survey on some Filters and Wrapper methods for handcrafted feature selection. Some discussions, with regard to the data structure, processing time, and ability to well represent a feature vector, are also provided in order to explicitly show how appropriate some methods are in order to perform feature selection. Therefore, the presented feature selection methods can be accurate and efficient if applied considering their positives and negatives, finding which one fits best the problem's domain may be the hardest task.
翻译:在进行模式识别时,分类器的准确性主要与输入特性矢量的质量和代表性有关。特性选择是一个允许正确代表信息的过程,可能提高分类器的准确性。这一过程负责寻找最佳可能的特性,从而使我们能够确定模式属于哪一类。特征选择方法可以归类为过滤器、包装器和嵌入器。本文对手工艺特性选择的某些过滤器和包装器方法进行了调查。还就数据结构、处理时间和正确代表特性矢量的能力进行了一些讨论,以明确表明某些方法对于进行特征选择是何等适当。因此,如果考虑到其正反两种,所介绍的特征选择方法可以准确而有效,发现问题领域最适合哪个领域是最困难的任务。