A major limitation of fuzzy or neuro-fuzzy systems is their failure to deal with high-dimensional datasets. This happens primarily due to the use of T-norm, particularly, product or minimum (or a softer version of it). Thus, there are hardly any work dealing with datasets with dimensions more than hundred or so. Here, we propose a neuro-fuzzy framework that can handle datasets with dimensions even more than 7000! In this context, we propose an adaptive softmin (Ada-softmin) which effectively overcomes the drawbacks of ``numeric underflow" and ``fake minimum" that arise for existing fuzzy systems while dealing with high-dimensional problems. We call it an Adaptive Takagi-Sugeno-Kang (AdaTSK) fuzzy system. We then equip the AdaTSK system to perform feature selection and rule extraction in an integrated manner. In this context, a novel gate function is introduced and embedded only in the consequent parts, which can determine the useful features and rules, in two successive phases of learning. Unlike conventional fuzzy rule bases, we design an enhanced fuzzy rule base (En-FRB), which maintains adequate rules but does not grow the number of rules exponentially with dimension that typically happens for fuzzy neural networks. The integrated Feature Selection and Rule Extraction AdaTSK (FSRE-AdaTSK) system consists of three sequential phases: (i) feature selection, (ii) rule extraction, and (iii) fine tuning. The effectiveness of the FSRE-AdaTSK is demonstrated on 19 datasets of which five are in more than 2000 dimension including two with dimension greater than 7000. This may be the first time fuzzy systems are realized for classification involving more than 7000 input features.
翻译:模糊或神经模糊系统的主要限制是它们无法处理高维数据集。 这主要是因为使用T- 诺姆, 特别是产品或最小( 或更软的版本) 。 因此, 几乎没有任何处理尺寸超过100个或以上的数据集的工作。 在此, 我们提议一个神经模糊框架, 可以处理尺寸甚至超过7000个的数据集。 在这方面, 我们提议一个适应性软质( 亚达- 软质), 它有效克服了“ 数字下流” 和“ fake 最低” 的缺陷。 这主要归功于 T- 诺姆( 特别是产品或最低( 或更软的版本 ) 。 因此, 我们称之为适应性 Taki- Sugenno- Kang ( AdatSK) 的系统。 然后我们让 AdaTSK 系统能够以综合方式进行特性选择和规则提取。 在这方面, 引入并嵌入一个新型的门功能仅包含后继部分, 可以在连续两个学习阶段确定有用的特性和规则 。 与常规的FS- 规则基础不同, 我们设计一个比常规规则更高级的系统更精度 。