抽样和规则设置大小对产生模糊推断系统预测准确性的影响:软件工程数据集分析 (The Impact of Sampling and Rule Set Size on Generated Fuzzy Inference System Predictive Accuracy: Analysis of a Software Engineering Data Set) - 专知论文

会员服务 ·

0

情景 · Engineering · 模型评估 · 推断 · 样本 ·

2021 年 2 月 5 日

The Impact of Sampling and Rule Set Size on Generated Fuzzy Inference System Predictive Accuracy: Analysis of a Software Engineering Data Set

翻译：抽样和规则设置大小对产生模糊推断系统预测准确性的影响:软件工程数据集分析

Stephen G. MacDonell

from arxiv, Conference paper, 7 pages, 5 tables, 7 figures

Software project management makes extensive use of predictive modeling to estimate product size, defect proneness and development effort. Although uncertainty is acknowledged in these tasks, fuzzy inference systems, designed to cope well with uncertainty, have received only limited attention in the software engineering domain. In this study we empirically investigate the impact of two choices on the predictive accuracy of generated fuzzy inference systems when applied to a software engineering data set: sampling of observations for training and testing; and the size of the rule set generated using fuzzy c-means clustering. Over ten samples we found no consistent pattern of predictive performance given certain rule set size. We did find, however, that a rule set compiled from multiple samples generally resulted in more accurate predictions than single sample rule sets. More generally, the results provide further evidence of the sensitivity of empirical analysis outcomes to specific model-building decisions.

翻译：软件项目管理广泛使用预测模型来估计产品规模、易变率和开发努力。虽然在这些任务中承认了不确定性,但为应付不确定性而设计的模糊推断系统在软件工程领域只得到有限的注意。在本研究中,我们实证地调查了两种选择在应用到软件工程数据集时对产生的模糊推断系统的预测准确性的影响:用于培训和测试的观测抽样;以及使用模糊的 c-poles 群集生成的规则集的规模。超过10个样本我们发现,由于某些规则设定的大小,预测性能没有一致的模式。然而,我们发现,从多个样本中收集的规则集通常比单一样本规则集产生更准确的预测。更一般而言,结果进一步证明了经验分析结果对具体的模型建设决定的敏感性。

0

相关内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

58+阅读 · 2020年11月21日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

Python数据分析:过去、现在和未来，52页ppt

Python数据分析:过去、现在和未来，52页ppt

专知会员服务

103+阅读 · 2020年3月9日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【KDD2019|讲座推荐】现代MDL与数据挖掘的结合--洞察力、理论和实践：Modern MDL meets Data Mining -- Insights, Theory, and Practice

【KDD2019|讲座推荐】现代MDL与数据挖掘的结合--洞察力、理论和实践：Modern MDL meets Data Mining -- Insights, Theory, and Practice

专知会员服务

17+阅读 · 2019年12月9日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

计算机经典算法回顾与展望——机器学习与数据挖掘

计算机经典算法回顾与展望——机器学习与数据挖掘

中国计算机学会

5+阅读 · 2019年10月11日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

10+阅读 · 2019年1月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

AI/ML/DNN硬件加速设计怎么入门？

AI/ML/DNN硬件加速设计怎么入门？

StarryHeavensAbove

11+阅读 · 2018年12月4日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

已删除

将门创投

4+阅读 · 2017年12月5日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Targeted Branching for the Maximum Independent Set Problem

Arxiv

0+阅读 · 2021年3月29日

Development and Validation of a Deep Learning Model for Prediction of Severe Outcomes in Suspected COVID-19 Infection

Arxiv

0+阅读 · 2021年3月29日

Inference of Random Effects for Linear Mixed-Effects Models with a Fixed Number of Clusters

Arxiv

0+阅读 · 2021年3月28日

Covariate-Adjusted Inference for Differential Analysis of High-Dimensional Networks

Arxiv

0+阅读 · 2021年3月27日

SQAPlanner: Generating Data-Informed Software Quality Improvement Plans

Arxiv

0+阅读 · 2021年3月27日

Predictive and explanatory models might miss informative features in educational data

Arxiv

0+阅读 · 2021年3月26日

Investigating spatial scan statistics for multivariate functional data

Arxiv

0+阅读 · 2021年3月26日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

Analysis of Multivariate Scoring Functions for Automatic Unbiased Learning to Rank

Arxiv

6+阅读 · 2020年8月20日

Reverse Engineering Configurations of Neural Text Generation Models

Arxiv

5+阅读 · 2020年4月13日

VIP会员

文章信息

相关主题

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

58+阅读 · 2020年11月21日

数据科学导论，54页ppt，Introduction to Data Science

数据科学导论，54页ppt，Introduction to Data Science

专知会员服务

42+阅读 · 2020年7月27日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

Python分布式计算，171页pdf，Distributed Computing with Python

Python分布式计算，171页pdf，Distributed Computing with Python

专知会员服务

108+阅读 · 2020年5月3日

Python数据分析:过去、现在和未来，52页ppt

Python数据分析:过去、现在和未来，52页ppt

专知会员服务

103+阅读 · 2020年3月9日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【KDD2019|讲座推荐】现代MDL与数据挖掘的结合--洞察力、理论和实践：Modern MDL meets Data Mining -- Insights, Theory, and Practice

【KDD2019|讲座推荐】现代MDL与数据挖掘的结合--洞察力、理论和实践：Modern MDL meets Data Mining -- Insights, Theory, and Practice

专知会员服务

17+阅读 · 2019年12月9日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】在低维和高维空间中分析、建模和转换潜在表征

从无人机到数据：揭示边缘计算作为新作战域

可解释人工智能的基础

大规模视觉模型中的基于提示的适应：综述

相关资讯

计算机经典算法回顾与展望——机器学习与数据挖掘

计算机经典算法回顾与展望——机器学习与数据挖掘

中国计算机学会

5+阅读 · 2019年10月11日

【TED】生命中的每一年的智慧

【TED】生命中的每一年的智慧

英语演讲视频每日一推

10+阅读 · 2019年1月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

AI/ML/DNN硬件加速设计怎么入门？

AI/ML/DNN硬件加速设计怎么入门？

StarryHeavensAbove

11+阅读 · 2018年12月4日

笔记 | Sentiment Analysis

笔记 | Sentiment Analysis

黑龙江大学自然语言处理实验室

10+阅读 · 2018年5月6日

已删除

将门创投

4+阅读 · 2017年12月5日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Targeted Branching for the Maximum Independent Set Problem

Arxiv

0+阅读 · 2021年3月29日

Development and Validation of a Deep Learning Model for Prediction of Severe Outcomes in Suspected COVID-19 Infection

Arxiv

0+阅读 · 2021年3月29日

Inference of Random Effects for Linear Mixed-Effects Models with a Fixed Number of Clusters

Arxiv

0+阅读 · 2021年3月28日

Covariate-Adjusted Inference for Differential Analysis of High-Dimensional Networks

Arxiv

0+阅读 · 2021年3月27日

SQAPlanner: Generating Data-Informed Software Quality Improvement Plans

Arxiv

0+阅读 · 2021年3月27日

Predictive and explanatory models might miss informative features in educational data

Arxiv

0+阅读 · 2021年3月26日

Investigating spatial scan statistics for multivariate functional data

Arxiv

0+阅读 · 2021年3月26日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

Analysis of Multivariate Scoring Functions for Automatic Unbiased Learning to Rank

Arxiv

6+阅读 · 2020年8月20日

Reverse Engineering Configurations of Neural Text Generation Models

Arxiv

5+阅读 · 2020年4月13日

微信扫码咨询专知VIP会员