PrInDT(RePrInDT)中重复的低抽样抽样:低抽样和预测中的变异,以及组合中的预测人的排位 (Repeated undersampling in PrInDT (RePrInDT): Variation in undersampling and prediction, and ranking of predictors in ensembles) - 专知论文

会员服务 ·

0

欠采样 · 预测器/决策函数 · 秩 · 阈值 · 类别 ·

2021 年 8 月 11 日

Repeated undersampling in PrInDT (RePrInDT): Variation in undersampling and prediction, and ranking of predictors in ensembles

翻译：PrInDT(RePrInDT)中重复的低抽样抽样:低抽样和预测中的变异,以及组合中的预测人的排位

Claus Weihs,Sarah Buschfeld

In this paper, we extend our PrInDT method (Weihs & Buschfeld 2021a) towards undersampling with different percentages of the smaller and the larger classes (psmall and plarge), stratification of predictors, varying the prediction threshold, and measuring variable importance in ensembles. An application of these methods to a linguistic example suggests the following: 1. In undersampling, a careful selection of the percentages plarge and psmall is important for building models with high balanced accuracies; 2. Stratification of predictors does not majorly enhance balanced accuracies; 3. Lowering the prediction threshold for the smaller class turns out to be an alternative method to undersampling because it increases the likelihood of the smaller class being selected. Finally, we introduce a method for ranking predictor importance that allows for a straightforward interpretation of the results.

翻译：在本文中,我们把普里特特特方法(Weihs & Buschfeld 2021a)推广到对较小和较大类别(小类和大类)不同百分比、预测数的分层、预测阈值不同和在组合中的可变重要性的衡量,低抽样。将这些方法应用于语言实例表明如下: 1. 在低抽样中,仔细选择大类和小类的百分比对于建立高度平衡的模型很重要; 2. 预测数的分层不能大大加强平衡的准确性; 3. 降低小类的预测阈值是减少低抽样的替代方法,因为它增加了被选中的较小类别的可能性。最后,我们引入了排序预测重要性的方法,以便能够对结果进行直截了当的解释。

0

相关内容

欠采样

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

30+阅读 · 2021年6月12日

《人工智能计算中心白皮书》，43页pdf

《人工智能计算中心白皮书》，43页pdf

专知会员服务

147+阅读 · 2021年3月5日

【UBC】高级机器学习课程，Advanced Machine Learning

【UBC】高级机器学习课程，Advanced Machine Learning

专知会员服务

23+阅读 · 2021年1月26日

【德勤】数字化健康白皮书

专知会员服务

46+阅读 · 2020年12月4日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

52+阅读 · 2020年9月7日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

91+阅读 · 2020年3月12日

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

专知会员服务

51+阅读 · 2020年1月20日

【KDD2019|讲座推荐】深层贝叶斯挖掘、学习与理解：Deep Bayesian Mining, Learning and Understanding

【KDD2019|讲座推荐】深层贝叶斯挖掘、学习与理解：Deep Bayesian Mining, Learning and Understanding

专知会员服务

63+阅读 · 2019年12月14日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

31+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

167+阅读 · 2019年10月11日

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】卷积神经网络类间不平衡问题系统研究

【推荐】卷积神经网络类间不平衡问题系统研究

机器学习研究会

6+阅读 · 2017年10月18日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Assessment of Neural Networks for Stream-Water-Temperature Prediction

Assessment of Neural Networks for Stream-Water-Temperature Prediction

Arxiv

0+阅读 · 2021年10月8日

Temporal Convolutions for Multi-Step Quadrotor Motion Prediction

Temporal Convolutions for Multi-Step Quadrotor Motion Prediction

Arxiv

0+阅读 · 2021年10月8日

Predictive Quantile Regression with Mixed Roots and Increasing Dimensions

Arxiv

0+阅读 · 2021年10月7日

Robust Localization with Bounded Noise: Creating a Superset of the Possible Target Positions via Linear-Fractional Representations

Arxiv

0+阅读 · 2021年10月6日

Influence-Balanced Loss for Imbalanced Visual Classification

Arxiv

0+阅读 · 2021年10月6日

Improving Collaborative Metric Learning with Efficient Negative Sampling

Arxiv

3+阅读 · 2019年9月24日

Learning Attention-based Embeddings for Relation Prediction in Knowledge Graphs

Arxiv

39+阅读 · 2019年6月4日

On Attribution of Recurrent Neural Network Predictions via Additive Decomposition

Arxiv

3+阅读 · 2019年3月27日

Interaction Embeddings for Prediction and Explanation in Knowledge Graphs

Arxiv

7+阅读 · 2019年3月12日

Deep Randomized Ensembles for Metric Learning

Deep Randomized Ensembles for Metric Learning

Arxiv

5+阅读 · 2018年9月4日

VIP会员

文章信息

相关主题

预测器/决策函数

相关VIP内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

30+阅读 · 2021年6月12日

《人工智能计算中心白皮书》，43页pdf

《人工智能计算中心白皮书》，43页pdf

专知会员服务

147+阅读 · 2021年3月5日

【UBC】高级机器学习课程，Advanced Machine Learning

【UBC】高级机器学习课程，Advanced Machine Learning

专知会员服务

23+阅读 · 2021年1月26日

【德勤】数字化健康白皮书

专知会员服务

46+阅读 · 2020年12月4日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

52+阅读 · 2020年9月7日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

91+阅读 · 2020年3月12日

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

【清华大学】知识增强的常识性故事生成预训练模型，A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

专知会员服务

51+阅读 · 2020年1月20日

【KDD2019|讲座推荐】深层贝叶斯挖掘、学习与理解：Deep Bayesian Mining, Learning and Understanding

【KDD2019|讲座推荐】深层贝叶斯挖掘、学习与理解：Deep Bayesian Mining, Learning and Understanding

专知会员服务

63+阅读 · 2019年12月14日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

31+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

167+阅读 · 2019年10月11日

热门VIP内容

相关资讯

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

【论文笔记】通俗理解少样本文本分类 (Few-Shot Text Classification) (1)

深度学习自然语言处理

7+阅读 · 2020年4月8日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

25+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

41+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】卷积神经网络类间不平衡问题系统研究

【推荐】卷积神经网络类间不平衡问题系统研究

机器学习研究会

6+阅读 · 2017年10月18日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Assessment of Neural Networks for Stream-Water-Temperature Prediction

Assessment of Neural Networks for Stream-Water-Temperature Prediction

Arxiv

0+阅读 · 2021年10月8日

Temporal Convolutions for Multi-Step Quadrotor Motion Prediction

Temporal Convolutions for Multi-Step Quadrotor Motion Prediction

Arxiv

0+阅读 · 2021年10月8日

Predictive Quantile Regression with Mixed Roots and Increasing Dimensions

Arxiv

0+阅读 · 2021年10月7日

Robust Localization with Bounded Noise: Creating a Superset of the Possible Target Positions via Linear-Fractional Representations

Arxiv

0+阅读 · 2021年10月6日

Influence-Balanced Loss for Imbalanced Visual Classification

Arxiv

0+阅读 · 2021年10月6日

Improving Collaborative Metric Learning with Efficient Negative Sampling

Arxiv

3+阅读 · 2019年9月24日

Learning Attention-based Embeddings for Relation Prediction in Knowledge Graphs

Arxiv

39+阅读 · 2019年6月4日

On Attribution of Recurrent Neural Network Predictions via Additive Decomposition

Arxiv

3+阅读 · 2019年3月27日

Interaction Embeddings for Prediction and Explanation in Knowledge Graphs

Arxiv

7+阅读 · 2019年3月12日

Deep Randomized Ensembles for Metric Learning

Deep Randomized Ensembles for Metric Learning

Arxiv

5+阅读 · 2018年9月4日

微信扫码咨询专知VIP会员