Bootstrap resampling is the foundation of many ensemble learning methods, and out-of-bag (OOB) error estimation is the most widely used internal measure of generalization performance. In the standard multinomial bootstrap, the number of distinct observations in each resample is random. Although this source of variability exists, it has rarely been studied in isolation to understand how much it affects OOB-based quantities. To address this gap, we investigate Sequential Bootstrap, a resampling method that forces every bootstrap replicate to contain the same number of distinct observations, and treat it as a controlled modification of the classical bootstrap within the OOB framework. We reproduce Breiman's five original OOB experiments on both synthetic and real-world datasets, repeating all analyses across many different random seeds. Our results show that switching from the classical bootstrap to Sequential Bootstrap leaves accuracy-related metrics essentially unchanged, but yields measurable and data-dependent reductions in several variance-related measures. Therefore, Sequential Bootstrap should not be viewed as a new method for improving predictive performance, but rather as a tool for understanding how randomness in the number of distinct samples contributes to the variance of OOB estimators. This work provides a reproducible setting for studying the statistical properties of resampling-based ensemble estimators and offers empirical evidence that may support future theoretical work on variance decomposition in bootstrap-based systems.
翻译:自助重采样是众多集成学习方法的基础,而袋外误差估计是应用最广泛的泛化性能内部度量。在标准的多项式自助法中,每个重采样样本中不同观测值的数量是随机的。尽管存在这一变异性来源,但鲜有研究单独探讨其对基于袋外误差的统计量的影响程度。为填补这一空白,我们研究了顺序自助法——一种强制每个自助法重复样本包含相同数量不同观测值的重采样方法,并将其视为袋外框架下经典自助法的受控改进。我们在合成数据集和真实数据集上复现了Breiman最初设计的五个袋外实验,并在多个不同随机种子下重复了所有分析。结果表明,从经典自助法切换至顺序自助法时,与精度相关的指标基本保持不变,但在多个方差相关度量上产生了可测量且依赖于数据的降低。因此,顺序自助法不应被视为提升预测性能的新方法,而应作为理解不同样本数量的随机性如何影响袋外估计量方差的工具。本研究为探究基于重采样的集成估计量的统计特性提供了可复现的实验环境,并为未来基于自助法的系统方差分解理论研究提供了实证依据。