Classifiers are often utilized in time-constrained settings where labels must be assigned to inputs quickly. To address these scenarios, budgeted multi-stage classifiers (MSC) process inputs through a sequence of partial feature acquisition and evaluation steps with early-exit options until a confident prediction can be made. This allows for fast evaluation that can prevent expensive, unnecessary feature acquisition in time-critical instances. However, performance of MSCs is highly sensitive to several design aspects -- making optimization of these systems an important but difficult problem. To approximate an initially intractable combinatorial problem, current approaches to MSC configuration rely on well-behaved surrogate loss functions accounting for two primary objectives (processing cost, error). These approaches have proven useful in many scenarios but are limited by analytic constraints (convexity, smoothness, etc.) and do not manage additional performance objectives. Notably, such methods do not explicitly account for an important aspect of real-time detection systems -- the ratio of "accepted" predictions satisfying some confidence criterion imposed by a risk-averse monitor. This paper proposes a problem-specific genetic algorithm, EMSCO, that incorporates a terminal reject option for indecisive predictions and treats MSC design as an evolutionary optimization problem with distinct objectives (accuracy, cost, coverage). The algorithm's design emphasizes Pareto efficiency while respecting a notion of aggregated performance via a unique scalarization. Experiments are conducted to demonstrate EMSCO's ability to find global optima in a variety of Theta(k^n) solution spaces, and multiple experiments show EMSCO is competitive with alternative budgeted approaches.
翻译:分类器通常在时间紧张的环境中使用,因为必须迅速指定标签用于投入。为了应对这些假设,预算编列的多阶段分类(MSC)流程投入通过一系列局部特性获取和评价步骤进行,并有提前退出的选择,直到能够作出有信心的预测。这样可以进行快速评估,防止在时间紧迫的情况下以昂贵、不必要的特性获取,但是,MSC的性能对几个设计方面非常敏感 -- -- 使优化这些系统成为一个重要但困难的问题。为了接近最初棘手的组合问题,目前MSC配置的方法依赖于两种主要目标(处理成本、错误)的妥善的代理损失功能核算。这些方法在许多假设中证明有用,但受到分析性限制(精度、顺畅通等),而且不会管理额外的性能目标。值得注意的是,这种方法并未明确考虑到实时检测系统的一个重要方面 -- -- 风险-反向监测所设定的“可接受”空间预测满足某些信任标准的比率。 本文提出了一种针对特定问题的替代遗传算法,即EMSCO,将精度能力范围纳入一个具有明确度的模型性选择性选择范围,同时将IMSARMASLAMASLSLSLSLSEA性预测显示一种不同的设计效率。