We address classification of distributional data, where units are described by histogram or interval-valued variables. The proposed approach uses a linear discriminant function where distributions or intervals are represented by quantile functions, under specific assumptions. This discriminant function allows defining a score for each unit, in the form of a quantile function, which is used to classify the units in two a priori groups, using the Mallows distance. There is a diversity of application areas for the proposed linear discriminant method. In this work we classify the airline companies operating in NY airports based on air time and arrival/departure delays, using a full year fights.
翻译:我们处理分配数据的分类问题,其中单位用直方图或间限值变量进行描述; 拟议的方法使用线性分辨功能,根据具体假设,分布或间距由量函数代表,根据具体假设,这种分辨功能允许以量函数的形式为每个单位确定一个分数,该分数函数用来使用马洛距离将单位分成两个前位组,对拟议的线性分辨方法有多种应用区域。 在这项工作中,我们根据空气时间和抵达/离开延误对在纽约机场运营的航空公司进行分类,用整年的争斗来将每个单位分为两个前位组。