This paper investigates a novel offline change-point detection problem from an information-theoretic perspective. In contrast to most related works, we assume that the knowledge of the underlying pre- and post-change distributions are not known and can only be learned from the training sequences which are available. We further require the probability of the \emph{estimation error} to decay either exponentially or sub-exponentially fast (corresponding respectively to the large and moderate deviations regimes in information theory parlance). Based on the training sequences as well as the test sequence consisting of a single change-point, we design a change-point estimator and further show that this estimator is optimal by establishing matching (strong) converses. This leads to a full characterization of the optimal confidence width (i.e., half the width of the confidence interval within which the true change-point is located at with high probability) as a function of the undetected error, under both the large and moderate deviations regimes.
翻译:本文从信息理论的角度对一个新的离线变化点探测问题进行了调查。 与大多数相关作品不同, 我们假设对变化前和变化后基本分布的了解并不为人所知, 只能从现有培训序列中学习。 我们还要求 emph{ 估计错误} 的概率, 以便指数化或亚爆炸性快速( 分别对应信息理论对称中大偏差制度) 。 根据培训序列以及由单一变化点组成的测试序列, 我们设计了一个更改点估计器, 进一步显示该估计器通过建立匹配( 强) 反差是最佳的。 这导致在大型和中偏差制度下对最佳信任宽度( 即真实变化点所在的宽度的一半, 概率很高 ) 进行充分描述, 以未发现错误的函数为基础, 在大偏差和中偏差制度下 。