Learning from Censored and Dependent Data: The Case of Linear Dynamics

Observations from dynamical systems often exhibit irregularities, such as censoring, where values are recorded only if they fall within a certain range. Censoring is ubiquitous in practice, due to saturating sensors, limit-of-detection effects, and image-frame effects. In light of recent developments on learning linear dynamical systems (LDSs), and on censored statistics with independent data, we revisit the decades-old problem of learning an LDS from censored observations (Lee and Maddala (1985); Zeger and Brookmeyer (1986)). Here, the learner observes the state $x_t \in \mathbb{R}^d$ if and only if $x_t$ belongs to some set $S_t \subseteq \mathbb{R}^d$. We develop the first computationally and statistically efficient algorithm for learning the system, assuming only oracle access to the sets $S_t$. Our algorithm, Stochastic Online Newton with Switching Gradients, is a novel second-order method that builds on the Online Newton Step (ONS) of Hazan et al. (2007). Our Switching-Gradient scheme does not always use (stochastic) gradients of the function we want to optimize, which we call the "censor-aware" function. Instead, in each iteration, it performs a simple test to decide whether to take the stochastic gradient from the censor-aware function or from another, "censor-oblivious" function. In our analysis, we consider a "generic" Online Newton method, which uses arbitrary vectors instead of gradients, and we prove an error bound for it. This bound can then be used to design these vectors appropriately, which leads to our Switching-Gradient scheme. This framework deviates significantly from the recent long line of work on censored statistics (e.g., Daskalakis et al. (2018); Kontonis et al. (2019); Daskalakis et al. (2019)), which applies Stochastic Gradient Descent (SGD) and whose analysis reduces to establishing conditions for off-the-shelf SGD bounds.
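To make the setting concrete, the following is a minimal Python sketch of the two ingredients named above: censored observation of an LDS through a membership oracle for $S_t$, and a "generic" Online Newton update driven by an arbitrary vector. It is an illustration under stated assumptions, not the algorithm of the paper. The dynamics $x_{t+1} = A x_t + w_t$ with Gaussian noise, the function names `simulate_censored_lds` and `generic_online_newton_step`, the `step` parameter, and the box-shaped set in the usage lines are all hypothetical choices made for this example. The test that switches between the censor-aware and censor-oblivious functions is not specified here, so the vector `v` is left to the caller, and any projection step is omitted.

```python
import numpy as np


def simulate_censored_lds(A, T, in_set, noise_std=1.0, seed=0):
    """Simulate x_{t+1} = A x_t + w_t (assumed model) and flag which states are observed.

    in_set(t, x) plays the role of oracle access to S_t: it returns True
    exactly when x belongs to S_t, i.e. when the learner gets to see x_t.
    """
    rng = np.random.default_rng(seed)
    d = A.shape[0]
    x = np.zeros(d)
    states, observed = [], []
    for t in range(T):
        states.append(x.copy())
        observed.append(bool(in_set(t, x)))
        # Assumption for illustration: i.i.d. Gaussian process noise.
        x = A @ x + noise_std * rng.standard_normal(d)
    return np.array(states), np.array(observed)


def generic_online_newton_step(theta, H, v, step=1.0):
    """One Online-Newton-style update that uses an arbitrary vector v.

    H accumulates outer products v v^T, and the parameter moves along
    H^{-1} v. Passing a stochastic gradient as v gives an unprojected
    variant of the Online Newton Step.
    """
    H = H + np.outer(v, v)
    theta = theta - step * np.linalg.solve(H, v)
    return theta, H


# Usage: a stable 2-d system observed only inside the box [-1, 1]^2.
A_true = 0.5 * np.eye(2)
states, observed = simulate_censored_lds(
    A_true, T=50, in_set=lambda t, x: bool(np.all(np.abs(x) <= 1.0))
)
visible_states = states[observed]  # the only states available to the learner
```

Initializing `H` to a positive-definite matrix (for example the identity) keeps the linear solve well posed from the first update onward.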
