In this work, we study a status update system with a source node sending timely information to the destination through a channel with random delay. We measure the timeliness of the information stored at the receiver via the Age of Information (AoI), the time elapsed since the freshest sample stored at the receiver is generated. The goal is to design a sampling strategy that minimizes the total cost of the expected time average AoI and sampling cost in the absence of transmission delay statistics. We reformulate the total cost minimization problem as the optimization of a renewal-reward process, and propose an online sampling strategy based on the Robbins-Monro algorithm. We show that, when the transmission delay is bounded, the expected time average total cost obtained by the proposed online algorithm converges to the minimum cost when $K$ goes to infinity, and the optimality gap decays with rate $\mathcal{O}\left(\ln K/K\right)$, where $K$ is the number of samples we have taken. Simulation results validate the performance of our proposed algorithm.
翻译:在这项工作中,我们研究一个状态更新系统,其中有一个源节点,通过随机延迟的频道及时向目的地发送信息;我们测量接收者通过信息时代储存的信息的及时性,这是在接收者储存的最新鲜样本产生后的时间间隔;目的是设计一个取样战略,在没有传输延迟统计数据的情况下,最大限度地减少预期平均时间和取样成本的总成本;我们将成本最小化的总问题重新表述为更新-奖励过程的优化,并根据Robbbins-Monro算法提出在线抽样战略。我们表明,在传输延迟受限时,拟议在线算法获得的预期平均总成本在达到无限值时将达到最低成本,而最佳性差则随着汇率$mathcal{O ⁇ left(lnK/K\right)的衰减,而美元是我们所采样的数量。模拟结果证实了我们提议的算法的性能。