Control of the stochastic dynamics of a quantum system is indispensable in fields such as quantum information processing and metrology. However, there is no general ready-made approach to the design of efficient control strategies. Here, we propose a framework for the automated design of control schemes based on differentiable programming ($\partial \mathrm{P}$). We apply this approach to the state preparation and stabilization of a qubit subjected to homodyne detection. To this end, we formulate the control task as an optimization problem where the loss function quantifies the distance from the target state, and we employ neural networks (NNs) as controllers. The system's time evolution is governed by a stochastic differential equation (SDE). To implement efficient training, we backpropagate the gradient information from the loss function through the SDE solver using adjoint sensitivity methods. As a first example, we feed the quantum state to the controller and focus on different methods of obtaining gradients. As a second example, we directly feed the homodyne detection signal to the controller. The instantaneous value of the homodyne current contains only very limited information on the actual state of the system, masked by unavoidable photon-number fluctuations. Despite the resulting poor signal-to-noise ratio, we can train our controller to prepare and stabilize the qubit to a target state with a mean fidelity of around 85%. We also compare the solutions found by the NN to a hand-crafted control strategy.
 翻译:在量子信息处理和计量学等领域,控制量子系统的随机动态是不可或缺的。 但是,在量子信息处理和计量学等领域,没有通用的现成方法来设计高效控制战略。 我们在此建议一个基于不同编程的监控计划自动设计框架 ($\ part \ mathrm{P}$) 。 我们将这种方法应用于受同质检测的qubit的国家准备和稳定。 为此, 我们将控制任务设计成一个优化问题, 即损失功能量化距离目标状态的大小, 我们使用神经网络作为控制器。 系统的时间演变受一个随机分析差异方程式( SDE) 的制约。 为了实施高效培训, 我们用连接灵敏度的敏感度方法, 将梯度信息从损失函数中反射出。 首先, 我们向控制器输入量子状态, 并将重点放在获取梯度的不同方法上。 第二个例子是, 我们直接向控制器的检测信号信号是神经网络的瞬时值, 由精确度差异的方位值来将我们无法避免的恒定的电压系统 。