Rubik's Optical Neural Networks: Multi-task Learning with Physics-aware Rotation Architecture (Rubik's Optical Neural Networks: Multi-task Learning with Physics-aware Rotation Architecture)

Recently, there are increasing efforts on advancing optical neural networks (ONNs), which bring significant advantages for machine learning (ML) in terms of power efficiency, parallelism, and computational speed. With the considerable benefits in computation speed and energy efficiency, there are significant interests in leveraging ONNs into medical sensing, security screening, drug detection, and autonomous driving. However, due to the challenge of implementing reconfigurability, deploying multi-task learning (MTL) algorithms on ONNs requires re-building and duplicating the physical diffractive systems, which significantly degrades the energy and cost efficiency in practical application scenarios. This work presents a novel ONNs architecture, namely, \textit{RubikONNs}, which utilizes the physical properties of optical systems to encode multiple feed-forward functions by physically rotating the hardware similarly to rotating a \textit{Rubik's Cube}. To optimize MTL performance on RubikONNs, two domain-specific physics-aware training algorithms \textit{RotAgg} and \textit{RotSeq} are proposed. Our experimental results demonstrate more than 4$\times$ improvements in energy and cost efficiency with marginal accuracy degradation compared to the state-of-the-art approaches.

翻译：RubikONNs：利用物理感知旋转架构进行多任务学习的光学神经网络近来，越来越多的工作致力于推进光学神经网络（ONNs）的发展，ONNs在计算效率、并行性和计算速度等方面为机器学习（ML）带来了重大优势。随着计算速度和能量效率的显著提高，人们开始将ONNs应用于医疗感知、安全检测、药物检测和自动驾驶等领域。然而，由于复杂的重构问题，将多任务学习（MTL）算法部署到ONNs上需要重新构建和复制物理衍射系统，这在实际应用场景中会显著降低能量和成本效率。本文提出了一种新的ONNs架构，即RubikONNs，它利用光学系统的物理特性，物理旋转硬件来实现多个前馈函数的编码，类似于旋转光学玩具魔方。为了优化RubikONNs的MTL性能，提出了两种面向特定领域的物理感知训练算法RotAgg和RotSeq。实验结果表明，与最先进的方法相比，RubikONNs能够使能量和成本效率提高4倍以上，准确度略有下降。

相关内容

多任务学习

关注 161

多任务学习（MTL）是机器学习的一个子领域，可以同时解决多个学习任务，同时利用各个任务之间的共性和差异。与单独训练模型相比，这可以提高特定任务模型的学习效率和预测准确性。多任务学习是归纳传递的一种方法，它通过将相关任务的训练信号中包含的域信息用作归纳偏差来提高泛化能力。通过使用共享表示形式并行学习任务来实现,每个任务所学的知识可以帮助更好地学习其它任务。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

35+阅读 · 2022年3月5日