面向非确定性的浮点神经网络乐观验证 (Nondeterminism-Aware Optimistic Verification for Floating-Point Neural Networks)

Neural networks increasingly run on hardware outside the user's control (cloud GPUs, inference marketplaces). Yet ML-as-a-Service reveals little about what actually ran or whether returned outputs faithfully reflect the intended inputs. Users lack recourse against service downgrades (model swaps, quantization, graph rewrites, or discrepancies like altered ad embeddings). Verifying outputs is hard because floating-point(FP) execution on heterogeneous accelerators is inherently nondeterministic. Existing approaches are either impractical for real FP neural networks or reintroduce vendor trust. We present NAO: a Nondeterministic tolerance Aware Optimistic verification protocol that accepts outputs within principled operator-level acceptance regions rather than requiring bitwise equality. NAO combines two error models: (i) sound per-operator IEEE-754 worst-case bounds and (ii) tight empirical percentile profiles calibrated across hardware. Discrepancies trigger a Merkle-anchored, threshold-guided dispute game that recursively partitions the computation graph until one operator remains, where adjudication reduces to a lightweight theoretical-bound check or a small honest-majority vote against empirical thresholds. Unchallenged results finalize after a challenge window, without requiring trusted hardware or deterministic kernels. We implement NAO as a PyTorch-compatible runtime and a contract layer currently deployed on Ethereum Holesky testnet. The runtime instruments graphs, computes per-operator bounds, and runs unmodified vendor kernels in FP32 with negligible overhead (0.3% on Qwen3-8B). Across CNNs, Transformers and diffusion models on A100, H100, RTX6000, RTX4090, empirical thresholds are $10^2-10^3$ times tighter than theoretical bounds, and bound-aware adversarial attacks achieve 0% success. NAO reconciles scalability with verifiability for real-world heterogeneous ML compute.

翻译：神经网络日益运行在用户无法控制的硬件上（云端GPU、推理市场）。然而机器学习即服务几乎不透露实际运行内容或返回输出是否忠实反映预期输入。用户面对服务降级（模型替换、量化、图重写或广告嵌入篡改等差异）缺乏追索手段。验证输出之所以困难，是因为异构加速器上的浮点运算本质上具有非确定性。现有方法要么对实际浮点神经网络不实用，要么重新引入供应商信任。我们提出NAO：一种非确定性容差感知的乐观验证协议，该协议接受符合原则性算子级接受区域的输出，而非要求比特级相等。NAO融合两种误差模型：（i）严谨的逐算子IEEE-754最坏情况边界；（ii）跨硬件校准的严格经验百分位分布。差异触发基于默克尔树锚定、阈值引导的争议博弈，递归分割计算图直至剩余单个算子，此时裁决简化为轻量级理论边界检查或针对经验阈值的小规模诚实多数投票。未受挑战的结果在争议窗口期后最终确认，无需可信硬件或确定性内核。我们将NAO实现为PyTorch兼容运行时及当前部署于以太坊Holesky测试网的合约层。该运行时对计算图进行插装、计算逐算子边界，并以可忽略开销（Qwen3-8B上为0.3%）在FP32精度下运行未经修改的供应商内核。在A100、H100、RTX6000、RTX4090上对CNN、Transformer和扩散模型的测试表明，经验阈值较理论边界严格$10^2-10^3$倍，边界感知对抗攻击成功率为0%。NAO为现实世界异构机器学习计算实现了可扩展性与可验证性的统一。

相关内容

Neural Networks

关注 1651

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日