Nuclear Arms Control Verification and Lessons for AI Treaties - 专知 (Zhuanzhi) Papers

April 8, 2023

Nuclear Arms Control Verification and Lessons for AI Treaties

Security risks from AI have motivated calls for international agreements that guardrail the technology. However, even if states could agree on what rules to set on AI, the problem of verifying compliance might make these agreements infeasible. To help clarify the difficulty of verifying agreements on AI, and to identify actions that might reduce this difficulty, this report examines the case study of verification in nuclear arms control. We review the implementation, track records, and politics of verification across three types of nuclear arms control agreements. Then, we consider implications for the case of AI, especially AI development that relies on thousands of highly specialized chips. In this context, the case study suggests that, with certain preparations, the foreseeable challenges of verification would be reduced to levels that were successfully managed in nuclear arms control. To avoid even worse challenges, substantial preparations are needed: (1) developing privacy-preserving, secure, and acceptably priced methods for verifying the compliance of hardware, given inspection access; and (2) building an initial, incomplete verification system, with authorities and precedents that allow its gaps to be quickly closed if and when the political will arises.
