【DeepMind】人工智能、价值与对齐，Artificial Intelligence, Values, and Alignment - 专知VIP

会员服务 ·

6

Google DeepMind · 人工智能 ·

2020 年 1 月 13 日

【DeepMind】人工智能、价值与对齐，Artificial Intelligence, Values, and Alignment

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

简介：

本文着眼于在AI一致性背景下出现的哲学问题。它捍卫了三个主张。首先，AI协调问题的规范和技术方面是相互关联的，这为在两个领域工作的人们之间的有效参与创造了空间。其次，重要的是要明确对齐的目标。人工智能与指令，意图，揭示的偏好，理想偏好，兴趣和价值观相符之间存在显着差异。在这种情况下，基于原则的AI对齐方法将这些元素以系统的方式结合在一起，具有相当大的优势。第三，理论学家面临的主要挑战不是确定AI的“真实”道德原则。相反，它是确定公平的公正原则，尽管人们的道德观念差异很大，但原则上仍应得到反思的认可。本文的最后一部分探讨了可以潜在地确定AI协调的公平原则的三种方式。

任何新技术都会产生道德上的考虑。但是，随着计算机系统具有更大的自主权并以“越来越多地禁止人们评估是否以负责任或道德的方式来评估每个动作”的速度运行，赋予人工代理以道德价值的任务变得尤为重要。

本文的第一部分指出，虽然技术人员在构建尊重和体现人类价值的系统中可以发挥重要作用，但选择合适的价值并不是仅靠技术工作就能解决的任务。当我们研究至少在强化学习范式中可以实现价值一致的不同方式时，这一点变得很明显。

成为VIP会员查看完整内容

Artificial Intelligence, Values, and Alignment.pdf

38

相关内容

Google DeepMind

Google DeepMind

Google DeepMind 是一家英国的人工智能公司。公司创建于 2010 年，最初名称是 DeepMind 科技，在 2014 年被谷歌收购。

最新《可解释人工智能XAI：机会与挑战》25页pdf，Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey

最新《可解释人工智能XAI：机会与挑战》25页pdf，Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey

专知会员服务

181+阅读 · 2020年6月23日

可解释强化学习，Explainable Reinforcement Learning: A Survey

可解释强化学习，Explainable Reinforcement Learning: A Survey

专知会员服务

131+阅读 · 2020年5月14日

【CMU-Google-斯坦福】可控行为的弱监督强化学习，Weakly-Supervised RL

【CMU-Google-斯坦福】可控行为的弱监督强化学习，Weakly-Supervised RL

专知会员服务

22+阅读 · 2020年4月8日

【Google-WWW2020】会话域探索的动态组合， Conversational Domain Exploration

专知会员服务

10+阅读 · 2020年3月22日

【ICML2020投稿论文-DeepMind】时序差分学习的推理与泛化，Temporal Difference Learning

专知会员服务

26+阅读 · 2020年3月16日

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

专知会员服务

78+阅读 · 2020年2月25日

【AAAI2020教程】强化学习中的Exploration-Exploitation in Reinforcement Learning

专知会员服务

101+阅读 · 2020年2月8日

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

专知会员服务

35+阅读 · 2019年12月12日

【伯克利】机器学习中充满价值的学科转变（Value-laden Disciplinary Shifts in Machine Learning）

【伯克利】机器学习中充满价值的学科转变（Value-laden Disciplinary Shifts in Machine Learning）

专知会员服务

5+阅读 · 2019年12月5日

最新415页《人工智能与机器人原理》（Principles of Robotics & Artificial Intelligence）书籍

最新415页《人工智能与机器人原理》（Principles of Robotics & Artificial Intelligence）书籍

专知会员服务

55+阅读 · 2019年10月27日

【工业4.0】工业人工智能与工业4.0 制造

【工业4.0】工业人工智能与工业4.0 制造

产业智能官

19+阅读 · 2018年11月8日

AlphaGo之父David Silver最新演讲，传授强化学习的十大原则

AlphaGo之父David Silver最新演讲，传授强化学习的十大原则

深度学习世界

3+阅读 · 2018年9月21日

DeepMind：用PopArt进行多任务深度强化学习

DeepMind：用PopArt进行多任务深度强化学习

论智

29+阅读 · 2018年9月14日

人工智能摧毁的不是工作岗位，而是商业模式

人工智能摧毁的不是工作岗位，而是商业模式

数据分析

5+阅读 · 2018年5月13日

不对称多代理博弈中的博弈理论解读

不对称多代理博弈中的博弈理论解读

AI前线

14+阅读 · 2018年3月8日

Gartner：2018人工智能预测

Gartner：2018人工智能预测

走向智能论坛

4+阅读 · 2017年11月28日

人工智能可以预测女朋友什么时候生气吗？

人工智能可以预测女朋友什么时候生气吗？

中科院物理所

3+阅读 · 2017年11月22日

【深度强化学习】深度强化学习揭秘

【深度强化学习】深度强化学习揭秘

产业智能官

21+阅读 · 2017年11月13日

【深度强化学习】专业解读“深度强化学习“：从AlphaGo到AlphaGoZero

【深度强化学习】专业解读“深度强化学习“：从AlphaGo到AlphaGoZero

产业智能官

11+阅读 · 2017年11月2日

【DRL教程学习笔记01】AlphaGo Zero核心技术- 深度强化学习简介

【DRL教程学习笔记01】AlphaGo Zero核心技术- 深度强化学习简介

专知

17+阅读 · 2017年10月20日

Interference and Generalization in Temporal Difference Learning

Arxiv

8+阅读 · 2020年3月13日

The Measure of Intelligence

The Measure of Intelligence

Arxiv

7+阅读 · 2019年11月5日

Graph Convolutional Networks for Temporal Action Localization

Arxiv

5+阅读 · 2019年9月7日

Efficient Eligibility Traces for Deep Reinforcement Learning

Arxiv

4+阅读 · 2018年10月23日

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Arxiv

5+阅读 · 2018年7月23日

Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation

Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation

Arxiv

5+阅读 · 2018年7月11日

Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction

Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction

Arxiv

4+阅读 · 2018年7月8日

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

Arxiv

3+阅读 · 2018年7月5日

Unsupervised Meta-Learning for Reinforcement Learning

Arxiv

8+阅读 · 2018年6月12日

Learning Unsupervised Learning Rules

Arxiv

7+阅读 · 2018年5月23日

VIP会员

相关主题

Google DeepMind

相关VIP内容

最新《可解释人工智能XAI：机会与挑战》25页pdf，Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey

最新《可解释人工智能XAI：机会与挑战》25页pdf，Opportunities and Challenges in Explainable Artificial Intelligence (XAI): A Survey

专知会员服务

181+阅读 · 2020年6月23日

可解释强化学习，Explainable Reinforcement Learning: A Survey

可解释强化学习，Explainable Reinforcement Learning: A Survey

专知会员服务

131+阅读 · 2020年5月14日

【CMU-Google-斯坦福】可控行为的弱监督强化学习，Weakly-Supervised RL

【CMU-Google-斯坦福】可控行为的弱监督强化学习，Weakly-Supervised RL

专知会员服务

22+阅读 · 2020年4月8日

【Google-WWW2020】会话域探索的动态组合， Conversational Domain Exploration

专知会员服务

10+阅读 · 2020年3月22日

【ICML2020投稿论文-DeepMind】时序差分学习的推理与泛化，Temporal Difference Learning

专知会员服务

26+阅读 · 2020年3月16日

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

专知会员服务

78+阅读 · 2020年2月25日

【AAAI2020教程】强化学习中的Exploration-Exploitation in Reinforcement Learning

专知会员服务

101+阅读 · 2020年2月8日

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

专知会员服务

35+阅读 · 2019年12月12日

【伯克利】机器学习中充满价值的学科转变（Value-laden Disciplinary Shifts in Machine Learning）

【伯克利】机器学习中充满价值的学科转变（Value-laden Disciplinary Shifts in Machine Learning）

专知会员服务

5+阅读 · 2019年12月5日

最新415页《人工智能与机器人原理》（Principles of Robotics & Artificial Intelligence）书籍

最新415页《人工智能与机器人原理》（Principles of Robotics & Artificial Intelligence）书籍

专知会员服务

55+阅读 · 2019年10月27日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】扩展可扩展会话推荐的边界

别想太多：高效 R1 风格大型推理模型综述

【ACMMM2025】EvoVLMA: 进化式视觉-语言模型自适应

智能体网络：用AI智能体编织下一代网络

相关资讯

【工业4.0】工业人工智能与工业4.0 制造

【工业4.0】工业人工智能与工业4.0 制造

产业智能官

19+阅读 · 2018年11月8日

AlphaGo之父David Silver最新演讲，传授强化学习的十大原则

AlphaGo之父David Silver最新演讲，传授强化学习的十大原则

深度学习世界

3+阅读 · 2018年9月21日

DeepMind：用PopArt进行多任务深度强化学习

DeepMind：用PopArt进行多任务深度强化学习

论智

29+阅读 · 2018年9月14日

人工智能摧毁的不是工作岗位，而是商业模式

人工智能摧毁的不是工作岗位，而是商业模式

数据分析

5+阅读 · 2018年5月13日

不对称多代理博弈中的博弈理论解读

不对称多代理博弈中的博弈理论解读

AI前线

14+阅读 · 2018年3月8日

Gartner：2018人工智能预测

Gartner：2018人工智能预测

走向智能论坛

4+阅读 · 2017年11月28日

人工智能可以预测女朋友什么时候生气吗？

人工智能可以预测女朋友什么时候生气吗？

中科院物理所

3+阅读 · 2017年11月22日

【深度强化学习】深度强化学习揭秘

【深度强化学习】深度强化学习揭秘

产业智能官

21+阅读 · 2017年11月13日

【深度强化学习】专业解读“深度强化学习“：从AlphaGo到AlphaGoZero

【深度强化学习】专业解读“深度强化学习“：从AlphaGo到AlphaGoZero

产业智能官

11+阅读 · 2017年11月2日

【DRL教程学习笔记01】AlphaGo Zero核心技术- 深度强化学习简介

【DRL教程学习笔记01】AlphaGo Zero核心技术- 深度强化学习简介

专知

17+阅读 · 2017年10月20日

相关论文

Interference and Generalization in Temporal Difference Learning

Arxiv

8+阅读 · 2020年3月13日

The Measure of Intelligence

The Measure of Intelligence

Arxiv

7+阅读 · 2019年11月5日

Graph Convolutional Networks for Temporal Action Localization

Arxiv

5+阅读 · 2019年9月7日

Efficient Eligibility Traces for Deep Reinforcement Learning

Arxiv

4+阅读 · 2018年10月23日

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Arxiv

5+阅读 · 2018年7月23日

Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation

Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation

Arxiv

5+阅读 · 2018年7月11日

Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction

Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction

Arxiv

4+阅读 · 2018年7月8日

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

Arxiv

3+阅读 · 2018年7月5日

Unsupervised Meta-Learning for Reinforcement Learning

Arxiv

8+阅读 · 2018年6月12日

Learning Unsupervised Learning Rules

Arxiv

7+阅读 · 2018年5月23日

微信扫码咨询专知VIP会员