同步多代理主动搜索 (Cost Aware Asynchronous Multi-Agent Active Search) - 专知论文

会员服务 ·

0

Agent · 回合 · 代价 · 在线 · Principle ·

2022 年 10 月 5 日

Cost Aware Asynchronous Multi-Agent Active Search

翻译：同步多代理主动搜索

Arundhati Banerjee,Ramina Ghods,Jeff Schneider

Multi-agent active search requires autonomous agents to choose sensing actions that efficiently locate targets. In a realistic setting, agents also must consider the costs that their decisions incur. Previously proposed active search algorithms simplify the problem by ignoring uncertainty in the agent's environment, using myopic decision making, and/or overlooking costs. In this paper, we introduce an online active search algorithm to detect targets in an unknown environment by making adaptive cost-aware decisions regarding the agent's actions. Our algorithm combines principles from Thompson Sampling (for search space exploration and decentralized multi-agent decision making), Monte Carlo Tree Search (for long horizon planning) and pareto-optimal confidence bounds (for multi-objective optimization in an unknown environment) to propose an online lookahead planner that removes all the simplifications. We analyze the algorithm's performance in simulation to show its efficacy in cost aware active search.

翻译：多试剂主动搜索要求自主代理商选择能够有效定位目标的感测动作。在现实的环境中,代理商还必须考虑其决定产生的成本。先前提议的积极搜索算法通过忽略代理商环境的不确定性、使用近视决策以及/或忽略成本来简化问题。在本文中,我们引入了在线主动搜索算法,通过对代理商的行动做出适应性成本意识的决定,在未知环境中检测目标。我们的算法结合了Thompson Sampling(用于搜索空间探索和分散式多试剂决策)、Monte Carlo树搜索(用于长地平线规划)和对等最佳信任界限(用于在未知环境中实现多目标优化)的原则,以提出消除所有简化的在线外观规划器。我们分析了模拟算法的性能,以显示其在成本意识主动搜索中的效率。

0

相关内容

Agent

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

变应性鼻炎孕母暴露PM2.5对子代脐血Th2细胞亚群极化影响及其表观遗传调控研究

国家自然科学基金

0+阅读 · 2015年12月31日

β2-AR /PCBP2相互作用在胰腺癌发生发展中的作用及机制

国家自然科学基金

0+阅读 · 2014年12月31日

L-BM诱导的血流动力学改变对慢性心衰中自噬的调控和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

信息物理融合系统多领域统一建模方法及仿真策略研究

国家自然科学基金

4+阅读 · 2013年12月31日

B7-H3调控Th细胞分化及其在支气管哮喘免疫病理中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

农村地区2型糖尿病Markov模型构建及相关干预策略经济学评价

国家自然科学基金

0+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

应力负荷介导下Ihh/PTHrP信号轴对髁突前软骨干细胞分化的影响及作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

AR/let-7及其下游分子对ER-AR+乳腺癌干细胞生长的调控机制

国家自然科学基金

0+阅读 · 2011年12月31日

Deep W-Networks: Solving Multi-Objective Optimisation Problems With Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年11月9日

Discrimination and Class Imbalance Aware Online Naive Bayes

Discrimination and Class Imbalance Aware Online Naive Bayes

Arxiv

0+阅读 · 2022年11月9日

Active Exploration via Experiment Design in Markov Chains

Arxiv

0+阅读 · 2022年11月9日

Nimbus: Toward Speed Up Function Signature Recovery via Input Resizing and Multi-Task Learning

Arxiv

0+阅读 · 2022年11月8日

Adaptive Asynchronous Control using Meta-learned Neural Ordinary Differential Equations

Arxiv

0+阅读 · 2022年11月8日

Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年11月6日

Intelligent Reflecting Surface Enabled Multi-Target Sensing

Arxiv

0+阅读 · 2022年11月5日

Multi-Fidelity Cost-Aware Bayesian Optimization

Arxiv

0+阅读 · 2022年11月4日

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

Arxiv

12+阅读 · 2021年6月8日

Coding for Distributed Multi-Agent Reinforcement Learning

Arxiv

32+阅读 · 2021年1月7日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】用于提升含优化层学习的算法与体系结构

【NeurIPS2025】有何不同于过去？基于自监督偏差学习的时空时间序列预测

超越决策优势：情报在创新与适应中的作用

量子计算发展态势研究报告（2025年）

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Deep W-Networks: Solving Multi-Objective Optimisation Problems With Deep Reinforcement Learning

Arxiv

0+阅读 · 2022年11月9日

Discrimination and Class Imbalance Aware Online Naive Bayes

Discrimination and Class Imbalance Aware Online Naive Bayes

Arxiv

0+阅读 · 2022年11月9日

Active Exploration via Experiment Design in Markov Chains

Arxiv

0+阅读 · 2022年11月9日

Nimbus: Toward Speed Up Function Signature Recovery via Input Resizing and Multi-Task Learning

Arxiv

0+阅读 · 2022年11月8日

Adaptive Asynchronous Control using Meta-learned Neural Ordinary Differential Equations

Arxiv

0+阅读 · 2022年11月8日

Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning

Arxiv

0+阅读 · 2022年11月6日

Intelligent Reflecting Surface Enabled Multi-Target Sensing

Arxiv

0+阅读 · 2022年11月5日

Multi-Fidelity Cost-Aware Bayesian Optimization

Arxiv

0+阅读 · 2022年11月4日

Multi-Agent Cooperative Bidding Games for Multi-Objective Optimization in e-Commercial Sponsored Search

Arxiv

12+阅读 · 2021年6月8日

Coding for Distributed Multi-Agent Reinforcement Learning

Arxiv

32+阅读 · 2021年1月7日

相关基金

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

变应性鼻炎孕母暴露PM2.5对子代脐血Th2细胞亚群极化影响及其表观遗传调控研究

国家自然科学基金

0+阅读 · 2015年12月31日

β2-AR /PCBP2相互作用在胰腺癌发生发展中的作用及机制

国家自然科学基金

0+阅读 · 2014年12月31日

L-BM诱导的血流动力学改变对慢性心衰中自噬的调控和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

信息物理融合系统多领域统一建模方法及仿真策略研究

国家自然科学基金

4+阅读 · 2013年12月31日

B7-H3调控Th细胞分化及其在支气管哮喘免疫病理中的作用研究

国家自然科学基金

0+阅读 · 2013年12月31日

农村地区2型糖尿病Markov模型构建及相关干预策略经济学评价

国家自然科学基金

0+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

应力负荷介导下Ihh/PTHrP信号轴对髁突前软骨干细胞分化的影响及作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

AR/let-7及其下游分子对ER-AR+乳腺癌干细胞生长的调控机制

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员