Q函数论文 - 专知

会员服务 ·

Q函数

Data Poisoning to Fake a Nash Equilibrium in Markov Games

Arxiv

0+阅读 · 2024年6月18日

Enhancing Q-Learning with Large Language Model Heuristics

Arxiv

0+阅读 · 2024年5月9日

Enhancing Q-Learning with Large Language Model Heuristics

Arxiv

0+阅读 · 2024年5月6日

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

Arxiv

0+阅读 · 2024年3月18日

OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences

Arxiv

0+阅读 · 2024年2月7日

FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game

Arxiv

0+阅读 · 2024年2月1日

Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation

Arxiv

0+阅读 · 2024年1月25日

Bridging RL Theory and Practice with the Effective Horizon

Arxiv

0+阅读 · 2024年1月11日

Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control

Arxiv

0+阅读 · 2023年11月13日

Bridging RL Theory and Practice with the Effective Horizon

Arxiv

0+阅读 · 2023年11月3日

Virtual Action Actor-Critic Framework for Exploration (Student Abstract)

Arxiv

0+阅读 · 2023年11月6日

Towards Safe Propofol Dosing during General Anesthesia Using Deep Offline Reinforcement Learning

Arxiv

0+阅读 · 2023年11月2日

Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders

Arxiv

0+阅读 · 2023年9月22日

Safe and Robust Multi-Agent Reinforcement Learning for Connected Autonomous Vehicles under State Perturbations

Arxiv

0+阅读 · 2023年9月20日

Evaluation of Deep Reinforcement Learning Algorithms for Portfolio Optimisation

Arxiv

0+阅读 · 2023年7月31日

参考链接

微信扫码咨询专知VIP会员