足球俱乐部葡萄牙3D仿真队：2020年团队介绍论文 (FC Portugal 3D Simulation Team: Team Description Paper 2020)

The FC Portugal 3D team is developed upon the structure of our previous Simulation league 2D/3D teams and our standard platform league team. Our research concerning the robot low-level skills is focused on developing behaviors that may be applied on real robots with minimal adaptation using model-based approaches. Our research on high-level soccer coordination methodologies and team playing is mainly focused on the adaptation of previously developed methodologies from our 2D soccer teams to the 3D humanoid environment and on creating new coordination methodologies based on the previously developed ones. The research-oriented development of our team has been pushing it to be one of the most competitive over the years (World champion in 2000 and Coach Champion in 2002, European champion in 2000 and 2001, Coach 2nd place in 2003 and 2004, European champion in Rescue Simulation and Simulation 3D in 2006, World Champion in Simulation 3D in Bremen 2006 and European champion in 2007, 2012, 2013, 2014 and 2015). This paper describes some of the main innovations of our 3D simulation league team during the last years. A new generic framework for reinforcement learning tasks has also been developed. The current research is focused on improving the above-mentioned framework by developing new learning algorithms to optimize low-level skills, such as running and sprinting. We are also trying to increase student contact by providing reinforcement learning assignments to be completed using our new framework, which exposes a simple interface without sharing low-level implementation details.

翻译：FC葡萄牙3D队基于先前的Simulation League 2D/3D队和标准平台联赛队的结构开发。我们对机器人低级技能的研究集中在开发行为上，这些行为可以在采用基于模型的方法的真实机器人上进行最小的适应。我们对高级足球协调方法和团队比赛的研究主要集中在从我们的2D足球队中自适应先前开发的方法到3D人形环境中并创建基于先前开发的协调方法的新方法上。我们以研究为导向的团队开发一直在推进其成为多年来最具竞争力的团队之一（2000年世界冠军和2002年教练冠军、2000年和2001年欧洲冠军、2003年和2004年教练亚军、2006年Rescue仿真和仿真3D欧洲冠军、2006年Bremen仿真3D世界冠军，2007年，2012年，2013年，2014年和2015年欧洲冠军）。本文介绍了我们的3D仿真联赛队在过去几年中的一些主要创新。也开发了一种新的通用框架，用于强化学习任务。目前的研究集中在通过开发新的学习算法来优化低级技能，例如奔跑和短跑，以改进上述框架。我们还通过提供使用我们的新框架完成强化学习任务的作业来增加学生联系，该框架公开了一个简单的接口，而不共享低级实现细节。