We demonstrate an application of online transfer learning as a digital assets trading agent. This agent makes use of a powerful feature space representation in the form of an echo state network, the output of which is made available to a direct, recurrent reinforcement learning agent. The agent learns to trade the XBTUSD (Bitcoin versus US dollars) perpetual swap derivatives contract on BitMEX. It learns to trade intraday on five minutely sampled data, avoids excessive over-trading, captures a funding profit and is also able to predict the direction of the market. Overall, our crypto agent realises a total return of 350%, net of transaction costs, over roughly five years, 71% of which is down to funding profit. The annualised information ratio that it achieves is 1.46.
翻译:作为数字资产交易代理,我们展示了在线转让学习的应用。该代理利用了一个强大的功能空间代表,其形式是回声状态网络,其产出提供给直接、经常性的强化学习代理。该代理学会了在BitMEX上交易XBT$(Bitcoin对美元)永久互换衍生工具合同。它学会了以5分钟的抽样数据进行内部交易,避免了过度的过度交易,获取了资金利润,并且能够预测市场的方向。总体而言,我们的加密代理实现了350%的总回报率,扣除交易成本,大约5年中,其中71%的回报率低于为盈利供资。它实现的年度信息比率是1.46。