Brian Ning, Franco Ho Ting Lin, Sebastian Jaimungal
TL;DR采用 Deep Q-Learning 算法,基于当前的交易信号和订单簿信息预测股票交易的最佳行动方案,并在九种不同的股票上进行了实验,在大多数股票上优于标准基准方法,且表现以超额收益率、超额率和收益 - 损失比为优。
Abstract
optimal trade execution is an important problem faced by essentially all
traders. Much research into optimal execution uses stringent model assumptions
and applies continuous time stochastic control to solve them. Here, we instead
take a model free approach and develop a variation of <