# Cost-sensitive portfolio selection via deep reinforcement learning ###### tags: `paper` `Reinforcement Learning` `Finance` :::info Published in: IEEE Transactions on Knowledge and Data Engineering Yifan Zhang School of Software Engineering, South China University of Technology, Guangzhou, Guangdong China (e-mail: sezyifan@mail.scut.edu.cn) Peilin Zhao AI Lab, Tencent, 508929 Shenzhen, Guangdong China (e-mail: peilinzhao@hotmail.com) Bin Li School of Software Engineering, South China University of Technology, Guangzhou, Guangdong China (e-mail: qyw@scut.edu.cn) Qingyao Wu Department of Finance, Wuhan University, Wuhan, Hubei China (e-mail: binli.whu@whu.edu.cn) Junzhou Huang AI Lab, Tencent, Shenzhen, Guangdong China (e-mail: joehhuang@tencent.com) Mingkui Tan School of Software Engineering, South China University of Technology, 26467 Guangzhou, Guangdong China (e-mail: mingkuitan@scut.edu.cn) Date of Publication: 10 March 2020 dataset source : https://poloniex.com/ dataset api : https://poloniex.com/support/api/ ::: :::warning 整份備忘: $r^c_t$ : 第t個period考慮cost 的reward $\hat{r}^c_t$ : 第t個period考慮cost 的reward 取log 預測時長==test data時長 ::: 7-2 Reinforcement Learning Algorithm Selection 為何使用DPG不使用AC(Actor-Critic)?