Speaker: Zihao Chen (SUSTech)
Time: Mar 3, 2023, 21:10-21:40
Location: M5024, College of Science Bldg.
Abstract
This lecture introduces how reinforcement learning can be used to solve an impulse control problem. First, the impulse control problem is converted into an optimal stopping problem. A reinforcement learning method is then used to solve the optimal stopping problem, and finally a verification theorem is used to obtain the optimal impulse control.