Graduate Student Colloquium

Impulse control via reinforcement learning

  • Speaker: Chen Zihao (陈子豪), Southern University of Science and Technology (SUSTech)

  • Time: 2023-03-03 21:10-21:40

  • Venue: Discussion Room M5024, College of Science Building

Abstract: This talk introduces how reinforcement learning can be used to solve an impulse control problem. First, the impulse control problem is converted into an optimal stopping problem. A reinforcement learning method is then used to solve the optimal stopping problem, and finally a verification theorem is applied to obtain the optimal impulse control.