Tutorial 4: Portfolio Management with PPO
PPO strikes a balance between ease of implementation, sample complexity, and ease of tuning, trying to compute an update at each step that minimizes the cost function while ensuring the deviation from the previous policy is relativel small.
In this notebook, we implement the training and testing process of PPO based on the TradeMaster framework.
And this is the script for training and testing.