Introduction

Architecture

TradeMaster could be beneficial to a wide range of communities including leading trading firms, startups, financial service providers and personal investors. We hope TradeMaster can make a change for the whole pipeline of FinRL to prevent untrustworthy results and lead successful industry deployment.
Architecture of Trademaster framework could be visualizaed by the figure below.

Level 1 Simulation

Market Data and User Preference are fed into the network for data processing. In this step, data preprocessing, mining, augmentation, feature selection and behaviour cloning are performed to provide evident simulation of noisy real-world financial market.
Level 2 Algorithm

A collection of various reinforcement learning algotithms are developed to provide feasible solution to different financial tasks like Algorithm Trading, Order Excuction and Porforlio Management. The RL algorithms will generate strategy to maximize user profit.
Level 3 Evaluation

TradeMaster is evaluated in multiple dimenstions. Financial metrics like profit and risk metrics are applied. Additionally, decision tree and shapley value are used to evaluate the explainability of the model. Variability and Alpha decay are used for reliability evaluation.

Supported Trading Scenario

Algorithmic Trading

Algorithmic trading (AT) is a trading scenario that involves using deep reinforcement learning methods to execute trades. This scenario is often used by traders who wish to execute large numbers of trades quickly and efficiently, especially in high-frequency trading.

For more information, please refer to Algorithmic Trading.

Order Execution

Order execution (OE) is a supported trading scenario that involves placing orders to buy or sell securities. This scenario is often used by traderswho wish to execute trades automatically and optimally.

The kev difference of this 0E and algorithmic trading is that OE can only put an action at one side. For example, if the 0E task is to buy oneshare of BTC, you cannot put an “ask” order even if you stil have BTC in your hand. in 0E, we want to sell at the highest price or buy at thelowest price. Therefore, the optimization target is set to be the amount of money we sel. f the target is buying, our target will be negativeBoth targets will be optimized to their maximum value. All of the trades will be conducted at their closing price.

For more information, please refer to Order Execution.

Portfolio Management

Portfolio management (PM) is a trading scenario that involves managing a collection of investments over time. lt’s used by investors who wantto minimize nsk and diversify ther nvestments, This scenano often invo ves a ono-term investment strateoy. rather than tocusind on shortterm trading opportunities.

For more information, please refer to Order Execution.

Model Zoo

Classic RL based on Pytorch and Ray:

PPO

Please refer to PPO for details.

A2C

Please refer to A2C for details.

SAC

Please refer to SAC for details.

DDPG

Please refer to DDPG for details.

DQN

Please refer to DQN for details.

PG

Please refer to PG for details.

TD3

Please refer to TD3 for details.

Introduction

Architecture

Supported Trading Scenario

Algorithmic Trading

Order Execution

Portfolio Management

Model Zoo

DeepScalper

OPD

DeepTrader

SARL

ETTO

Investor-Imitator

EIIE