This is agent F.

A policy-gradient agent with a very basic environment and a "deep" neural network. At each timestep (one trading day), the agent chooses a portfolio weight vector over the assets and then executes the corresponding transactions. The optimization objective is the differential Sharpe ratio.
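For readers unfamiliar with the differential Sharpe ratio, here is a minimal sketch of the commonly cited Moody and Saffell formulation, which tracks exponential moving averages of the returns and squared returns and yields a per-step reward. The class name `DifferentialSharpe`, the adaptation rate `eta`, and the guard `eps` are assumptions made for illustration; they are not taken from `agent.py` and may differ from what this project actually does.

```python
from dataclasses import dataclass


@dataclass
class DifferentialSharpe:
    """Hypothetical helper: incremental differential Sharpe ratio."""

    eta: float = 0.01   # adaptation rate of the moving averages (assumed value)
    a: float = 0.0      # moving average of returns, A_{t-1}
    b: float = 0.0      # moving average of squared returns, B_{t-1}
    eps: float = 1e-12  # guard against a near-zero variance estimate

    def update(self, r: float) -> float:
        """Consume one portfolio return and return the differential Sharpe ratio."""
        delta_a = r - self.a        # R_t - A_{t-1}
        delta_b = r * r - self.b    # R_t^2 - B_{t-1}
        variance = self.b - self.a ** 2
        if variance > self.eps:
            d = (self.b * delta_a - 0.5 * self.a * delta_b) / variance ** 1.5
        else:
            # Not enough history yet to estimate the variance.
            d = 0.0
        # Update the moving averages only after computing the reward.
        self.a += self.eta * delta_a
        self.b += self.eta * delta_b
        return d


# Usage: feed daily portfolio returns one at a time.
dsr = DifferentialSharpe(eta=0.01)
rewards = [dsr.update(r) for r in (0.010, -0.004, 0.007)]
```

The first call returns 0.0 because no variance estimate exists yet; in practice one might warm up the averages on historical returns before using the value as a reward.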

Warning: This is a very rough experimental toy project.

TODO:

Include a better form of normalization.
Include a way to assess the efficiency of the learning process.
Improve the loss function.
Include some temporality and recurrence in the model or in the data processing.
Refine the reward to give more accurate credit to the right actions.


SergeOlivierP/agentF

Folders and files

Latest commit

History

Repository files navigation

This is agent F.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages