Releases · masus04/Deep-Reinforcement-Learning-for-Boardgames

16 Sep 18:21

masus04

Unified Experiments Pre-release

Pre-release

0.65

Preparation

Assets 2

15 Sep 13:54

masus04

Major network updates Pre-release

Pre-release

Make use of LogSoftmax().exp() for numerically stable and non spiking LegalSoftmax module
Fixes TTT Baseline Player's loss function bug
Improved plotting

Assets 2

19 Aug 08:08

masus04

Major TicTacToe fix Pre-release

Pre-release

Resolved config file issue which impacts all TicTacToe experiments. All TicTacToe experiments are run after this point.

Assets 2

11 Aug 21:01

masus04

Major Functionality release Pre-release

Pre-release

Includes major functionality:

-> All experiments are based on this release. Will create new release if learning players are changed.

Assets 2

Provide feedback