Releases: masus04/Deep-Reinforcement-Learning-for-Boardgames

Unified Experiments

16 Sep 18:21
Pre-release
0.65

Preparation

Major network updates

15 Sep 13:54
Pre-release
  • Use LogSoftmax().exp() for a numerically stable, non-spiking LegalSoftmax module
  • Fix loss function bug in the TicTacToe (TTT) Baseline player
  • Improve plotting
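
The first item relies on computing the softmax in log space and exponentiating afterwards. The sketch below illustrates that idea in plain Python; it is not the repository's LegalSoftmax module. The function name `legal_softmax`, the boolean `legal_mask` argument, and the choice to renormalize over legal moves only are assumptions made for illustration. In PyTorch, the equivalent of the two inner steps would be `torch.nn.LogSoftmax` followed by `.exp()`.

```python
import math


def legal_softmax(logits, legal_mask):
    """Softmax restricted to legal moves, computed as exp(log_softmax(x)).

    Hypothetical helper: name, signature and masking strategy are assumptions.
    """
    legal = [x for x, ok in zip(logits, legal_mask) if ok]
    if not legal:
        raise ValueError("at least one move must be legal")

    # Log-sum-exp trick: shift by the maximum so exp() cannot overflow.
    m = max(legal)
    log_z = m + math.log(sum(math.exp(x - m) for x in legal))

    # log_softmax(x) = x - log_z is never positive, so exp() stays in [0, 1].
    # Illegal moves are assigned probability 0.
    return [math.exp(x - log_z) if ok else 0.0
            for x, ok in zip(logits, legal_mask)]


# Logits this large would overflow a naive exp(x) / sum(exp(x)).
probs = legal_softmax([1000.0, 999.0, 5.0], [True, True, False])
```

A naive softmax would call `math.exp(1000.0)` here and raise `OverflowError`; the log-space version only ever exponentiates non-positive numbers.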

Major TicTacToe fix

19 Aug 08:08
Pre-release

Resolved a config file issue that affected all TicTacToe experiments. All TicTacToe experiments are run from this release onward.

Major Functionality release

11 Aug 21:01
Pre-release

Includes major functionality:

  • Framework
  • TicTacToe & Othello
  • Reinforce, Baseline & Actor Critic players
  • Search player
  • GUI

-> All experiments are based on this release. A new release will be created if the learning players are changed.