Skip to content

Commit

Permalink
updated readme
Browse files Browse the repository at this point in the history
  • Loading branch information
joshuaspear committed Jul 17, 2024
1 parent a1a86b0 commit 94cbb86
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -97,6 +97,7 @@ The different kinds of importance samples can also be visualised by querying the
* Added explicit stochastic policies for d3rlpy
* Introduced 'policy_func' which is any function/method which outputs type Union[TorchPolicyReturn, NumpyPolicyReturn]
* Simplified and unified ISCallback in d3rlpy/api using PolicyFactory
* Added 'premade' doubly robust estimators for vanilla DR, weighted DR, per-decision DR and weighted per-decision DR

#### 5.0.1
* Fixed bug where GreedyDeterministic couldn't handle multi-dimensional action spaces
Expand Down

0 comments on commit 94cbb86

Please sign in to comment.