Releases: lucidrains/self-rewarding-lm-pytorch
Releases · lucidrains/self-rewarding-lm-pytorch
0.2.12
What's Changed
- Fixed deep copy, shallow copy error and label mask error. by @Control-derek in #29
Full Changelog: 0.2.11...0.2.12
0.2.11
What's Changed
- Solves the problem that some variables are not declared by @Control-derek in #28
Full Changelog: 0.2.10...0.2.11
0.2.10
What's Changed
- Solves the problem that some variables are not declared by @Control-derek in #27
Full Changelog: 0.2.9...0.2.10
0.2.9
What's Changed
- add self. by @Control-derek in #26
New Contributors
- @Control-derek made their first contribution in #26
Full Changelog: 0.2.8...0.2.9
0.2.8
What's Changed
- Fix TypeError for is_valid_reward in SelfRewardDPOConfig by @ViswanathaReddyGajjala in #19
New Contributors
- @ViswanathaReddyGajjala made their first contribution in #19
Full Changelog: 0.2.7...0.2.8
0.2.7
What's Changed
- Update self_rewarding_lm_pytorch.py by @unaidedelf8777 in #17
New Contributors
- @unaidedelf8777 made their first contribution in #17
Full Changelog: 0.2.5...0.2.7
0.2.5
Full Changelog: 0.2.4...0.2.5
0.2.4
Full Changelog: 0.2.3...0.2.4
0.2.3
Full Changelog: 0.2.2...0.2.3
0.2.2
Full Changelog: 0.2.1...0.2.2