Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update the Gaudi trainer with transformers 4.45.2 #1398

Merged
merged 55 commits into from
Dec 9, 2024
Merged
Changes from 25 commits
Commits
Show all changes
55 commits
Select commit Hold shift + click to select a range
32830a4
Update the Gaudi trainer
yafshar Oct 4, 2024
181a182
Update the Gaudi trainer
yafshar Oct 4, 2024
e7bc0fb
Uodate the Gaudi trainer
yafshar Oct 4, 2024
01c7bf2
update the _should_update_inputs var and removed extra conditionals
yafshar Oct 4, 2024
b2bc9d2
Merge branch 'main' into trainer
yafshar Oct 7, 2024
7c5ef76
Remove is_accelerate_available with no min version, as accelerate is …
yafshar Oct 7, 2024
d1d1cb5
Uncomment load_fsdp_model since it is already in use from _load_from_…
yafshar Oct 7, 2024
404a1b3
Merge branch 'main' into trainer
yafshar Oct 7, 2024
e3f316f
cleaning, we have args = self.args, no need to use self.args in train…
yafshar Oct 7, 2024
d90b1ff
Merge branch 'main' into trainer
yafshar Oct 8, 2024
47faecd
Merge branch 'main' into trainer
yafshar Oct 8, 2024
eb54370
Revert "cleaning, we have args = self.args, no need to use self.args …
yafshar Oct 8, 2024
8f97c49
Merge branch 'main' into trainer
yafshar Oct 11, 2024
03aea0d
Update _inner_training_loop
yafshar Oct 11, 2024
cebf5d6
Merge branch 'main' into trainer
yafshar Oct 17, 2024
782d580
Merge branch 'main' into trainer
yafshar Oct 17, 2024
4fb5ad9
Merge branch 'main' into trainer
yafshar Oct 18, 2024
1afd991
Merge branch 'main' into trainer
yafshar Oct 21, 2024
8dbc8a3
Merge branch 'main' into trainer
yafshar Oct 23, 2024
d654557
Merge branch 'main' into trainer
yafshar Oct 29, 2024
eeb3e82
Merge branch 'main' into trainer
yafshar Nov 1, 2024
8fb2886
Simplify the code
yafshar Nov 1, 2024
a303607
Fix style
yafshar Nov 1, 2024
f6984bd
Merge branch 'main' into trainer
yafshar Nov 8, 2024
389b9ac
Refactor input update logic into a function
yafshar Nov 8, 2024
206c23e
Merge branch 'main' into trainer
yafshar Nov 13, 2024
7fca739
Merge branch 'main' into trainer
yafshar Nov 19, 2024
d8acb8e
Merge branch 'main' into trainer
yafshar Nov 21, 2024
3a6bb83
Merge branch 'main' into trainer
yafshar Nov 21, 2024
6682797
Merge branch 'main' into trainer
yafshar Nov 25, 2024
00d3e06
Merge branch 'main' into trainer
yafshar Nov 25, 2024
c2efaa5
Merge branch 'main' into trainer
yafshar Nov 25, 2024
196c2d8
Merge branch 'main' into trainer
yafshar Nov 26, 2024
322e9ee
Resove merge conflict
yafshar Nov 26, 2024
61975b6
Merge branch 'main' into trainer
yafshar Nov 26, 2024
751797f
Merge branch 'main' into trainer
yafshar Nov 28, 2024
64e60fa
Resolve merge conflict
yafshar Dec 2, 2024
eed58a9
Remove extra check for Accelerate
yafshar Dec 2, 2024
f74a642
Remove extra check for Accelerate
yafshar Dec 2, 2024
5950221
Merge branch 'main' into trainer
yafshar Dec 2, 2024
f0e47f5
Merge branch 'main' into trainer
yafshar Dec 2, 2024
37d8e5e
Merge branch 'main' into trainer
yafshar Dec 2, 2024
3ebe72e
Merge branch 'main' into trainer
yafshar Dec 3, 2024
f6426fb
Resolve merge conflict
yafshar Dec 3, 2024
6f296ca
Merge branch 'main' into trainer
yafshar Dec 3, 2024
aec3777
Merge branch 'main' into trainer
yafshar Dec 4, 2024
cd4465b
Make sure the lazy_mode exists in model forward
yafshar Dec 4, 2024
56ab416
Merge branch 'main' into trainer
yafshar Dec 4, 2024
eed1418
Merge branch 'main' into trainer
yafshar Dec 4, 2024
585fba4
Merge branch 'main' into trainer
yafshar Dec 5, 2024
0ea64d7
Merge branch 'main' into trainer
yafshar Dec 6, 2024
e6d582a
Fix the lazy_mode assignment for Peft models
yafshar Dec 6, 2024
3615333
Merge branch 'main' into trainer
yafshar Dec 6, 2024
f7d4013
Resolve merge conflict
yafshar Dec 6, 2024
d6fb5b3
Resolve merge conflict
yafshar Dec 9, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading