Implement Cyclic Learning Rate and Step-wise Learning Rate Scheduler #213

Franklalalala · 2024-09-27T15:24:04Z

As mentioned in [issue 211], this PR implements a Cyclic Learning Rate and a step-wise learning rate scheduler.

Major changes include:

Add a Cyclic Learning Rate (clr) scheduler in tool.py, with the corresponding args.
Add a step-wise (per-iteration) lr update scheme in trainer.py and base_trainer.py.
Add unit tests: 4 cases covering every combination of the two lr schemes (exp and clr), each with and without iteration-wise lr update.
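For readers unfamiliar with cyclic learning rates, the sketch below shows the common symmetric triangular policy in plain Python. It is only an illustration: the function name and the parameters `base_lr`, `max_lr`, and `step_size_up` are assumptions, and the scheduler added in tool.py may use a different policy or different argument names.

```python
def triangular_clr(step: int, base_lr: float, max_lr: float, step_size_up: int) -> float:
    """Learning rate at a given step under a symmetric triangular cycle.

    Hypothetical helper for illustration; not the code from this PR.
    """
    cycle_len = 2 * step_size_up          # one full cycle: ramp up, then ramp down
    pos = step % cycle_len                # position inside the current cycle
    if pos <= step_size_up:
        frac = pos / step_size_up                 # rising half: 0 -> 1
    else:
        frac = (cycle_len - pos) / step_size_up   # falling half: 1 -> 0
    return base_lr + (max_lr - base_lr) * frac


# One full cycle with 4 steps up and 4 steps down.
lrs = [triangular_clr(s, base_lr=0.001, max_lr=0.01, step_size_up=4) for s in range(9)]
```

The rate starts at `base_lr`, reaches `max_lr` after `step_size_up` steps, and returns to `base_lr` at the end of the cycle. A schedule like this is usually meant to advance once per iteration rather than once per epoch, which is presumably why the step-wise update scheme is added alongside it.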

Minor changes:

Docs have been updated in argcheck.py.
Use 'display_freq' as the tensorboard log interval (in iterations), instead of a fixed 25-iteration interval.
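As a rough illustration of the logging change, the sketch below lists which iterations would be logged for a given interval. The helper name and the choice to count iterations from 1 are assumptions; the trainer's actual counter and logging call are not shown in this PR description.

```python
def logged_iterations(total_iters: int, display_freq: int) -> list:
    """Return the iteration numbers at which a log entry would be written.

    Hypothetical helper; iterations are assumed to be counted from 1.
    """
    return [it for it in range(1, total_iters + 1) if it % display_freq == 0]


# A fixed interval of 25 versus a user-chosen display_freq of 10.
fixed = logged_iterations(100, 25)
custom = logged_iterations(100, 10)
```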

Franklalalala · 2024-11-10T14:30:20Z

The code has been cleaned up. It is now in sync with upstream.

Franklalalala · 2024-11-17T07:11:44Z

The test example has been updated: the dataset size was reduced from 5MB to 68KB.

floatingCatty · 2024-11-13T04:37:55Z

examples/clr_and_per_iter_update/data_10/data.1312.lmdb/data.mdb

Can this file be made any smaller?

floatingCatty · 2024-11-16T21:24:18Z

dptb/nnops/trainer.py

@@ -30,6 +30,7 @@ def __init__(
        self.model = model.to(self.device)
        self.optimizer = get_optimizer(model_param=self.model.parameters(), **train_options["optimizer"])
        self.lr_scheduler = get_lr_scheduler(optimizer=self.optimizer, **train_options["lr_scheduler"])  # add optmizer
+        self.update_lr_per_step_flag = train_options["update_lr_per_step_flag"]


If this flag is false, is the LR not updated at all?

floatingCatty · 2024-11-16T21:25:06Z

dptb/nnops/trainer.py

@@ -129,6 +130,11 @@ def iteration(self, batch, ref_batch=None):
        loss.backward()
        #TODO: add clip large gradient
        self.optimizer.step()
+        if self.update_lr_per_step_flag:


I don't quite understand what this switch is for.

The learning rate is only updated when self.lr_scheduler.step() is called explicitly. With this switch on, it is called in every iteration; otherwise it is called once per epoch.
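The behaviour described in this reply can be sketched as follows. This is a minimal illustration, not the trainer code: `CountingScheduler` is a stand-in that only counts `step()` calls, and the `train` function with its arguments is hypothetical. Only the flag name `update_lr_per_step_flag` comes from the diff.

```python
class CountingScheduler:
    """Stand-in for a real lr scheduler; it only counts step() calls."""

    def __init__(self):
        self.num_steps = 0

    def step(self):
        self.num_steps += 1


def train(num_epochs: int, iters_per_epoch: int, update_lr_per_step_flag: bool) -> int:
    """Run a dummy loop and return how often the scheduler was stepped."""
    scheduler = CountingScheduler()
    for _ in range(num_epochs):
        for _ in range(iters_per_epoch):
            # forward pass, loss.backward() and optimizer.step() would go here
            if update_lr_per_step_flag:
                scheduler.step()   # lr updated in every iteration
        if not update_lr_per_step_flag:
            scheduler.step()       # lr updated once per epoch
    return scheduler.num_steps


per_iteration_updates = train(3, 10, update_lr_per_step_flag=True)
per_epoch_updates = train(3, 10, update_lr_per_step_flag=False)
```

With 3 epochs of 10 iterations each, the scheduler is stepped 30 times when the flag is on and 3 times when it is off, so the learning rate is updated in both cases, just at a different granularity.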

Franklalalala and others added 8 commits August 22, 2024 17:22

feat: add tensorboard support

5208228

add tensorboard = "*" in pyproject.toml for installation

d00eefc

add doc and example

d3de3d4

minor change

4798d93

Merge branch 'deepmodeling:main' into main

11400fd

clr and stepwise lr update support

acc69b0

add unit test

7215141

Merge branch 'deepmodeling:main' into main

8cb2338

QG-phy requested a review from floatingCatty November 12, 2024 05:49

update clr test example

57c44ef

floatingCatty reviewed Nov 18, 2024

floatingCatty approved these changes Nov 18, 2024

floatingCatty merged commit 8b241f5 into deepmodeling:main Nov 18, 2024
2 checks passed

floatingCatty mentioned this pull request Nov 18, 2024

Enhancement: Implement Cyclic Learning Rate and Step-wise Learning Rate Scheduler #211

Closed
