
Add Prodigy Plus Schedule Free optimizer #614

Open · saunderez wants to merge 12 commits into master from add-prodigy-plus-schedule-free

Conversation

@saunderez saunderez commented Dec 25, 2024


For more details, open the Copilot Workspace session.

…mizerParamsWindow.py`

* Add `split_groups` parameter with title, tooltip, and type.
* Adjust formatting for `cautious` parameter to align with new `split_groups` parameter.
…ate_optimizer` function in `modules/util/create.py`.

* **Remove `decouple` parameter:**
  - Remove `decouple` parameter from the `create_optimizer` function for different optimizer configurations.

* **Add `prodigy_steps` parameter:**
  - Add `prodigy_steps` parameter to the `create_optimizer` function for `PRODIGY_PLUS_SCHEDULE_FREE` optimizer configuration.
@@ -154,6 +159,11 @@ def create_dynamic_ui(

# Extract the keys for the selected optimizer
for index, key in enumerate(OPTIMIZER_DEFAULT_PARAMETERS[selected_optimizer].keys()):
    if selected_optimizer == Optimizer.PRODIGY_PLUS_SCHEDULE_FREE and key not in [

Owner commented:
Can you explain this change? I'm not quite sure if or why it's needed

Owner commented:

@saunderez Can you comment on this? I don't want to merge something if I don't understand why it's done

Contributor commented:

I think this is a side effect of setting the lr to 1.0 in the OPTIMIZER_DEFAULT_PARAMETERS and not wanting to present it as configurable.

            if selected_optimizer == Optimizer.PRODIGY_PLUS_SCHEDULE_FREE and key in [
                'lr'
            ]:
                continue

is really what it's doing here.

Given that the other learning-rate-free optimizers don't do this, it's probably better if lr is removed from Optimizer.PRODIGY_PLUS_SCHEDULE_FREE entirely, and then the conditional isn't necessary.

Out of scope for this change would be to fix this sharp edge for all the optimizers that expect an lr of 1.0.
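
A hypothetical sketch of that broader fix (editorial illustration, not code from this PR): keep one collection of learning-rate-free optimizers and skip the lr key for all of them inside create_dynamic_ui, rather than special-casing PRODIGY_PLUS_SCHEDULE_FREE.

    # Illustrative only: LR_FREE_OPTIMIZERS is a hypothetical constant, not existing OneTrainer code.
    LR_FREE_OPTIMIZERS = {
        Optimizer.PRODIGY_PLUS_SCHEDULE_FREE,
        # ...any other optimizers that expect lr to stay fixed at 1.0
    }

    for index, key in enumerate(OPTIMIZER_DEFAULT_PARAMETERS[selected_optimizer].keys()):
        # Hide the lr widget for every learning-rate-free optimizer instead of
        # special-casing a single one.
        if key == 'lr' and selected_optimizer in LR_FREE_OPTIMIZERS:
            continue
        # ...build the UI row for this key as before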

@Koratahiu

File "OneTrainer\modules\trainer\GenericTrainer.py", line 539, in __apply_fused_back_pass
    for param_group in self.model.optimizer.param_groups:
AttributeError: 'NoneType' object has no attribute 'param_groups'

pretty broken
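
One possible minimal guard, sketched only from the traceback above (it surfaces the problem with a clear error rather than fixing why model.optimizer is still None when the fused backward pass hook runs):

    # Sketch only: guard before the loop shown in the traceback, so a missing
    # optimizer fails with a descriptive message instead of an AttributeError.
    if self.model.optimizer is None:
        raise RuntimeError(
            "Fused backward pass requires the optimizer to be created first; "
            "model.optimizer is still None at this point."
        )
    for param_group in self.model.optimizer.param_groups:
        ...  # existing fused-back-pass logic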

  This commit adds a variant of the Prodigy Optimizer
  created by LoganBooker
  https://github.com/LoganBooker/prodigy-plus-schedule-free/tree/main

  It is both learning rate free and schedule free.
  It also contains experimental optimization techniques
  as well as general memory usage and performance
  improvements.

  Use with constant scheduler

  Based on code from:
  https://github.com/facebookresearch/schedule_free
  https://github.com/konstmish/prodigy

  Incorporates improvements from these pull requests (credit to https://github.com/dxqbYD and https://github.com/sangoi-exe):
  konstmish/prodigy#23
  konstmish/prodigy#22
  konstmish/prodigy#20

  Supports fused backwards pass.
  Experimental features:
  ADOPT https://arxiv.org/abs/2411.02853
  Cautious https://arxiv.org/pdf/2411.16085
  MuonPP https://github.com/KellerJordan/Muon/blob/master/muon.py
  StableAdamW https://optimi.benjaminwarner.dev/optimizers/stableadamw/

  Probably some other stuff I forgot. For full details, see the repository linked above.
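
For reference, a minimal usage sketch outside OneTrainer (not part of this PR); the import path and constructor arguments are assumptions based on the upstream project's README and should be checked against the installed prodigy-plus-schedule-free package:

    import torch

    # Import path assumed from the upstream README; verify against the installed package.
    from prodigyplus.prodigy_plus_schedulefree import ProdigyPlusScheduleFree

    model = torch.nn.Linear(128, 10)

    # lr stays at 1.0 because the optimizer adapts the step size itself (learning rate free),
    # and no LR scheduler is used because it is also schedule free (constant schedule).
    optimizer = ProdigyPlusScheduleFree(model.parameters(), lr=1.0)

    for _ in range(10):
        optimizer.zero_grad()
        loss = model(torch.randn(4, 128)).sum()
        loss.backward()
        optimizer.step()

As with other schedule-free optimizers, switching the optimizer between train and eval modes around validation (per the upstream README) may also be needed; the sketch omits that.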
@saunderez saunderez force-pushed the add-prodigy-plus-schedule-free branch from 2e934c9 to 878c9c2 on December 30, 2024 at 18:07
use_adopt: bool
prodigy_steps: int
use_adopt: bool
use_cautious: bool

Contributor commented:
Is this supposed to be here twice?

'use_cautious': {'title': 'Use Cautious', 'tooltip': 'Experimental. Perform "cautious" updates, as proposed in https://arxiv.org/pdf/2411.16085. Recommended: False', 'type': 'bool'},
'use_adopt': {'title': 'Use ADOPT', 'tooltip': 'Experimental. Partial implementation of (https://arxiv.org/abs/2411.02853). Recommended: False', 'type': 'bool'},
'lr': {'title': 'Learning Rate', 'tooltip': 'Learning rate adjustment parameter. Increases or decreases the Prodigy learning rate. Recommended: 1.0', 'type': 'float'},
'weignt_decay_by_lr': {'title': 'Weight Decay by LR', 'tooltip': 'If True, weight_decay is multiplied by the adaptive learning rate. Recommended: True', 'type': 'bool'},

Contributor commented:
weignt_decay_by_lr -> weight_decay_by_lr

Contributor commented:
Actually this is in a couple of places.

data.append(("use_muon_pp", False, bool, False))
data.append(("use_cautious", False, bool, False))
data.append(("use_adopt", False, bool, False))
data.append(("prodigy_steps", 0, int, False))

Contributor commented:
missing weight_decay_by_lr
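
Presumably the missing entry would follow the same pattern as the lines above, with the default of True taken from the tooltip earlier in the diff (a sketch, not a tested change):

    data.append(("weight_decay_by_lr", True, bool, False))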

@@ -34,6 +34,7 @@ lion-pytorch==0.2.2 # lion optimizer
prodigyopt==1.0 # prodigy optimizer
schedulefree==1.3.0 # schedule-free optimizers
pytorch_optimizer==3.3.0 # pytorch optimizers
prodigy-plus-schedule-free==1.8.0

Contributor commented:
1.9.0 got released rather recently. Only interface change is an extra parameter, factored_fp32 with a default of True.
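
If the pin is bumped to 1.9.0, the new parameter would presumably also need a matching defaults/tooltip entry; a hypothetical sketch following the existing pattern (tooltip wording is illustrative, the default of True comes from the comment above):

    'factored_fp32': {'title': 'Factored FP32', 'tooltip': 'New in 1.9.0; see the upstream project for details. Recommended: True', 'type': 'bool'},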

@O-J1 O-J1 added the followup label (Failure to provide config or other info or needs followup) on Feb 9, 2025