Fix PSGD, spring cleaning
- Previously, only the first parameter of PSGD was trained; This is fixed now
- All PSGDs were
PurePSGD
- nowmomentum_into_precond_update
andexp_avg_input
have their expected effect again - preliminary support for external changes of
group['lr']