Super-Convergence on CIFAR10
sophia
cifar10
lion
second-order-optimization
adamw
super-convergence
weight-decay
sharpness-aware-minimization
madgrad
large-batch-optimization
lion-optimizer
Updated Jun 17, 2024 - Python