-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
training stats event-handler (for printing training losses) #7
Comments
Hi @yanchengnv and @wyli , I found a similar example in @ericspod 's notebook example: About this task, do you mean something like it? |
We would something a bit more generalized with optional outputs for end of iteration, end of step, end of train, etc. We ought to be logging to the engine's log object instead of printing to stdout, we can add and printing handler to that log object to do both of course. |
Hi @ericspod , Thanks for your suggestion about general StatsLogger. |
Sorry for jumping in into your conversation, I just would like to help with that and make more clear on what is provided out-of-the-box in ignite for that:
In my experience, using experiment tracking systems like MLflow or Polyaxon, we can either log to the system via their api (and ignite's wrappers like HTH |
Hi @vfdev-5 , Thanks very much for your detailed sharing! And @wyli @ericspod @yanchengnv , About the usage of Ignite, have we aligned to use only Ignite official code or both official code and the 3rd contrib code? |
@vfdev-5 @Nic-Ma I've used the log file for logging just messages and such, the SessionSaver class in ptproto creates a new directory in a given parent directory for every new run and sends the log to a file there along with the checkpoints and saved networks. My subclasses of Engine add extra fields to the state and we could add more things to it, I would think that metrics should only be the output from metric handlers and shouldn't have anything else. Returning to the idea of the session handling if we're saving the whole engine state (or everything without large tensors) then these other things we add will get saved as well. |
@vfdev-5 One thing to mention is that tqdm doesn't play well with Jupyterlab for some reason, I believe it's a known bug. I had written a super primitive text progress bar that works, I don't know if we collectively want to investigate anything else. I really like doing things through Jupyter a lot, so stuff that doesn't rely on tensorboard/visdom is what I would prefer. |
There is a |
@fepegar I think that does have issues with Jupyterlab, Jupyter notebook vanilla I think is fine. I don't why but they're different. |
Yes, I've had trouble before on JupyterLab. But I think installing the widgets extension solves it: https://ipywidgets.readthedocs.io/en/latest/user_install.html#installing-the-jupyterlab-extension |
@ericspod I'm also using jupyterlab for development, it provides a cool environment for research/prototyping/testing etc.
However, how do you plan to run and then organize and compare various trainings for the same task ? |
Yan - Setup time to discuss design choice here. |
* adds network * adds basic training * update loading * working prototype * update validation set * [MONAI] Add author; paper info; PDDCA18 (#6) + Author + Early accept + PDDCA18 link * Update README.md * adds network * adds basic training * update loading * working prototype * update validation set * [MONAI] Update TRAIN_PATH, VAL_PATH (#8) + Update TRAIN_PATH, VAL_PATH * [MONAI] Add data link (#7) + Add data link https://drive.google.com/file/d/1A2zpVlR3CkvtkJPvtAF3-MH0nr1WZ2Mn/view?usp=sharing * fixes typos * tested new dataset * print more infor, checked new dataset * [MONAI] Add paper link (#9) Add paper link https://arxiv.org/abs/2006.12575 * [MONAI] Use dice loss + focal loss to train (#10) Use dice loss + focal loss to train * [MONAI] Support non-one-hot ground truth (#11) Support non-one-hot ground truth * fixes format and docstrings, adds argparser options * resume the focal_loss * adds tests * [MONAI] Support non-one-hot ground truth (#11) Support non-one-hot ground truth * adds tests * update docstring * [MONAI] Keep track of best validation scores (#12) Keep track of best validation scores * model saving * adds window sampling * update readme * update docs * fixes flake8 error * update window sampling * fixes model name * fixes channel size issue * [MONAI] Update --pretrain, --lr (#13) + lr from 5e-4 to 1e-3 because we use mean for class channel instead of sum for class channel. + pretrain path is consistent with current model_name. * [MONAI] Pad image; elastic; best class model (#14) * [MONAI] Pad image; elastic; best class model + Pad image bigger than crop_size, avoid potential issues in RandCropByPosNegLabeld + Use Rand3DElasticd + Save best model for each class * Update train.py Co-authored-by: Wenqi Li <wenqil@nvidia.com> * flake8 fixes * removes -1 cropsize deform * testing commands * fixes unit tests * update spatial padding * [MONAI] Add full image deform augmentation (#15) + Add full image deform augmentation by Rand3DElasticd + Please use latest MONAI in #623 * Adding py.typed * updating setup.py to comply with black * update based on comments * excluding research from packaging * update tests * update setup.py Co-authored-by: Wentao Zhu <wentaozhu1991@gmail.com> Co-authored-by: Neil Tenenholtz <ntenenz@users.noreply.github.com> Co-authored-by: Nic Ma <nma@nvidia.com>
No description provided.
The text was updated successfully, but these errors were encountered: