fix the split placement example #281

xffxff · 2025-02-15T07:04:56Z

The split placement example is outdated, I tried it and encountered some errors. To address this, the following changes were made in this PR

Copied the content from verl/trainer/config/ppo_trainer.yaml to examples/split_placement/config/ppo_trainer_split.yaml
Copied RayPPOTrainer.fit method into the fit func in examples/split_placement/split_monkey_patch.py and modified it to get the futures of critic_output and actor_output

xffxff · 2025-02-15T07:06:21Z

examples/split_placement/split_monkey_patch.py

+
+                    actor_output = actor_output.get()
+                    actor_output_metrics = reduce_metrics(actor_output.meta_info['metrics'])
+                    metrics.update(actor_output_metrics)


Modified to get futures of critic_output and actor_output

Nice job! What do you modify here?

Modified to get futures of critic_output and actor_output

The comments here are just to highlight the changes compared to RayTrainer.fit, because most of the code is copied from RayTrainer.fit.

These changes are copied from examples/split_placement/split_monkey_patch.py, because the update_actor and update_critic would be non-blocking as described in https://github.com/volcengine/verl/tree/main/examples/split_placement#step-2-make-the-models-executed-asynchronously. I didn't add any new things here, I just copied bits from both RayTrainer.fit and examples/split_placement/split_monkey_patch.py to make it consistent with the original logic.

The split placement example is outdated, I tried it and encountered some errors. To address this, the following changes were made in this PR 1. Copied the content from `verl/trainer/config/ppo_trainer.yaml` to `examples/split_placement/config/ppo_trainer_split.yaml` 2. Copied `RayPPOTrainer.fit` method into the `fit` func in `examples/split_placement/split_monkey_patch.py` and modified it to get the futures of `critic_output` and `actor_output`

fix the split placement example

fc63de2

xffxff commented Feb 15, 2025

View reviewed changes

PeterSH6 approved these changes Feb 15, 2025

View reviewed changes

PeterSH6 merged commit c8b9c35 into volcengine:main Feb 15, 2025
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix the split placement example #281

fix the split placement example #281

xffxff commented Feb 15, 2025 •

edited

Loading

xffxff Feb 15, 2025

PeterSH6 Feb 15, 2025

xffxff Feb 15, 2025

fix the split placement example #281

fix the split placement example #281

Conversation

xffxff commented Feb 15, 2025 • edited Loading

xffxff Feb 15, 2025

Choose a reason for hiding this comment

PeterSH6 Feb 15, 2025

Choose a reason for hiding this comment

xffxff Feb 15, 2025

Choose a reason for hiding this comment

xffxff commented Feb 15, 2025 •

edited

Loading