update tdmpc into kscale sim module. #72

chamorajg · 2024-09-12T20:07:56Z

No description provided.

sim/tdmpc/src/algorithm/tdmpc.py

budzianowski · 2024-09-13T16:59:40Z

Do you have some results of standing?

WT-MM

Looks good - could you run make format + make static-checks to make sure that this pr passes the status checks?

chamorajg · 2024-09-13T19:42:03Z

Do you have some results of standing?

I started training the stompypro only yesterday and its probably doing 100 iterations per day (takes ever so long to train). I have standing results of dora taken at 200 iterations (25M timesteps).

I have results of stompy pro learning to stand a bit but its at the very early stages of training.

budzianowski · 2024-09-13T21:28:39Z

I see! What is the most time consuming step in the pipeline?

chamorajg · 2024-09-13T22:15:32Z

horizon update. This line makes the training process so slow.

budzianowski · 2024-09-13T22:47:53Z

Horizon is only 5 and mlp is tiny. I don't understand why would it be that slow?

chamorajg · 2024-09-14T00:59:34Z

The planning step that collects samples from interactions with the environment (MPC) is pretty slow. The number of iterations that we set for humanoid type task is around ~10-12.

budzianowski

see Makefile for linting and semi building. See https://github.com/kscalelabs/sim/blob/master/CONTRIBUTING.md

budzianowski · 2024-09-14T01:14:09Z

sim/train_tdmpc.py

+		if tdmpc_cfg.save_model and episode_idx % tdmpc_cfg.eval_freq_episode == 0:
+			L.save(agent, f"tdmpc_policy_{int(step // tdmpc_cfg.episode_length) + 1}.pt")
+			buffer.save(str(work_dir / "buffer.pt"))
+	# 		# common_metrics['episode_reward'] = evaluate(env, agent, h1 if L.video is not None else None, tdmpc_cfg.eval_episodes, step, env_step, L.video, tdmpc_cfg.action_repeat)


update tdmpc into kscale sim module.

d5023a9

budzianowski reviewed Sep 13, 2024

View reviewed changes

sim/tdmpc/src/algorithm/tdmpc.py Outdated Show resolved Hide resolved

chamorajg added 2 commits September 12, 2024 17:35

update moved tdmpc up into the algo folder.

31ce27f

update tdmpc fix play_tdmpc.py imports.

96f0b91

budzianowski requested a review from WT-MM September 13, 2024 16:59

WT-MM reviewed Sep 13, 2024

View reviewed changes

budzianowski reviewed Sep 14, 2024

View reviewed changes

feat: update tdmpc and buffer update.

f3468cf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update tdmpc into kscale sim module. #72

update tdmpc into kscale sim module. #72

chamorajg commented Sep 12, 2024

budzianowski commented Sep 13, 2024

WT-MM left a comment

chamorajg commented Sep 13, 2024 •

edited

Loading

budzianowski commented Sep 13, 2024

chamorajg commented Sep 13, 2024

budzianowski commented Sep 13, 2024

chamorajg commented Sep 14, 2024

budzianowski left a comment

budzianowski Sep 14, 2024

update tdmpc into kscale sim module. #72

Are you sure you want to change the base?

update tdmpc into kscale sim module. #72

Conversation

chamorajg commented Sep 12, 2024

budzianowski commented Sep 13, 2024

WT-MM left a comment

Choose a reason for hiding this comment

chamorajg commented Sep 13, 2024 • edited Loading

budzianowski commented Sep 13, 2024

chamorajg commented Sep 13, 2024

budzianowski commented Sep 13, 2024

chamorajg commented Sep 14, 2024

budzianowski left a comment

Choose a reason for hiding this comment

budzianowski Sep 14, 2024

Choose a reason for hiding this comment

chamorajg commented Sep 13, 2024 •

edited

Loading