feat(jigsaw): Implement the Jigsaw env #147

RuanJohn · 2023-05-29T15:32:01Z

Implements the full Jigsaw environment with actor critic networks.
closes #143

CLAassistant · 2023-05-29T15:32:08Z

All committers have signed the CLA.

sash-a

Just had a look at the docs, will check the rest tomorrow 😄

docs/environments/jigsaw.md

DriesSmit

Thanks @RuanJohn 🙌 This looks great 🙂 I will do a more detailed review tomorrow.

jumanji/environments/packing/jigsaw/env.py

Co-authored-by: Sasha <reallysasha@gmail.com>

DriesSmit · 2023-05-30T07:25:12Z

jumanji/environments/packing/jigsaw/conftest.py

+@pytest.fixture()
+def piece_one_partially_placed(board_with_piece_one_placed: chex.Array) -> chex.Array:
+    """A 2D array of zeros where piece one has been placed partially correctly.
+    That is to say that there is overlap between where the piece has been placed and


Can you add the correct tabs throughout the codebase, please :)

sash-a

Types and reward ✔️

sash-a · 2023-05-30T08:01:51Z

jumanji/training/configs/config.yaml

-    - env: snake  # [bin_pack, cleaner, connector, cvrp, game_2048, job_shop, knapsack, maze, minesweeper, rubiks_cube, snake, tsp]
+    - env: jigsaw # [bin_pack, cleaner, connector, cvrp, game_2048, jigsaw, job_shop, knapsack, maze, minesweeper, rubiks_cube, snake, tsp]

-agent: random  # [random, a2c]


Change this back please

jumanji/environments/packing/jigsaw/types.py

sash-a · 2023-05-30T08:17:54Z

jumanji/environments/packing/jigsaw/types.py

+    num_pieces: chex.Numeric  # ()
+    solved_board: chex.Array  # (num_rows, num_cols)
+    pieces: chex.Array  # (num_pieces, 3, 3)
+    action_mask: chex.Array  # (num_pieces, num_rotations, num_rows-3, num_cols-3)


Are you calculating the action mask from scratch each step? If so you don't need it in the state

jumanji/environments/packing/jigsaw/reward.py

docs/environments/jigsaw.md

jumanji/environments/packing/jigsaw/env.py

jumanji/environments/packing/jigsaw/env_test.py

jumanji/environments/packing/jigsaw/generator.py

djbyrne · 2023-05-30T09:30:11Z

jumanji/environments/packing/jigsaw/reward.py

+from jumanji.environments.packing.jigsaw.types import State
+
+
+class RewardFn(abc.ABC):


I think a protocol would make more sense here than a base class as it is just the callable? @clement-bonnet

sash-a

Looks great, thanks Ruan! Really minor style points from my side.

Only thing I haven't looked at is the networks because I assume those are still being tweaked

jumanji/environments/packing/jigsaw/generator.py

sash-a · 2023-05-30T11:02:47Z

jumanji/environments/packing/jigsaw/env.py

+        chosen_piece = rotate_piece(chosen_piece, rotation)
+
+        grid_piece = self._expand_piece_to_board(chosen_piece, row_idx, col_idx)
+        grid_mask_piece = self._get_ones_like_expanded_piece(grid_piece=grid_piece)


I think this works and gets rid of the extra method

Suggested change

grid_mask_piece = self._get_ones_like_expanded_piece(grid_piece=grid_piece)

grid_mask_piece = grid_piece == piece_idx

jumanji/environments/packing/jigsaw/env.py

sash-a · 2023-05-30T11:09:32Z

jumanji/environments/packing/jigsaw/env.py

+        grids = batch_expand_piece_to_board(rotated_pieces, rows, cols)
+
+        batch_get_ones_like_expanded_piece = jax.vmap(
+            self._get_ones_like_expanded_piece, in_axes=(0)


If you delete _get_ones_like_expanded_piece then:

Suggested change

self._get_ones_like_expanded_piece, in_axes=(0)

lambda x: x != 0, in_axes=(0)

jumanji/environments/packing/jigsaw/env.py

RuanJohn · 2023-05-30T13:17:04Z

There is a problem with the current formulation of the problem. In order to make the problem solvable in its current configuration would require the agent to have access to the solved board which makes the problem non-combinatorial.

The way forward will be to rework Jigsaw as a new environment called FlatPack. This will be a 2D, discrete and flattened version of the BinPack problem with potential positive transfer to the Tetris environment since placed blocks will still interlock with each other.

RuanJohn added 21 commits May 21, 2023 20:04

feat: initial jigsaw commit.

960ead0

feat: added puzzle numbers to env viewer

0e6c994

feat: initial code for random agent network.

19f21f1

feat: remove board action mask.

3988f4d

feat: add jigsaw random agent.

14a53ed

chore: change board_dim to num_rows and num_cols

62625c0

feat: register environment and add random networks

6828cfa

feat: full action mask working.

b81dda2

feat: cleaner action mask generation.

f67315c

feat: added jigsaw documentation

941fee1

chore: typo fix.

2839da0

feat: added class doctring to env.

3474efa

feat: import jigsaw actor critic network.

6378477

wip: work on actor critic networks.

478b504

chore: better variable naming

316e733

chore: variable renaming in jigsaw networks.

4e35b86

chore: variable renaming in jigsaw networks.

b681f83

feat: jigsaw networks implemented.

ee87ec3

fix: fix action spec off by one.

8416969

feat: added jigsaw training config.

5868835

chore: minor fixes.

4a8caad

RuanJohn added the enhancement New feature or request label May 29, 2023

RuanJohn requested review from sash-a and DriesSmit May 29, 2023 15:32

RuanJohn self-assigned this May 29, 2023

sash-a reviewed May 29, 2023

View reviewed changes

docs/environments/jigsaw.md Outdated Show resolved Hide resolved

docs/environments/jigsaw.md Outdated Show resolved Hide resolved

docs/environments/jigsaw.md Outdated Show resolved Hide resolved

docs/environments/jigsaw.md Outdated Show resolved Hide resolved

clement-bonnet changed the title ~~Implement jigsaw env~~ feat(jigsaw): Implement the Jigsaw env May 29, 2023

DriesSmit reviewed May 29, 2023

View reviewed changes

jumanji/environments/packing/jigsaw/env.py Outdated Show resolved Hide resolved

jumanji/environments/packing/jigsaw/env.py Outdated Show resolved Hide resolved

chore: fix action space in docs.

d0aa02c

Co-authored-by: Sasha <reallysasha@gmail.com>

RuanJohn and others added 5 commits May 29, 2023 17:50

chore: docs action mask fix.

e876908

Co-authored-by: Sasha <reallysasha@gmail.com>

chore: action mask fix in docs.

57b6a81

Co-authored-by: Sasha <reallysasha@gmail.com>

chore: action mask fix in docs.

0267299

Co-authored-by: Sasha <reallysasha@gmail.com>

chore: indent docstrings.

6adfaf3

Merge branch 'main' into 143-implement-jigsaw-env

6206819

DriesSmit reviewed May 30, 2023

View reviewed changes

fix: action mask indexing bugfix.

b344add

sash-a reviewed May 30, 2023

View reviewed changes

djbyrne reviewed May 30, 2023

View reviewed changes

sash-a requested changes May 30, 2023

View reviewed changes

RuanJohn closed this May 30, 2023

clement-bonnet deleted the 143-implement-jigsaw-env branch June 2, 2023 08:03

RuanJohn mentioned this pull request Jul 20, 2023

feat: FlatPack environment #188

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(jigsaw): Implement the Jigsaw env #147

feat(jigsaw): Implement the Jigsaw env #147

RuanJohn commented May 29, 2023

CLAassistant commented May 29, 2023 •

edited

Loading

sash-a left a comment

DriesSmit left a comment

DriesSmit May 30, 2023

sash-a left a comment

sash-a May 30, 2023

sash-a May 30, 2023

djbyrne May 30, 2023 •

edited

Loading

sash-a left a comment

sash-a May 30, 2023

sash-a May 30, 2023

RuanJohn commented May 30, 2023

		from jumanji.environments.packing.jigsaw.types import State


		class RewardFn(abc.ABC):

	grid_mask_piece = self._get_ones_like_expanded_piece(grid_piece=grid_piece)
	grid_mask_piece = grid_piece == piece_idx

	self._get_ones_like_expanded_piece, in_axes=(0)
	lambda x: x != 0, in_axes=(0)

feat(jigsaw): Implement the Jigsaw env #147

feat(jigsaw): Implement the Jigsaw env #147

Conversation

RuanJohn commented May 29, 2023

CLAassistant commented May 29, 2023 • edited Loading

sash-a left a comment

Choose a reason for hiding this comment

DriesSmit left a comment

Choose a reason for hiding this comment

DriesSmit May 30, 2023

Choose a reason for hiding this comment

sash-a left a comment

Choose a reason for hiding this comment

sash-a May 30, 2023

Choose a reason for hiding this comment

sash-a May 30, 2023

Choose a reason for hiding this comment

djbyrne May 30, 2023 • edited Loading

Choose a reason for hiding this comment

sash-a left a comment

Choose a reason for hiding this comment

sash-a May 30, 2023

Choose a reason for hiding this comment

sash-a May 30, 2023

Choose a reason for hiding this comment

RuanJohn commented May 30, 2023

CLAassistant commented May 29, 2023 •

edited

Loading

djbyrne May 30, 2023 •

edited

Loading