feat(sudoku): implement environment #125

Egiob · 2023-04-28T12:38:41Z

This PR contains all the changes related to the addition of the Sudoku environment. See below the principal stuff that needs to be discussed before merging (in my opinion).

Generator

Currently the environment uses a default DatabaseGenerator that resets to a new puzzle included in a pre-loaded database. These data are included under jumanji.environments.logic.sudoku.data. For now two databases are included: 1000_very_easy_puzzles.npy containing 1000 puzzles with >= 46 clues and 10000_ mixed_puzzles.npy containing 10000 puzzle of various difficulties. The exact number of puzzles may be changed so that (1) the "very-easy" dataset is indeed very-easy for A2C and (2) the "mixed" dataset contains enough diversity while not taking too much memory in the repo.

Versions

Two envs have been registered: Sudoku-v0 using by default the 10000_mixed_puzzles.npy database and Sudoku-very-easy-v0 using by default the 1000_very_easy_puzzles.npy database. For simplicity purpose, we propose to train the A2C agent on this latter environment.

Remaining actions:

Check that the training on the very-easy dataset meets expectations
Update Sudoku GIF and Image accordingly
Revert changes in notebooks

References:
This is the paper I used to define that >=46 clues was "very-easy".

CLAassistant · 2023-04-28T12:38:47Z

All committers have signed the CLA.

clement-bonnet

Thank you for your contribution. Overall, looks very good to me. I have a few comments and NIT suggestions.
We can merge once we have a training curve (working), I'm wondering if we can make the network equivariant to permutation of digits at least.
Please let me know if you have any questions.

README.md

docs/env_anim/sudoku.gif

docs/environments/sudoku.md

jumanji/environments/logic/sudoku/env.py

jumanji/environments/logic/sudoku/reward.py

jumanji/environments/logic/sudoku/viewer.py

jumanji/training/setup_train.py

jumanji/training/networks/sudoku/actor_critic.py

Co-authored-by: Clément Bonnet <56230714+clement-bonnet@users.noreply.github.com>

jumanji/environments/logic/sudoku/utils.py

docs/environments/sudoku.md

Co-authored-by: Tristan Kalloniatis <tristankalloniatis@gmail.com>

Co-authored-by: Clément Bonnet <56230714+clement-bonnet@users.noreply.github.com>

…i into feat/add-sudoku-environment

clement-bonnet

Thanks for the very nice contribution. I left a couple of nit comments. All good to me otherwise.

jumanji/environments/logic/sudoku/env.py

jumanji/environments/logic/sudoku/generator.py

jumanji/training/networks/sudoku/actor_critic.py

README.md

.pre-commit-config.yaml

Co-authored-by: Daniel <57721552+dluo96@users.noreply.github.com>

jumanji/environments/logic/sudoku/env.py

jumanji/environments/logic/sudoku/reward.py

jumanji/environments/logic/sudoku/utils_test.py

jumanji/environments/logic/sudoku/viewer.py

Co-authored-by: Tristan Kalloniatis <tristankalloniatis@gmail.com>

…i into feat/add-sudoku-environment

Egiob added 21 commits April 22, 2023 19:49

feat: add sudoku

5214c68

feat: make generator configurable and add puzzles data

1c139a4

fix: add sudoku conf

72330e6

fix: validate board

2593612

feat(sudoku): change default version

aef828c

feat(sudoku): update very easy databse

db3845c

feat(sudoku): update very easy data

521612a

wip: debug train

89299cd

wip: debug train

bc61304

wip: debug train

7541e4c

wip: debug train

5b41051

wip: debug train

4bc8677

wip: debug train

04c120a

wip: debug train

e060543

wip: debug train

d156a03

wip: debug train

d11dc09

wip: debug train

d341e17

test: add utils test

0bba8a6

docs: update sudoku versions

aedccb6

style: clean useless files

3248ef2

style: clean notebook

0260e82

Egiob added the enhancement New feature or request label Apr 28, 2023

Egiob requested a review from clement-bonnet April 28, 2023 12:38

Egiob self-assigned this Apr 28, 2023

fix: data not loadable

9352a4d

clement-bonnet changed the title ~~feat(sudoku): add environment~~ feat(sudoku): implement environment Apr 28, 2023

clement-bonnet reviewed May 3, 2023

View reviewed changes

Egiob and others added 2 commits May 5, 2023 15:31

wip: train invariant net

371f54f

docs: update README.md

06f4d1f

Co-authored-by: Clément Bonnet <56230714+clement-bonnet@users.noreply.github.com>

TristanKalloniatis reviewed May 25, 2023

View reviewed changes

jumanji/environments/logic/sudoku/utils.py Show resolved Hide resolved

Egiob and others added 2 commits May 30, 2023 15:26

style: address PR comments

61d35d3

Merge branch 'main' into feat/add-sudoku-environment

f5dd0fb

TristanKalloniatis reviewed May 31, 2023

View reviewed changes

docs/environments/sudoku.md Outdated Show resolved Hide resolved

Egiob and others added 5 commits May 31, 2023 10:38

Update docs/environments/sudoku.md

81e96dc

Co-authored-by: Tristan Kalloniatis <tristankalloniatis@gmail.com>

style: add type hint

e3e21ac

Co-authored-by: Clément Bonnet <56230714+clement-bonnet@users.noreply.github.com>

style: PR review

4af1f13

Merge branch 'feat/add-sudoku-environment' of github.com:Egiob/jumanj…

f6629d9

…i into feat/add-sudoku-environment

fix: path to sudoku data

3d83925

clement-bonnet previously approved these changes Jun 1, 2023

View reviewed changes

style: minor changes

090b630

Egiob dismissed clement-bonnet’s stale review via 090b630 June 1, 2023 08:40

dluo96 reviewed Jun 1, 2023

View reviewed changes

README.md Show resolved Hide resolved

dluo96 reviewed Jun 1, 2023

View reviewed changes

.pre-commit-config.yaml Outdated Show resolved Hide resolved

Egiob and others added 2 commits June 1, 2023 12:18

docs: add comment

873cb08

Co-authored-by: Daniel <57721552+dluo96@users.noreply.github.com>

Merge branch 'main' into feat/add-sudoku-environment

83e087f

clement-bonnet previously approved these changes Jun 1, 2023

View reviewed changes