Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A - Feat/machine learning #422

Open
wants to merge 321 commits into
base: dev
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 250 commits
Commits
Show all changes
321 commits
Select commit Hold shift + click to select a range
ac4effa
feat(machine_learning): kwargs_dict becomes kwargs
Tanguylo Dec 2, 2022
e524c92
Merge branch 'dev' into feat/machine_learning
Tanguylo Dec 2, 2022
c39f883
Merge branch 'feat/machine_learning' of github.com:Dessia-tech/dessia…
Tanguylo Dec 2, 2022
80d58c4
feat(machine_learning): remove BaseTree fit and fit_predict method wh…
Tanguylo Dec 2, 2022
b166b1a
feat(machine_learning): docstring in basescaler and test for pydocsty…
Tanguylo Dec 2, 2022
a0a306c
feat(machine_learning): remove blank line in cluster, does this push …
Tanguylo Dec 2, 2022
474676a
feat(machine_learning): new test for blank line period
Tanguylo Dec 2, 2022
8552314
feat(machine_learning): went back to initial cluster file which will …
Tanguylo Dec 2, 2022
44ea215
feat(machine_learning): docstrings for basescaler
Tanguylo Dec 2, 2022
02f2611
feat(machine_learning): finish with docstring in scalers
Tanguylo Dec 2, 2022
ff2a80c
feat(machine_learning): docstrings for labelbinarizer and change mode…
Tanguylo Dec 2, 2022
0964ee5
feat(machine_learning): add output typings on scalers and basemodel a…
Tanguylo Dec 2, 2022
5be6bf5
feat(machine_learning): remove hasattr(scaler, attr) condition and be…
Tanguylo Dec 2, 2022
ba526ac
feat(machine_learning): modify code according to some review comments
Tanguylo Dec 2, 2022
424f067
feat(machine_learning): change the way kwargs are built
Tanguylo Dec 2, 2022
a34ec92
feat(machine_learning): change the way to instantiate dessia scaler f…
Tanguylo Dec 2, 2022
3f7caa0
feat(machine_learning): ridge docstrings
Tanguylo Dec 5, 2022
8c04011
feat(machine_learning): add linearregression with its doc to have a f…
Tanguylo Dec 5, 2022
5eb481c
feat(machine_learning): docstring for basetree
Tanguylo Dec 5, 2022
221dd29
feat(machine_learning): docstrings for trees
Tanguylo Dec 5, 2022
d8fd199
feat(machine_learning): handles multiclass outputs in decisiontreecla…
Tanguylo Dec 5, 2022
42db473
feat(machine_learning): docstrings for randomforest
Tanguylo Dec 5, 2022
6c5275c
feat(machine_learning): docstrings for svm and some code consistency …
Tanguylo Dec 5, 2022
03a8ddd
feat(machine_learning): remove Base prefix
Tanguylo Dec 5, 2022
72fdb9b
feat(machine_learning): code clean and add LinearModel object to sepa…
Tanguylo Dec 5, 2022
a3b65d1
feat(machine_learning): end with doc and code consistency and cleaning
Tanguylo Dec 5, 2022
e491ed0
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Dec 5, 2022
a7b821c
feat(machine_learning): change default value for list
Tanguylo Dec 5, 2022
e0fa730
feat(machine_learning): finally fullfil pylint and pydocstyle require…
Tanguylo Dec 5, 2022
c51abf9
Merge branch 'fix/doc_dataset' into feat/machine_learning
Tanguylo Dec 5, 2022
c5e37b4
Merge remote-tracking branch 'origin/fix/doc_dataset' into feat/machi…
Tanguylo Dec 5, 2022
1d3f269
Merge remote-tracking branch 'origin/fix/doc_dataset' into feat/machi…
Tanguylo Dec 5, 2022
c69c173
feat(machine_learning): rewrite some docstrings
Tanguylo Dec 6, 2022
2097783
Merge remote-tracking branch 'origin/fix/doc_dataset' into feat/machi…
Tanguylo Dec 6, 2022
db3bce4
feat(machine_learning): simplify types and remove lines in docstrings
Tanguylo Dec 6, 2022
ad951e4
feat(machine_learning): rewrite some tests
Tanguylo Dec 6, 2022
6ebe012
feat(machine_learning): change test script and score method of decisi…
Tanguylo Dec 6, 2022
444c44b
feat(machine_learning): change place of function in modeling
Tanguylo Dec 6, 2022
360a1e0
Merge remote-tracking branch 'origin/fix/doc_dataset' into feat/machi…
Tanguylo Dec 7, 2022
42f1fe2
fet(machine_learning): train test split matrix
Tanguylo Dec 7, 2022
61a13d7
feat(machine_learning): rename files
Tanguylo Dec 7, 2022
210a74c
feat(machine_learning): write test train split
Tanguylo Dec 7, 2022
b42d2ed
Merge remote-tracking branch 'origin/fix/doc_dataset' into feat/machi…
Tanguylo Dec 7, 2022
4ae9f51
feat(machine_learning): nice train test split method
Tanguylo Dec 7, 2022
34372dc
Merge remote-tracking branch 'origin/fix/doc_dataset' into feat/machi…
Tanguylo Dec 7, 2022
8f34850
feat(machine_learning): change name of test file in ci_test
Tanguylo Dec 7, 2022
719b35d
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Dec 7, 2022
7f89ee9
feat(machine_learning): add init_for_modeler method in models object …
Tanguylo Dec 7, 2022
4952eb8
feat(machine_learning): add scaler stuff in modeler and write fit method
Tanguylo Dec 7, 2022
0cb93c9
feat(machine_learning): change name generation for scaler in modeler
Tanguylo Dec 8, 2022
d33013a
feat(machine_learning): add methode to set scaler name
Tanguylo Dec 8, 2022
ca8d6e7
feat(machine_learning): replace list(list(float)) with matrix type
Tanguylo Dec 8, 2022
b60c095
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Dec 8, 2022
23f3249
feat(machine_learning): fit methods for dataset
Tanguylo Dec 12, 2022
11a685e
feat(machine_learning): fit predict and fit predict method both for d…
Tanguylo Dec 12, 2022
ca540ca
feat(machine_learning): validation plot with y=x implemented in model…
Tanguylo Dec 12, 2022
23ca7b3
feat(machine_learning): add method to fit validate model, but seems u…
Tanguylo Dec 12, 2022
0c9f464
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Dec 12, 2022
fe87ce3
feat(machine_learning): cross_validation for modeler
Tanguylo Dec 13, 2022
1917ff2
feat(machine_learning): code improvement and consistency for cross_va…
Tanguylo Dec 14, 2022
05c438e
feat(machine_learning): private fit, predict and fit_predict method t…
Tanguylo Dec 14, 2022
674fdce
feat(machine_learning): code cleaning, some work remaining
Tanguylo Dec 14, 2022
b0a4fdd
Merge branch 'dev' into chore/cluster_coverage
masfaraud Dec 15, 2022
ad6f2c4
feat(machine_learning): code improvements
Tanguylo Dec 15, 2022
a27f2e7
feat(machine_earning): change to_points to to_val_points
Tanguylo Dec 15, 2022
3968536
feat(machine_learning): code cleaning and better call to methods and …
Tanguylo Dec 16, 2022
35b8fcd
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Dec 16, 2022
833e002
feat(machine_learning): remove a function and write doc
Tanguylo Dec 16, 2022
e15f3ba
feat(machine_learning): write doc
Tanguylo Dec 16, 2022
fed1362
feat(machine_learning): write some doc
Tanguylo Dec 16, 2022
4db2465
feat(machine_learning): fix pylint errors
Tanguylo Dec 16, 2022
b61ef31
feat(machine_learning): fix pylint errors and change pylint
Tanguylo Dec 16, 2022
e3c84c9
feat(machine_learning): add test file
Tanguylo Dec 16, 2022
23994ff
feat(machine_learning): add class ModelerValidation for better readab…
Tanguylo Dec 16, 2022
3c5cf5c
feat(machine_learning): some hard work is still required to get an ea…
Tanguylo Dec 16, 2022
28b2747
feat(machine_learning): add ValidationData, ModelValidation and Cross…
Tanguylo Dec 19, 2022
6aa9a2a
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Dec 19, 2022
0f3ec23
feat(machine_learning): rewrite cross_validation method of modeler an…
Tanguylo Dec 19, 2022
e68b631
feat(machine_learning): remove unused imports
Tanguylo Dec 19, 2022
ea9aef5
feat(machine_learning): fix a train test prediction mistake that gave…
Tanguylo Dec 19, 2022
bca95c5
feat(machine_learning): change order of display between train and tes…
Tanguylo Dec 19, 2022
e38b67b
feat(machine_learning): fix mono output error for RF and MLP (needs a…
Tanguylo Dec 19, 2022
43d50b5
feat(machine_learning): fix mono element in matrix for scaler and plo…
Tanguylo Dec 19, 2022
b7ca111
feat(machine_learning): change max value for plot cross validation
Tanguylo Dec 19, 2022
d25e707
feat(machine_learning): change methods of class for cleaner code
Tanguylo Dec 20, 2022
17a93b6
feat(machine_learning): add and fix doc, change some code
Tanguylo Dec 20, 2022
05bf37b
perf: caching argspec
Dec 20, 2022
4281362
Merge branch 'dev' into chore/cluster_coverage
GhislainJ Dec 20, 2022
1513113
feat(machine_learning): add attribute params to model for better hand…
Tanguylo Dec 20, 2022
5bfcf16
feat(machine_learning): take into account params attribute in models …
Tanguylo Dec 20, 2022
7388855
feat(machine_learning): remove default value for hidden layer sizes
Tanguylo Dec 20, 2022
06caa7f
feat(machine_learning): add mini max functions, their tests and doc f…
Tanguylo Dec 21, 2022
62eca93
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Dec 21, 2022
1fb37a2
feat(machine_learning): tried to set subplots sizes in multiplot but …
Tanguylo Dec 21, 2022
3eb8527
feat(machine_learning): guarantee and visual test that train data is …
Tanguylo Dec 21, 2022
93cd17a
Merge remote-tracking branch 'origin/master' into feat/machine_learning
Tanguylo Dec 22, 2022
57d00c7
feat(machine_learning): make new classes appearing in platform
Tanguylo Dec 22, 2022
b068f3c
Merge remote-tracking branch 'origin/master' into chore/cluster_coverage
Tanguylo Dec 22, 2022
a87f5dc
chore(cluster): conflict resovle
Tanguylo Dec 22, 2022
9300bca
chore(cmuster_coverage): write doc, fix doc
Tanguylo Dec 22, 2022
9432db6
chpre(cluster_coverage): fix pylint error
Tanguylo Dec 22, 2022
eb8ffd2
chore(cluster_coverage): decrease thresholds of pydocstyle
Tanguylo Dec 22, 2022
cc34fe9
feat(machine_learning): code clean
Tanguylo Dec 22, 2022
4355b31
feat(machine_learning): fix dtclassifier
Tanguylo Dec 23, 2022
3769c81
feat(machine_learning): try to handle multiclass multioutput labels f…
Tanguylo Dec 23, 2022
7fc6027
feat(machine_learning): remove rows in setup for platform install pro…
Tanguylo Dec 23, 2022
746fd14
feat(machine_learning): code cleaning, epic fail to classify multilab…
Tanguylo Dec 23, 2022
4319ea2
feat(machine_learning): fix a bug and fix pylint errors
Tanguylo Dec 23, 2022
9909e16
Merge remote-tracking branch 'origin/chore/cluster_coverage' into fea…
Tanguylo Dec 23, 2022
2110742
feat(machine_learning): increase min coverage note
Tanguylo Dec 23, 2022
dc2a892
feat(machine_learning): try import plot_data to avoid bug on plateform
Tanguylo Dec 23, 2022
daa9910
feat(machine_learning): modeler standalone to True
Tanguylo Dec 23, 2022
4c538e7
feat(machine_learning): add a standalone
Tanguylo Dec 23, 2022
8eac895
chore(cluster_cov): change doc
Tanguylo Dec 23, 2022
db1220a
chore(cluster_cov): fix pydoc error
Tanguylo Dec 23, 2022
9fec4b0
fix: pylint
Jan 2, 2023
b009e9d
Merge branch 'dev' into perf_cache_argspec
Jan 2, 2023
b4d9137
Merge branch 'dev' into perf_cache_argspec
Jan 2, 2023
9ae998d
fix: pydocstyle
Jan 2, 2023
b930e36
debug
Jan 2, 2023
aa7f2af
fix: entrypoint
Jan 2, 2023
96827e3
fix: drone
Jan 2, 2023
da48e68
fix: drone
Jan 2, 2023
98f3cfd
fix: chagelog
Jan 2, 2023
a62531b
Merge remote-tracking branch 'origin/dev' into chore/cluster_coverage
Tanguylo Jan 3, 2023
19415f1
Merge remote-tracking branch 'origin/chore/cluster_coverage' into fea…
Tanguylo Jan 3, 2023
fb07a53
chore(cluster): resolve conflicts
Tanguylo Jan 3, 2023
1dfb59d
chore(cluster_coverage): solve pydocstyles errors
Tanguylo Jan 3, 2023
74cd4bc
chore(cluster_coverage): change pydocstyle threshold
Tanguylo Jan 3, 2023
0c79965
chore(cluster_coverage): fix diff between dev and this
Tanguylo Jan 3, 2023
2515fb4
chore(cluster_coverage): fix newline error
Tanguylo Jan 3, 2023
bea273c
Merge remote-tracking branch 'origin/chore/cluster_coverage' into fea…
Tanguylo Jan 3, 2023
fb91ea2
chore(cluster_coverage): write changelog
Tanguylo Jan 3, 2023
14a2235
Merge remote-tracking branch 'origin/chore/cluster_coverage' into fea…
Tanguylo Jan 3, 2023
8d0e149
feat(machine_learning): add info to changelog
Tanguylo Jan 3, 2023
b1588b2
feat(machine_learning): fix pydocstyle errors
Tanguylo Jan 3, 2023
e18f926
fix: drone PR
Jan 3, 2023
89cd050
doc: add a docstring
Jan 3, 2023
2ab8d7a
doc: add docstring
Jan 3, 2023
6701a8b
fix: bash script
Jan 3, 2023
85e18aa
feat(machine_learning): begin with features methods
Tanguylo Jan 3, 2023
28fa064
chore(cluster_coverage): Undo deletion of utf8 and env commented line…
Tanguylo Jan 3, 2023
4290762
chore(cluster_coverage): add a forgotten utf 8 stuff
Tanguylo Jan 3, 2023
4f462e0
chore(cluster_coverage): undo utf8 removal
Tanguylo Jan 3, 2023
5ab3dba
Merge remote-tracking branch 'origin/perf_cache_argspec' into chore/c…
Tanguylo Jan 3, 2023
8a39cc8
chore(cluster_coverage): decrease min note for pydocstyle
Tanguylo Jan 3, 2023
8ea2eea
Merge remote-tracking branch 'origin/chore/cluster_coverage' into fea…
Tanguylo Jan 3, 2023
a7285e7
chore(cluster_coverage): update cluster metric from sklearn update
Tanguylo Jan 3, 2023
d194de7
Merge remote-tracking branch 'origin/chore/cluster_coverage' into fea…
Tanguylo Jan 3, 2023
45aab48
fix: bugfix on argspec cache
Jan 3, 2023
bd55b83
feat(machine_learning): features importances
Tanguylo Jan 3, 2023
5249405
Merge remote-tracking branch 'origin/perf_cache_argspec' into chore/c…
Tanguylo Jan 3, 2023
f0ecfec
Merge remote-tracking branch 'origin/perf_cache_argspec' into feat/ma…
Tanguylo Jan 3, 2023
3be4140
feat(machine_learning): begin with tests of features importances
Tanguylo Jan 3, 2023
7c334ab
Merge remote-tracking branch 'origin/dev' into chore/cluster_coverage
Tanguylo Jan 3, 2023
812b135
Merge remote-tracking branch 'origin/chore/cluster_coverage' into fea…
Tanguylo Jan 3, 2023
3735f88
Merge remote-tracking branch 'origin/dev' into chore/cluster_coverage
Tanguylo Jan 3, 2023
89ae063
Merge remote-tracking branch 'origin/chore/cluster_coverage' into fea…
Tanguylo Jan 3, 2023
35d1230
feat(machine_learning): begin with histogrm plot of features importances
Tanguylo Jan 3, 2023
c826724
feat(machine_learning): remove features importances stuff for another…
Tanguylo Jan 3, 2023
ee978d0
feat(machine_learning): answer review comment
Tanguylo Jan 4, 2023
ac5a5e2
feat(machine_learning): lighten docstrings
Tanguylo Jan 4, 2023
28059a0
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Jan 4, 2023
3a5f8f2
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Jan 5, 2023
8f4f2a9
feat(ML): add . to docstring
Tanguylo Jan 5, 2023
da1965d
feat(ML): add models to setup
Tanguylo Jan 6, 2023
bf9c0a2
feat(ML): try a new trick for importing models
Tanguylo Jan 6, 2023
5acc1b8
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Jan 9, 2023
40c1b81
feat(ML): fix merge mistake
Tanguylo Jan 9, 2023
4d74ba8
feat(machine_learning): change name of predicted outputs to predicted…
Tanguylo Jan 10, 2023
759145b
feat(machine_learning): add class modeleddataset for plotting predict…
Tanguylo Jan 10, 2023
fb393ee
feat(ML): remove trees for test on PF
Tanguylo Jan 10, 2023
2261aa6
feat(ML): trees are not the problem
Tanguylo Jan 10, 2023
3099b6d
feat(ML): change name of models.py
Tanguylo Jan 10, 2023
71920a8
feat(ML): change train_test_split method of dataset for better manipu…
Tanguylo Jan 10, 2023
109baa8
feat(ML): fixes on from_predicted_dataset
Tanguylo Jan 10, 2023
861ba21
feat(ML): fix Sample stuff in modeleddataset
Tanguylo Jan 10, 2023
c7246a6
feat(ML): fix bug
Tanguylo Jan 10, 2023
3ae7f33
feat(ML): fix Sample stuff in modeleddataset third
Tanguylo Jan 10, 2023
28919b9
feat(ML): fix Sample stuff in modeleddataset fourth
Tanguylo Jan 10, 2023
c8e27a6
feat(ML): fix Sample stuff in modeleddataset fifth
Tanguylo Jan 10, 2023
31f1169
Merge remote-tracking branch 'origin/fix/refpath_dataset' into feat/m…
Tanguylo Jan 10, 2023
ebbce9b
feat(ML): fix pylint errors
Tanguylo Jan 10, 2023
2f228ce
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Jan 13, 2023
b1ff74b
feat(ML): add a wrapping method to fit validate a modeler and predict…
Tanguylo Jan 13, 2023
1af69cb
feat(ML): fix pylint
Tanguylo Jan 13, 2023
e5d4df8
feat(ML): pylint
Tanguylo Jan 13, 2023
2ac2d3a
feat(ML): pylint doc
Tanguylo Jan 13, 2023
a69b12e
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Jan 23, 2023
0021571
feat(ML): resolve conflict in pylint
Tanguylo Jan 23, 2023
67cdc35
feat(ML): repair plot of modeler object with reference path
Tanguylo Jan 23, 2023
0837b6c
Merge remote-tracking branch 'origin/fix/refpath_dataset' into feat/m…
Tanguylo Jan 23, 2023
7008d19
feat(ML): repair plot_data method with new reference path handling
Tanguylo Jan 23, 2023
f629620
Merge remote-tracking branch 'origin/fix/refpath_dataset' into feat/m…
Tanguylo Jan 23, 2023
aeec50d
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Jan 23, 2023
ce18fd1
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Feb 13, 2023
436520a
feat(ML): solve pylint issues
Tanguylo Feb 13, 2023
0e925c4
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Feb 13, 2023
c2c3252
feat(ML): fix pylint
Tanguylo Feb 13, 2023
08d7dde
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Feb 15, 2023
165d0e6
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Feb 22, 2023
f6b2fbd
feat(ML): fix pylint
Tanguylo Feb 22, 2023
443c62c
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Feb 22, 2023
e4f0ef3
feat(ML): fix pylint
Tanguylo Feb 22, 2023
5e3e6d5
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Feb 22, 2023
47e6773
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Mar 1, 2023
01477ba
feat(ML): fix some spelling
Tanguylo Mar 1, 2023
59face1
feat(ML): try to put in commented code to avoid spell checks
Tanguylo Mar 1, 2023
34483ee
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Mar 1, 2023
7fb153d
feat(ML): fix some spelling
Tanguylo Mar 1, 2023
662af87
feat(ML): some changes in spellings
Tanguylo Mar 1, 2023
9025627
feat(ML): remove word params
Tanguylo Mar 1, 2023
2bb8b63
feat(ML): fix pylint
Tanguylo Mar 1, 2023
9654376
feat(ML): fix one line docstrings
Tanguylo Mar 1, 2023
a2d9bcd
feat(ML): add line after class docstrings
Tanguylo Mar 1, 2023
009267a
feat(ML): lower spelling threshold to catch new spelling errors
Tanguylo Mar 1, 2023
248b071
feat(ML): remove more spelling errors and change some words
Tanguylo Mar 1, 2023
f60f17f
feat(ML): pylint
Tanguylo Mar 1, 2023
1902088
feat(ML): move scripts files
Tanguylo Mar 1, 2023
3c99d83
feat(ML): this PR shall not be blocked for coverage reasons since it …
Tanguylo Mar 1, 2023
b3dd9f9
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Mar 29, 2023
482329e
feat(ML): solve pylint conflict
Tanguylo Mar 29, 2023
872f96d
feat(ML): first changes after review
Tanguylo Mar 29, 2023
b3d313a
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo May 31, 2023
13bafd9
feat(ML): change file organization
Tanguylo May 31, 2023
3af4a02
feat(ML): add files
Tanguylo May 31, 2023
8398008
feat(ML): fix bug in str
Tanguylo May 31, 2023
a06c1d4
feat(ML): new tests for str in clustering.py
Tanguylo May 31, 2023
9f6326c
feat(ML): quite large refactor for better uses and write required did…
Tanguylo Jun 2, 2023
6ab0e2e
feat(ML): full refactor of modeler for a better UX and write scripts …
Tanguylo Jun 2, 2023
4140a27
feat(ML): remove types in docstring when redundant with arguments types
Tanguylo Jun 2, 2023
2724b11
feat(ML): last review comment, but the structure of input, output, preds
Tanguylo Jun 2, 2023
042d97e
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Jun 2, 2023
330f2c0
feat(ML): fix pylint errorsé
Tanguylo Jun 5, 2023
8eae28a
feat(ML): add object to handle input output prediction matrices in on…
Tanguylo Jun 5, 2023
2e3a4d3
feat(ML): uncomment ci_scripts code
Tanguylo Jun 5, 2023
c994e61
feat(ML): drone stuff
Tanguylo Jun 5, 2023
049eeb6
feat(ML): changelog
Tanguylo Jun 5, 2023
19d4e54
feat(ML): pylint and docstyleé
Tanguylo Jun 5, 2023
d326173
feat(ML): increase coverage note
Tanguylo Jun 5, 2023
efaaf0d
feat(ML): change assertion rules for scripts in learning models
Tanguylo Jun 5, 2023
2d8d512
feat(ML): set allowed methods (partial)
Tanguylo Jun 5, 2023
d63f0ac
feat(ML): set more allowed methods
Tanguylo Jun 5, 2023
cd233cc
feat(ML): remove types in docstring and increase protected_access note
Tanguylo Jun 5, 2023
2d86ff1
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Jan 31, 2025
93bb208
feat(ml): fix tree deserialization error
Tanguylo Jan 31, 2025
b4683da
fat(ml): fix typing error
Tanguylo Jan 31, 2025
5ccc78c
feat(ml): change typings
Tanguylo Jan 31, 2025
fd022be
feat(ml): fast fixes for this branch to work with CDA applications
Tanguylo Jan 31, 2025
26d9614
Merge remote-tracking branch 'origin/dev' into feat/machine_learning
Tanguylo Feb 3, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
52 changes: 52 additions & 0 deletions .pyenchant_dessia.dict
Original file line number Diff line number Diff line change
@@ -1,50 +1,102 @@
argspecs
backend
BallTree
binarized
BinaryFile
bool
builtins
cKDTree
classmethod
cls
ClusteredDataset
CMA
config
csv
cvar
Cython
dataset
datatools
dbscan
DecisionTree
DecisionTreeClassifier
DecisionTreeRegressor
dectree
dendrogram
deserialization
deserialize
dessia
DessiaObject
dict
dicts
docstring
DOE
filepath
frobenius
frontend
ge
getitem
gini
gte
init
itertools
th
json
JSON
jsonschema
kernel
KMeans
kwargs
hyperparameter
hyperparameters
l1
l2
len
LinearModel
LinearRegression
linf
linspace
lt
Mahalanobis
matplotlib
Minkowski
MLPClassifier
MLPRegressor
multiclass
MultiLayerPerceptron
multioutput
nbv
ndarray
networkx
numpy
orjson
Platt
poly
RandomForest
RandomForestClassifier
RandomForestRegressor
rbf
Ridge
rtype
schemas
scipy
scikit
serializable
sigmoid
sklearn
SupportVectorClassifier
SupportVectorMachine
SupportVectorRegressor
str
StringFile
Stringify
submodule
subobjects
SVD
tol
traceback
uuid
uuids
visjs
volmdlr
XLS
XLSX
2 changes: 1 addition & 1 deletion .pylintrc
Original file line number Diff line number Diff line change
Expand Up @@ -352,7 +352,7 @@ function-naming-style=snake_case

# Good variable names which should always be accepted, separated by a comma.
good-names=i,j,k,n,ie,x,xi,x0,v1,v2,
ax,
ax,_probA,_probB,C,
_

# Good variable names regexes, separated by a comma. If names match any regex,
Expand Down
9 changes: 9 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -127,6 +127,14 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- code_pydocstyle.py checks daily instead of weekly
- Add a time decrease effect for pylint

### Added
- modeling.py file in datatools which contains:
* StandardScaler, IdentityScaler
* Regressions: SVR, RandomForest, MLP, Ridge, DecisionTree
* Classifications: SVC, RandomForest, MLP, DecisionTree
- modeler.py file in datatools which allows to :
* Handle full machine learning modelization process (scale data, train model and validate), with plots

### Performance

- Conform doc for many parts of dessia_common
Expand Down Expand Up @@ -161,6 +169,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- Fix path deepth when dict misses keys

### Changed

- Refactor copy_pipes and nbv checking

### Performance
Expand Down
8 changes: 4 additions & 4 deletions code_pylint.py
Original file line number Diff line number Diff line change
Expand Up @@ -28,20 +28,20 @@
MAX_ERROR_BY_TYPE = {
"wrong-spelling-in-docstring": 196,
"wrong-spelling-in-comment": 26,
"protected-access": 46, # Highly dependant on our "private" conventions. Keeps getting raised
"protected-access": 70, # Highly dependant on our "private" conventions. Keeps getting raised
"arguments-differ": 1,
"too-many-locals": 10, # Reduce by dropping vectored objects
"too-many-branches": 13, # Huge refactor needed. Will be reduced by schema refactor
"unused-argument": 6, # Some abstract functions have unused arguments (plot_data). Hence cannot decrease
"cyclic-import": 2, # Still work to do on Specific based DessiaObject
"too-many-arguments": 22, # Huge refactor needed
"too-many-arguments": 55, # Huge refactor needed
"too-few-public-methods": 4, # Abstract classes (Errors, Checks,...)
"too-many-return-statements": 9, # Huge refactor needed. Will be reduced by schema refactor
"import-outside-toplevel": 4, # TODO : will reduced in a future work (when tests are ready)
"too-many-instance-attributes": 7, # Huge refactor needed (workflow, etc...)
"too-many-instance-attributes": 9, # Huge refactor needed (workflow, etc...)
"broad-exception-caught": 12, # Necessary in order not to raise non critical errors. Will be reduced by schema refactor
"bare-except": 1, # Necessary in order not to raise non critical errors. Will be reduced by schema refactor
"too-many-public-methods": 2, # Try to lower by splitting DessiaObject and Workflow
"too-many-public-methods": 3, # Try to lower by splitting DessiaObject and Workflow
"too-many-statements": 2, # Will be solved by schema refactor and jsonchema removal
"undefined-loop-variable": 1, # Fearing to break the code by solving it
"attribute-defined-outside-init": 3, # For test purposes
Expand Down
2 changes: 1 addition & 1 deletion coverage.py
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@
import json

MIN_MODULE_COVERAGE = 60.
MIN_GLOBAL_COVERAGE = 80.0
MIN_GLOBAL_COVERAGE = 82.4

RATCHET_COVERAGE = 3.

Expand Down
12 changes: 6 additions & 6 deletions dessia_common/core.py
Original file line number Diff line number Diff line change
Expand Up @@ -630,7 +630,7 @@ def save_export_to_file(self, selector: str, filepath: str):
raise ValueError(f'Export selector not found: {selector}')

def to_vector(self):
""" Compute vector from object. """
""" Get all values of specified attributes into a list of values (vector). """
vectored_objects = []
for feature in self.vector_features():
vectored_objects.append(getattr(self, feature.lower()))
Expand All @@ -642,7 +642,7 @@ def to_vector(self):

@classmethod
def vector_features(cls):
""" Get a list of vector features, or generate a default one. """
""" Get the list of attributes specified in _vector_features attribute (in order to build a Dataset). """
if cls._vector_features is None:
return list(set(get_attribute_names(cls)).difference(get_attribute_names(DessiaObject)))
return cls._vector_features
Expand Down Expand Up @@ -1009,9 +1009,9 @@ def from_filters_list(cls, filters: List[DessiaFilter], logical_operator: str =
@staticmethod
def combine_booleans_lists(booleans_lists: List[List[bool]], logical_operator: str = "and"):
"""
Combine a list of `n` booleans indexes with the logical operator into a simple boolean index.
Combine a list of `n` boolean indexes with the logical operator into a simple boolean index.

:param booleans_lists: List of `n` booleans indexes
:param booleans_lists: List of `n` boolean indexes
:type booleans_lists: List[List[bool]]

:param logical_operator: Logical operator to combine filters (`'or'`, `'and'` or `'xor'`)
Expand All @@ -1038,12 +1038,12 @@ def combine_booleans_lists(booleans_lists: List[List[bool]], logical_operator: s

def get_booleans_index(self, dobjects_list: List[DessiaObject]):
"""
Compute all the filters of `self.filters` on `dobjects_list` and returns a booleans index of `dobjects_list`.
Compute all the filters of `self.filters` on `dobjects_list` and returns a boolean index of `dobjects_list`.

:param dobject_list: List of data to filter
:type dobject_list: List[DessiaObject]

:return: A `booleans index` of `dobjects_list` of the list of data to filter (`dobjects_list`)
:return: A boolean index of `dobjects_list` of the list of data to filter (`dobjects_list`)
:rtype: List[bool]

:Examples:
Expand Down
2 changes: 1 addition & 1 deletion dessia_common/datatools/__init__.py
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
""" __init__ method for datatools module """
""" __init__ method for datatools module. """

import warnings
from typing import List
Expand Down
Loading