This release switches from MLPrimitives
to ml-stars
.
Moreover, we remove all pipelines using deep feature synthesis.
- Update demo bucket - Issue #76 by @sarahmish
- Remove
dfs
based pipelines - Issue #73 by @sarahmish - Move from
MLPrimitives
toml-stars
- Issue #72 by @sarahmish
This release features a reorganization and renaming of Draco
pipelines. In addtion,
we update some of the dependencies for general housekeeping.
- First release on
draco-ml
PyPI
This release increases the supported version of python to 3.8
and also includes changes
in the installation requirements, where pandas
and scikit-optimize
packages have
been updated to support higher versions. This changes come together with the newer versions
of MLBlocks
and MLPrimitives
.
- Fix
run_benchmark
generating properly theinit_hyperparameters
for the pipelines. - New
FPR
metric. - New
roc_auc_score
metric. - Multiple benchmarking metrics allowed.
- Multiple
tpr
orthreshold
values allowed for the benchmark.
- Fix
mkdir
when exporting tocsv
file the benchmark results. - Intermediate steps for the pipelines with demo notebooks for each pipeline.
- Issue #50: Expose partial outputs and executions in the
GreenGuardPipeline
.
With this release we include:
run_benchmark
: A function within the modulebenchmark
that allows the user to evaluate templates against problems with different window size and resample rules.summarize_results
: A function that given acsv
file generates axlsx
file with a summary tab and a detailed tab with the results fromrun_benchmark
.
- Fix dependency errors
- Added benchmarking module.
- Added github actions.
- Issue #27: Cache Splits pre-processed data on disk
With this release we give the possibility to the user to specify more than one template when
creating a GreenGuardPipeline. When the tune
method of this is called, an instance of BTBSession
is returned and it is in charge of selecting the templates and tuning their hyperparameters until
achieving the best pipeline.
- Resample by filename inside the
CSVLoader
to avoid oversampling of data that will not be used. - Select targets now allows them to be equal.
- Fixed the csv filename format.
- Upgraded to BTB.
- Issue #33: Wrong default datetime format
- Issue #35: Select targets is too strict
- Issue #36: resample by filename inside csvloader
- Issue #39: Upgrade BTB
- Issue #41: Fix CSV filename format
First stable release:
- efficient data loading and preprocessing
- initial collection of dfs and lstm based pipelines
- optimized pipeline tuning
- documentation and tutorials
- First release on PyPI