Enhance docs

ThomasMeissnerDS · Jan 13, 2024 · b63f74b · b63f74b
1 parent 2a46704
commit b63f74b
Show file tree

Hide file tree

Showing 2 changed files with 120 additions and 0 deletions.
diff --git a/README.md b/README.md
@@ -42,6 +42,7 @@ the full documentation [here](https://bluecast.readthedocs.io/en/latest/).
     * [Custom feature selection](#custom-feature-selection)
     * [Custom ML model](#custom-ml-model)
     * [Using the inbuilt ExperientTracker](#using-the-inbuilt-experienttracker)
+    * [Use Mlflow via custom ExperientTracker API](#use-mlflow-via-custom-experienttracker-api)
 * [Convenience features](#convenience-features)
 * [Code quality](#code-quality)
 * [Documentation](#documentation)
@@ -752,6 +753,65 @@ The experiment triggers whenever the `fit` or `fit_eval` methods of a BlueCast
 class instance are called (also within BlueCastCV). This means for custom
 models the tracker will not trigger automatically and has to be added manually.
 
+#### Use Mlflow via custom ExperientTracker API
+
+The inbuilt experiment tracker is handy to start with, however in production environments
+it might be required to send metrics to a Mlflow server or comparable solutions. BlueCast
+allows to pass a custom experiment tracker.
+
+```sh
+# instantiate and train BlueCast
+from bluecast.blueprints.cast import BlueCast
+from bluecast.cnfig.base_classes import BaseClassExperimentTracker
+
+class CustomExperimentTracker(BaseClassExperimentTracker):
+    """Base class for the experiment tracker.
+
+    Enforces the implementation of the add_results and retrieve_results_as_df methods.
+    """
+
+    @abstractmethod
+    def add_results(
+        self,
+        experiment_id: int,
+        score_category: Literal["simple_train_test_score", "cv_score", "oof_score"],
+        training_config: TrainingConfig,
+        model_parameters: Dict[Any, Any],
+        eval_scores: Union[float, int, None],
+        metric_used: str,
+        metric_higher_is_better: bool,
+    ) -> None:
+        """
+        Add results to the ExperimentTracker class.
+        """
+        pass # add Mlflow tracking i.e.
+
+    @abstractmethod
+    def retrieve_results_as_df(self) -> pd.DataFrame:
+        """
+        Retrieve results from the ExperimentTracker class
+        """
+        pass
+
+
+experiment_tracker = CustomExperimentTracker()
+
+automl = BlueCast(
+        class_problem="binary",
+        target_column="target",
+        experiment_tracker=experiment_tracker,
+    )
+
+automl.fit_eval(df_train, df_eval, y_eval, target_col="target")
+
+# access the experiment tracker
+tracker = automl.experiment_tracker
+
+# see all stored information as a Pandas DataFrame
+tracker_df = tracker.retrieve_results_as_df()
+```
+
+
 ## Convenience features
 
 Despite being a lightweight library, BlueCast also includes some convenience

diff --git a/docs/source/index.md b/docs/source/index.md
@@ -42,6 +42,7 @@ the full documentation [here](https://bluecast.readthedocs.io/en/latest/).
     * [Custom feature selection](#custom-feature-selection)
     * [Custom ML model](#custom-ml-model)
     * [Using the inbuilt ExperientTracker](#using-the-inbuilt-experienttracker)
+    * [Use Mlflow via custom ExperientTracker API](#use-mlflow-via-custom-experienttracker-api)
 * [Convenience features](#convenience-features)
 * [Code quality](#code-quality)
 * [Documentation](#documentation)
@@ -752,6 +753,65 @@ The experiment triggers whenever the `fit` or `fit_eval` methods of a BlueCast
 class instance are called (also within BlueCastCV). This means for custom
 models the tracker will not trigger automatically and has to be added manually.
 
+#### Use Mlflow via custom ExperientTracker API
+
+The inbuilt experiment tracker is handy to start with, however in production environments
+it might be required to send metrics to a Mlflow server or comparable solutions. BlueCast
+allows to pass a custom experiment tracker.
+
+```sh
+# instantiate and train BlueCast
+from bluecast.blueprints.cast import BlueCast
+from bluecast.cnfig.base_classes import BaseClassExperimentTracker
+
+class CustomExperimentTracker(BaseClassExperimentTracker):
+    """Base class for the experiment tracker.
+
+    Enforces the implementation of the add_results and retrieve_results_as_df methods.
+    """
+
+    @abstractmethod
+    def add_results(
+        self,
+        experiment_id: int,
+        score_category: Literal["simple_train_test_score", "cv_score", "oof_score"],
+        training_config: TrainingConfig,
+        model_parameters: Dict[Any, Any],
+        eval_scores: Union[float, int, None],
+        metric_used: str,
+        metric_higher_is_better: bool,
+    ) -> None:
+        """
+        Add results to the ExperimentTracker class.
+        """
+        pass # add Mlflow tracking i.e.
+
+    @abstractmethod
+    def retrieve_results_as_df(self) -> pd.DataFrame:
+        """
+        Retrieve results from the ExperimentTracker class
+        """
+        pass
+
+
+experiment_tracker = CustomExperimentTracker()
+
+automl = BlueCast(
+        class_problem="binary",
+        target_column="target",
+        experiment_tracker=experiment_tracker,
+    )
+
+automl.fit_eval(df_train, df_eval, y_eval, target_col="target")
+
+# access the experiment tracker
+tracker = automl.experiment_tracker
+
+# see all stored information as a Pandas DataFrame
+tracker_df = tracker.retrieve_results_as_df()
+```
+
+
 ## Convenience features
 
 Despite being a lightweight library, BlueCast also includes some convenience