bump deps

airboxlab · Jun 10, 2024 · 3282a79 · 3282a79
1 parent b03a8f8
commit 3282a79
Show file tree

Hide file tree

Showing 5 changed files with 136 additions and 135 deletions.
diff --git a/README.md b/README.md
@@ -8,11 +8,11 @@
 
 ## What is HOPES?
 
-**HOPES** - HVAC Off-Policy Evaluation and Selection - is a Python package for evaluating and selecting RL-based
-control policies. It offers a set of estimators and tools to evaluate the performance of a target policy,
-compared to a set of baseline policies (characterized by an offline logged dataset), using off-policy evaluation
-techniques. It's particularly suited for the context of HVAC control, where the target policy is an RL-based controller
-and the baseline policies are rule-based controllers.
+**HOPES** - **H**VAC optimisation with **O**ff-**P**olicy **E**valuation and **S**election - is a Python package for
+evaluating and selecting RL-based control policies. It offers a set of estimators and tools to evaluate the performance
+of a target policy, compared to a set of baseline policies (characterized by an offline logged dataset), using
+off-policy evaluation techniques. It's particularly suited for the context of HVAC control, where the target policy is a
+n RL-based controller and the baseline policies are rule-based controllers.
 
 ## Installation
 

diff --git a/doc/source/index.rst b/doc/source/index.rst
@@ -14,7 +14,7 @@ Hopes
 What's in the box?
 ------------------
 
-**HOPES** - HVAC Off-policy Policy Evaluation and Selection - is a Python package for evaluating and selecting RL-based
+**HOPES**, which stands for **H**\ VAC optimisation with **O**\ ff **P**\ olicy **E**\ valuation and **S**\ election, is a Python package for evaluating and selecting RL-based
 control policies. It offers a set of estimators and tools to evaluate the performance of a target policy,
 compared to a baseline policy (characterized by an offline logged dataset), using off-policy evaluation
 techniques. It's particularly suited for the context of HVAC control, where the target policy is an RL-based controller