Skip to content

Commit

Permalink
bump deps
Browse files Browse the repository at this point in the history
  • Loading branch information
antoine-galataud committed Jun 10, 2024
1 parent b03a8f8 commit 3282a79
Show file tree
Hide file tree
Showing 5 changed files with 136 additions and 135 deletions.
10 changes: 5 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,11 +8,11 @@

## What is HOPES?

**HOPES** - HVAC Off-Policy Evaluation and Selection - is a Python package for evaluating and selecting RL-based
control policies. It offers a set of estimators and tools to evaluate the performance of a target policy,
compared to a set of baseline policies (characterized by an offline logged dataset), using off-policy evaluation
techniques. It's particularly suited for the context of HVAC control, where the target policy is an RL-based controller
and the baseline policies are rule-based controllers.
**HOPES** - **H**VAC optimisation with **O**ff-**P**olicy **E**valuation and **S**election - is a Python package for
evaluating and selecting RL-based control policies. It offers a set of estimators and tools to evaluate the performance
of a target policy, compared to a set of baseline policies (characterized by an offline logged dataset), using
off-policy evaluation techniques. It's particularly suited for the context of HVAC control, where the target policy is a
n RL-based controller and the baseline policies are rule-based controllers.

## Installation

Expand Down
2 changes: 1 addition & 1 deletion doc/source/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@ Hopes
What's in the box?
------------------

**HOPES** - HVAC Off-policy Policy Evaluation and Selection - is a Python package for evaluating and selecting RL-based
**HOPES**, which stands for **H**\ VAC optimisation with **O**\ ff **P**\ olicy **E**\ valuation and **S**\ election, is a Python package for evaluating and selecting RL-based
control policies. It offers a set of estimators and tools to evaluate the performance of a target policy,
compared to a baseline policy (characterized by an offline logged dataset), using off-policy evaluation
techniques. It's particularly suited for the context of HVAC control, where the target policy is an RL-based controller
Expand Down
Loading

0 comments on commit 3282a79

Please sign in to comment.