-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Need a Model CI #8903
Comments
LGTM. Just to add more details
|
How about we build a Docker image every time run the CI, the docker image is named like |
Sounds Good. Not sure how large is a image. Maybe we can keep the last N good ones and the recent bad one |
Basic lower layers of the image can stay unchanged including the dataset contents. Changed files only contain |
how about a dev docker, clone any commitid, and run the test. So that we can test any commitid. No need to compile or store images. The only varients is commitid. |
Thanks for this important effort! I think it's better to test with prod image (or whl), since prod image is the image that our user uses. It is possible that some required Python dependencies is installed on dev image but not installed on prod image. |
Agree, we have daily compiled whls, those need to be tested. @helinwang But according to
the ModelCI needs to test commit ids using a binary search, so there are two input sources to the ModelCI:
let's clarify the logic of ModelCI: function test_whl(some_whl)
function compile_whl_from_source(path)
function clone_commit_from_repo(gitpath)
function binary_search_bad_commit(commits) {
# ...
source_path = clone_commit_from_repo(commit)
whl_path = compile_whl_from_source(source_path)
status = test_whl(whl_path)
# ...
}
function main() {
today_released_whl = __download(...)
ok = test_whl(today_released_whl)
if (!ok) {
# raise an alarm
today_commits = __get_commits(repo, today)
bad_commit = binary_search_bad_commit(today_commits)
# raise an alarm in some way
}
} |
I think we can raise an alert immediately when !ok is detected. And then search for bad_commit. |
I think it's a great idea to binary search bad commit, but not sure it's worth it to spend time to develop and maintain the feature:
|
|
Get it. The ModelCi will keep pulling the latest code and test. |
您好,此issue在近一个月内暂无更新,我们将于今天内关闭。若在关闭后您仍需跟进提问,可重新开启此问题,我们将在24小时内回复您。因关闭带来的不便我们深表歉意,请您谅解~感谢您对PaddlePaddle的支持! |
Model CI
The users occasionally found a negligible performance or precision difference between different Paddle versions. Though we have unit tests for each class and Travis-CI to ensures the precision of each operator, there is no any logic to ensure the model (a composition of several operators) works as reliable as the operators.
There are several conditions where an existing model will fail either in performance or precision:
The model-CI module is proposed to enhance the weaknesses above and track the overall performance and precision of the model granularity.
Module function
Inputs:
Outputs:
Indicators tracked
The text was updated successfully, but these errors were encountered: