Predict running workflows' actions. Currently running at vocms0116.
Project under CMS CompOps T&I
After environments set up and necessary configuration pieces added, set up a crontab task to run periodically main.py
.
To start the frontend instance,
cd web/
./quickStart.sh # this starts Flask basic server on port 8020 (subject to change)
A few more configuration files are needed to get it rolling.
-
config/config.yml
for connections to UNIFIED DB, MySQL DB and alert email sending.oracle: - *** # username - *** # password - *** # db name mysql: - *** # username - *** # password - *** # db name alert_recipients: - XXX@YYYY.ZZ
-
config/credential.yml
forstompAMQ
to produce docs and authentication.producer: toolsandint-workflows-collector topic: /topic/cms.toolsandint.workflowsinfo cert: PATH_TO_CERT_FILE (***.pem) key: PATH_TO_KEY_FILE (***.rsa) jiracookie: PATH_TO_COOKIE_FILE hostport: host: XXXX.cern.ch port: XXXXX
-
models/xgb_optimized.model
for running workflow inference. -
OSDroidDB
is the local MySQL database storing workflow prediction history, labels and short-term document archive. Three tables need to be created for each.PredictionHistory
CREATE TABLE IF NOT EXISTS OSDroidDB.PredictionHistory ( hid BIGINT NOT NULL AUTO_INCREMENT PRIMARY KEY, name VARCHAR(255) NOT NULL, good FLOAT, acdc FLOAT, resubmit FLOAT, timestamp TIMESTAMP );
LabelArchive
CREATE TABLE IF NOT EXISTS OSDroidDB.LabelArchive ( name VARCHAR(255) NOT NULL PRIMARY KEY, label INT );
DocsOneMonthArchive
CREATE TABLE IF NOT EXISTS OSDroidDB.DocsOneMonthArchive ( id INT NOT NULL AUTO_INCREMENT PRIMARY KEY, name VARCHAR(255) NOT NULL, document LONGTEXT, timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP );
- UNIFIED DB
- wmstats server
- couchdb/acdc server
- ReqMgr2
stompAMQ
(wrapped inCMSMonitoring
) -> HDFS /CERN MONIT infrastructure