Skip to content

Latest commit

 

History

History
17 lines (12 loc) · 1.14 KB

README.md

File metadata and controls

17 lines (12 loc) · 1.14 KB

Automatic Evluation

The evluation pipeline includes two parts: dialogue simulation and quality evaluation.

To evaluate different models, modify Nurse in eval_model_config. To use your custom dataset, update FILE_PATH and EMR_PATH in main.sh. The required data format can be found in SFMSS Data.

When comparing the performance of different models, we strongly recommend removing --sample in main.sh and running patient sampling first. This ensures the same patient simulation settings, making the results comparable. (Remember to update the FILE_PATH to the new path containing records with patient settings.)

bash sfmss/workflow/run_patient_sample.sh
bash main.sh

The Folder result contains all evaluation results presented in the paper, including both automatic and human evaluation results.