Cross validation not working for RegexEntityExtractor with lookup tables #10448
Labels
area:rasa-oss 🎡
Anything related to the open source Rasa framework
type:bug 🐛
Inconsistencies or issues which will cause an issue or problem for users or implementors.
Rasa Open Source version
2.8.15
Python version
3.8
What operating system are you using?
OSX
What happened?
Based on this forum report, which I was able to reproduce, it looks like using
rasa test nlu
with cross validation does not properly work forRegexEntityExtractor
with lookup tables. The lookup tables are not taken into account during the training and validation, leading to no entities being predicted.The following is a minimal example config leading to the issue, since the training data of the user was fairly large:
config.yml
domain.yml
nlu.yml
rules.yml
Afterwards, the report shows no entities extracted by
RegexEntityExtractor
with confusion matrixSource of the problem
Stepping through the code showed that when folds are generated from the training data in generate_folds, the
TrainingData
objects created here for don't have thelookup_tables
parameter set, resulting in empty lookup tables for both train and test data.Proposed solution
Add the
lookup_tables
parameter to take over the lookup tables from the original training data object.Definition of done
Command / Request
rasa test nlu --nlu data/nlu.yml --cross-validation --runs 1 --folds 2
Relevant log output
The text was updated successfully, but these errors were encountered: