-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Diet Classifier] ValueError: Number of examples should be the same for all data. #5508
Comments
Thanks for the issue, @rgstephens will get back to you about it soon! You may find help in the docs and the forum, too 🤗 |
It looks like some examples don't have intent labels |
Hi thanks for reply I had a look at my I trained |
otherwise, it could be that some examples couldn't be featurized for some reason. 0.14.1 version didn't have this check |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Any update on that? I'm getting the same issue here, using rasa 1.10.5. |
me too! using rasa 1.10.5 |
Hi! |
Hi! |
@shfshf @JoaoVFelipe Is one of you able to share his NLU data + config.yml so that I can take a closer look at the problem? Without the data to reproduce the issue it is hard to tell what is going wrong. Thanks. |
@robinsongh381 @JoaoVFelipe @tabergma @Ghostvv I am the colleague of @shfshf who provides the custom tokenizer component for his pipeline. And I finally find the there are two root causes of this issue:
Solutions:
|
Thanks @howl-anderson for the comment. We actually tackle problem 1 already in #5905. It is already merged into master. Just to be sure, if you update your custom tokenizer and solve the |
@tabergma It's good to see that the official team already takes action for problem 1. For problem 2, I am just working on the tokenizer rewriting process, but because when we using |
Thanks @tabergma and @howl-anderson for the help, setting the By the way, sorry for not sharing the NLU data before. It is pretty big, and I was instructed to not share it since some of it is enterprise sensitive. Thank you very much. |
@tabergma It's proved by @shfshf that updating the custom tokenizer indeed works! So, I think at least part of @robinsongh381's issue is related to the custom tokenizer too, since his tokenizer works in v0.14.1, but doesn't work in v1.9.2. I hope this message can help him. If @robinsongh381 has trouble rewrite his custom tokenizer, I can try my best to help him. |
Thanks @howl-anderson my colleague, |
Great, glad to hear that it works for you! I will close the issue as there is nothing code wise we can do. If you have trouble rewriting your tokenizers, feel free to ask a question on our forum. We are happy to help. |
Rasa version: 1.9.2
Rasa SDK version (if used & relevant):
Rasa X version (if used & relevant):
Python version:
3.6
Operating system (windows, osx, ...):
linux
Issue:
When training rasa nlu (i.e. rasa nlu train) there is an error from rasa/utils/tensorflow/model_data.py line 107
Error (including full traceback):
Command or request that led to error:
Content of configuration file (config.yml) (if relevant):
Content of domain file (domain.yml) (if relevant):
The text was updated successfully, but these errors were encountered: