Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

assertion fails when extracting representations #14

Open
ahmed451 opened this issue Feb 24, 2022 · 4 comments
Open

assertion fails when extracting representations #14

ahmed451 opened this issue Feb 24, 2022 · 4 comments

Comments

@ahmed451
Copy link

The code in function extract_sentence_representations line 119 neurox/data/extraction/transformers_extractor.py
Assertion in line 243
Loading model: asafaya/albert-base-arabic
Reading input corpus
Reading filter vocabulary
Preparing output file: ...join_3.test.txt.asafaya-albert-base-arabic.hdf5
Extracting representations from model
Original (015): ['الله', 'a يرزقني', 'a بالشخص', 'a اللي', 'a اذا', 'a مت', 'a يدعي', 'a لي', 'a بكل', 'a صلاه', 'a .', 'a .',
'a ?', 'a ', 'a ?']
Tokenized (024): ['[CLS]', '▁الله', '▁يرزق', 'ني', '▁بال', 'شخص', '▁اللي', '▁اذا', '▁مت', '▁يدعي', '▁لي', '▁بكل', '▁صل', 'ا', '
ه', '▁', '.', '▁', '.', '▁', '?', '▁', '?', '[SEP]']
['▁الله', '▁يرزقني', '▁بالشخص', '▁اللي', '▁اذا', '▁مت', '▁يدعي', '▁لي', '▁بكل', '▁صلاه', '▁.', '▁.', '▁?', '▁', '?']
[24, 15182, 150, 64, 5746, 2299, 998, 764, 7825, 154, 610, 3793, 16, 15, 11, 9, 11, 9, 11, 5158, 11, 5158]
Res: counter: 23 ids_without_special_tokens: 22

@fdalvi
Copy link
Owner

fdalvi commented Feb 24, 2022

@ahmed451 can you add the original sentence as well in its raw form?

@ahmed451
Copy link
Author

Original sentence: 'الله يرزقني بالشخص اللي اذا مت يدعي لي بكل صلاه . . ? ?'

@fdalvi
Copy link
Owner

fdalvi commented Mar 16, 2022

I think this was fixed with the update to latest neurox? Can you confirm?

@fdalvi
Copy link
Owner

fdalvi commented Jun 7, 2022

Hi @ahmed451, can you confirm that this is no longer an issue? I believe it was fixed with an update right?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants