We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The code in function extract_sentence_representations line 119 neurox/data/extraction/transformers_extractor.py Assertion in line 243 Loading model: asafaya/albert-base-arabic Reading input corpus Reading filter vocabulary Preparing output file: ...join_3.test.txt.asafaya-albert-base-arabic.hdf5 Extracting representations from model Original (015): ['الله', 'a يرزقني', 'a بالشخص', 'a اللي', 'a اذا', 'a مت', 'a يدعي', 'a لي', 'a بكل', 'a صلاه', 'a .', 'a .', 'a ?', 'a ', 'a ?'] Tokenized (024): ['[CLS]', '▁الله', '▁يرزق', 'ني', '▁بال', 'شخص', '▁اللي', '▁اذا', '▁مت', '▁يدعي', '▁لي', '▁بكل', '▁صل', 'ا', ' ه', '▁', '.', '▁', '.', '▁', '?', '▁', '?', '[SEP]'] ['▁الله', '▁يرزقني', '▁بالشخص', '▁اللي', '▁اذا', '▁مت', '▁يدعي', '▁لي', '▁بكل', '▁صلاه', '▁.', '▁.', '▁?', '▁', '?'] [24, 15182, 150, 64, 5746, 2299, 998, 764, 7825, 154, 610, 3793, 16, 15, 11, 9, 11, 9, 11, 5158, 11, 5158] Res: counter: 23 ids_without_special_tokens: 22
extract_sentence_representations
The text was updated successfully, but these errors were encountered:
@ahmed451 can you add the original sentence as well in its raw form?
Sorry, something went wrong.
Original sentence: 'الله يرزقني بالشخص اللي اذا مت يدعي لي بكل صلاه . . ? ?'
I think this was fixed with the update to latest neurox? Can you confirm?
Hi @ahmed451, can you confirm that this is no longer an issue? I believe it was fixed with an update right?
No branches or pull requests
The code in function
extract_sentence_representations
line 119 neurox/data/extraction/transformers_extractor.pyAssertion in line 243
Loading model: asafaya/albert-base-arabic
Reading input corpus
Reading filter vocabulary
Preparing output file: ...join_3.test.txt.asafaya-albert-base-arabic.hdf5
Extracting representations from model
Original (015): ['الله', 'a يرزقني', 'a بالشخص', 'a اللي', 'a اذا', 'a مت', 'a يدعي', 'a لي', 'a بكل', 'a صلاه', 'a .', 'a .',
'a ?', 'a ', 'a ?']
Tokenized (024): ['[CLS]', '▁الله', '▁يرزق', 'ني', '▁بال', 'شخص', '▁اللي', '▁اذا', '▁مت', '▁يدعي', '▁لي', '▁بكل', '▁صل', 'ا', '
ه', '▁', '.', '▁', '.', '▁', '?', '▁', '?', '[SEP]']
['▁الله', '▁يرزقني', '▁بالشخص', '▁اللي', '▁اذا', '▁مت', '▁يدعي', '▁لي', '▁بكل', '▁صلاه', '▁.', '▁.', '▁?', '▁', '?']
[24, 15182, 150, 64, 5746, 2299, 998, 764, 7825, 154, 610, 3793, 16, 15, 11, 9, 11, 9, 11, 5158, 11, 5158]
Res: counter: 23 ids_without_special_tokens: 22
The text was updated successfully, but these errors were encountered: