You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Tokenized lengths of words may be different depending on the context, as illustrated by the example below.
A source of possible inconsistency in our ANN encode method: the tokenization of a word is non-uniform across contexts, so we can't rely on the tokenized length of the individual token to calculate offsets.
Tokenized lengths of words may be different depending on the context, as illustrated by the example below.
A source of possible inconsistency in our ANN
encode
method: the tokenization of a word is non-uniform across contexts, so we can't rely on the tokenized length of the individual token to calculate offsets.We need to evaluate the situations in which this would be an issue, and whether this will affect the output of
encode
The text was updated successfully, but these errors were encountered: