
save token counts #1762

Merged

Conversation


@emrgnt-cmplxty emrgnt-cmplxty commented Jan 7, 2025

Important

Add token counting for messages, save token metadata to the database, and add a tiktoken dependency for encoding.

  • Token Counting:
    • Add tokens_count_for_message() and num_tokens_from_messages() in retrieval_service.py to calculate token counts for messages.
  • Database Updates:
    • Modify agent() in retrieval_service.py to save input_tokens and output_tokens metadata when adding messages to the database.
  • Dependencies:
    • Add tiktoken dependency in pyproject.toml for token encoding.
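
The actual code in retrieval_service.py is not shown in this conversation, but the two helpers named above could be sketched roughly as follows. The per-message overhead of 3 tokens, the default model name, and the whitespace fallback encoder are illustrative assumptions, not taken from the PR; the real implementation relies on tiktoken.

```python
import logging

logger = logging.getLogger(__name__)

# Sketch of the helpers named in the PR description; illustrative only.
try:
    import tiktoken

    def _encode(text: str, model: str) -> list:
        try:
            enc = tiktoken.encoding_for_model(model)
        except KeyError:
            logger.warning("Model not found. Using cl100k_base encoding.")
            enc = tiktoken.get_encoding("cl100k_base")
        return enc.encode(text)
except ImportError:
    # Crude stand-in so the sketch runs without tiktoken installed.
    def _encode(text: str, model: str) -> list:
        return text.split()

def tokens_count_for_message(message: dict, model: str = "gpt-4o") -> int:
    """Token count for one {role, content} chat message."""
    return sum(len(_encode(str(v), model)) for v in message.values())

def num_tokens_from_messages(
    messages: list,
    model: str = "gpt-4o",
    tokens_per_message: int = 3,  # per-message framing overhead (assumed)
) -> int:
    total = sum(
        tokens_per_message + tokens_count_for_message(m, model)
        for m in messages
    )
    return total + 3  # every reply is primed with 3 extra tokens (assumed)
```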

This description was created by Ellipsis for 02ac34e. It will automatically update as commits are pushed.
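
The database step above, saving input_tokens and output_tokens when a message is added, might look like the following sketch. The save_turn function, the in-memory store, and the injected counter are hypothetical; only the metadata key names come from the PR description.

```python
from typing import Callable, Dict, List

def save_turn(
    store: List[Dict],
    user_msg: Dict,
    assistant_msg: Dict,
    count_tokens: Callable[[List[Dict]], int],
) -> None:
    # Persist the assistant message with input/output token counts as
    # metadata, using the key names from the PR description.
    store.append({
        "message": assistant_msg,
        "metadata": {
            "input_tokens": count_tokens([user_msg]),
            "output_tokens": count_tokens([assistant_msg]),
        },
    })
```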

@emrgnt-cmplxty emrgnt-cmplxty marked this pull request as ready for review January 7, 2025 00:55
@emrgnt-cmplxty emrgnt-cmplxty merged commit 1689f20 into feature/configurable-api-base Jan 7, 2025
1 check passed

@ellipsis-dev ellipsis-dev bot left a comment


👍 Looks good to me! Reviewed everything up to 02ac34e in 1 minute and 32 seconds

More details
  • Looked at 140 lines of code in 2 files
  • Skipped 1 file when reviewing.
  • Skipped posting 2 drafted comments based on config settings.
1. py/core/main/services/retrieval_service.py:36
  • Draft comment:
    Consider making tokens_per_message configurable or document why 3 tokens are added for each message and reply. This hardcoded value might not be accurate for all models or message types.
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The functions tokens_count_for_message and num_tokens_from_messages calculate token counts, but a hardcoded value of 3 tokens is added for each message and reply. This might not be accurate for all models or message types.
2. py/core/main/services/retrieval_service.py:56
  • Draft comment:
    Replace print with logger.warning for better logging practices.
logger.warning("Warning: model not found. Using cl100k_base encoding.")
  • Reason this comment was not posted:
    Confidence changes required: 50%
    The print statement in num_tokens_from_messages function is not ideal for logging warnings. It should use the logger for consistency and better control over log levels.
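
The suggested fix routes the fallback notice through the module logger instead of print. A generic illustration of the pattern follows; the helper name and the known-model set are hypothetical, not taken from the PR.

```python
import logging

logger = logging.getLogger(__name__)

def resolve_encoding_name(model: str, known_models: set) -> str:
    """Hypothetical helper showing logger.warning instead of print."""
    if model in known_models:
        return model
    # print() bypasses log levels, handlers, and formatting;
    # logger.warning keeps the fallback notice in the app's log stream.
    logger.warning("Model %s not found. Using cl100k_base encoding.", model)
    return "cl100k_base"
```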

Workflow ID: wflow_XVoFD3adowaM6ew2


You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.
