Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Isn't CCP a white box model? How to test GPT-3.5-turbo #255

Open
wangzhonghai opened this issue Nov 14, 2024 · 6 comments
Open

Isn't CCP a white box model? How to test GPT-3.5-turbo #255

wangzhonghai opened this issue Nov 14, 2024 · 6 comments
Assignees

Comments

@wangzhonghai
Copy link

No description provided.

@IINemo
Copy link
Owner

IINemo commented Nov 14, 2024

Hi! Correct, CCP is a white box model. However, OpenAI API now provides top 5 logits from the probability distribution. It is not much, but we can run CCP with GPT-3.5-turbo. @ArtemVazh could you share an example?

@wangzhonghai
Copy link
Author

@IINemo Hello, could you please provide an example for this section?

@IINemo
Copy link
Owner

IINemo commented Nov 25, 2024

Sorry for the late reply, we will provide the example asap. @cant-access-rediska0123 could you help with that?

@wangzhonghai
Copy link
Author

@IINemo The CCP score I calculated is like this: [-0.872331670225528, -0.872331670225528, -0.849797917762312, -0.88501326670551728, -0.88501413239502603, -0.55894843210343627, -0.999430756426281, -1.0, -0.9999880353950354]. Is the result correct?

@cant-access-rediska0123
Copy link
Collaborator

@IINemo Hello, could you please provide an example for this section?

Currently, we do not have an implementation of CCP for the Blackbox case in LM-Polygraph. Our Blackbox models are specifically designed for scenarios where there is no information about token distributions.

We will consider adding code to address it in the future.

@cant-access-rediska0123
Copy link
Collaborator

cant-access-rediska0123 commented Nov 27, 2024

@IINemo The CCP score I calculated is like this: [-0.872331670225528, -0.872331670225528, -0.849797917762312, -0.88501326670551728, -0.88501413239502603, -0.55894843210343627, -0.999430756426281, -1.0, -0.9999880353950354]. Is the result correct?

The numbers you calculated seem correct. The negative scores occur because the CCP estimator includes a minus sign in the calculation. This design ensures that higher values represent greater uncertainty.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants