KL Divergence between Multivariate Gaussians of LLM roles

Persona-hub create diverse synthetic data by thematic roles, in one distribution (like hospital -> nurse -> patient). Therefore, I use clustering and dimensionality reduction (HDBSCAN and UMAP) to get a list of responses for a role in order to approximate the distribution of each role. Embeddings of texts are obtained using 2 approaches: BERT-based or TFiDF models.

First approach on TfiDF embeddings

Creating word frequency distribution for each cluster and converting counts to probabilities. The Kullback-Leibler divergence between two probability distributions $P$ and $Q$ is defined as:

$$D_{KL}(P \parallel Q) = \sum_{i} P(i) \log \left( \frac{P(i)}{Q(i)} \right)$$

where:

$P$ and $Q$ are the probability distributions,
$P(i)$ and $Q(i)$ are the probability mass functions for the discrete case,

Second approach on BERT embeddings

Since BERT averages the embeddings by words, normal distributions are obtained. The KL divergence between two multivariate Gaussian distributions $\mathcal{N}(\mu_1, \Sigma_1)$ and $\mathcal{N}(\mu_2, \Sigma_2)$ is given by the following formula:

$$D_{KL}(\mathcal{N}(\mu_1, \Sigma_1) \parallel \mathcal{N}(\mu_2, \Sigma_2)) = \frac{1}{2} \left( \text{tr}(\Sigma_2^{-1} \Sigma_1) + (\mu_2 - \mu_1)^T \Sigma_2^{-1} (\mu_2 - \mu_1) - k + \log \left(\frac{\det \Sigma_2}{\det \Sigma_1}\right) \right)$$

where:

$\mu_1$ and $\mu_2$ are the means of the distributions
$\Sigma_1$ and $\Sigma_2$ are the covariance matrices
$k$ is the dimensionality of the distributions

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
README.md		README.md
compute_kl_div_llm_roles.ipynb		compute_kl_div_llm_roles.ipynb
condition_clusters.csv		condition_clusters.csv
input persona_clusters.csv		input persona_clusters.csv
persona_clusters.csv		persona_clusters.csv
personahub-pipe.png		personahub-pipe.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KL Divergence between Multivariate Gaussians of LLM roles

About

Releases

Packages

Languages

Arseny5/roles-llm-agents

Folders and files

Latest commit

History

Repository files navigation

KL Divergence between Multivariate Gaussians of LLM roles

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages