You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
🔥<spanstyle="color: #ff3860">[NEW!]</span>We introduce the task of transsuasion, the task of transferring content from one behavior to another while holding the other conditions like meaning, speaker, and time constant.
126
+
🔥<spanstyle="color: #ff3860">[NEW!]</span><b>Introducing PersuasionBench and PersuasionArena</b> - First large-scale automated benchmark and arena to measure the persuasive abilities of generative models.
127
+
<br>
128
+
🔥<spanstyle="color: #ff3860">[NEW!]</span>We introduce the task of transsuasion, the task of transferring content from one behavior to another while holding the other conditions like meaning, speaker, and time constant.
127
129
<br>
128
-
🔥<spanstyle="color: #ff3860">[NEW!]</span>We exhibit better or similar 0-shot and few shot abilities than GPT4 on transcreation, seo, and modelling human preference with a 13B model!
130
+
🔥<spanstyle="color: #ff3860">[NEW!]</span><b>Challenging Scale Assumptions</b> - Smaller models can outperform larger ones in persuasion when trained on targeted datasets.
131
+
<br>
132
+
🔥<spanstyle="color: #ff3860">[NEW!]</span><b>Policy Implications</b> - Current regulations like SB-1047 and EU AI law fail to capture the full impact of AI on society, highlighting the need for more comprehensive measures.
129
133
<br>
130
134
🔥<spanstyle="color: #ff3860">[NEW!]</span>We release the <ahref="./PersuasionArena.html" target="_blank">Persuasion Leaderboard</a> and you can also participate in the persuasion <ahref="./humaneval.html" target="_blank">Human-Eval</a>
131
-
<br><br>
132
-
We develop an instruction fine-tuning regime to show that smaller LLMs can also surpass the persuasion capabilities of much larger LLMs. We compare the contributions of various types of instructions in developing persuasion capabilities.
133
-
<br><br>
134
-
Further, we show that training on synthetically generated explanations of why a tweet might perform better than another tweet further helps increase the persuasion capability of LLMs beyond just the ground-truth instruction data.
<p>Here are the results of our models on the Persuasion Leaderboard. The leaderboard is based on the <ahref="https://arxiv.org/abs/2410.02653">paper</a> and the <ahref="./PersuasionArena.html">PersuasionArena</a> website.</p>
A few samples showing Transsuasion. While the account, time, and meaning of the samples remain similar, the behavior over the samples varies significantly.
149
-
<imgid="transsuasion-ground-truth" width="100%" src="images/transsuasion-headline-image.jpeg",alt="A few samples showing Transsuasion. While the account, time, and meaning of the samples remain similar, the behavior over the samples varies significantly.">
218
+
<imgid="transsuasion-ground-truth" width="80%" src="images/transsuasion-samples-ground-truth-1.jpg",alt="A few samples showing Transsuasion. While the account, time, and meaning of the samples remain similar, the behavior over the samples varies significantly.">
219
+
<br><br>
220
+
<imgid="transsuasion-ground-truth" width="80%" src="images/transsuasion-headline-image.jpeg",alt="A few samples showing Transsuasion. While the account, time, and meaning of the samples remain similar, the behavior over the samples varies significantly.">
150
221
151
222
<br><br>
152
223
153
224
154
225
A few samples showing Transsuasion using our model. The left part contains original low-liked tweet, and the right contains the transsuaded version of the tweet.
155
-
<imgid="transsuasion-generated-examples" width="100%" src="images/transsuasion-generated-examples.jpeg",alt="A few samples showing Transsuasion using our model. The left part contains original low-liked tweet, and the right contains the transsuaded version of the tweet.">
226
+
<imgid="transsuasion-generated-examples" width="80%" src="images/transsuasion-samples-generated.jpg",alt="A few samples showing Transsuasion using our model. The left part contains original low-liked tweet, and the right contains the transsuaded version of the tweet.">
227
+
<br><br>
228
+
<imgid="transsuasion-generated-examples" width="80%" src="images/transsuasion-generated-examples.jpeg",alt="A few samples showing Transsuasion using our model. The left part contains original low-liked tweet, and the right contains the transsuaded version of the tweet.">
0 commit comments