Refusals (LLM Leaderboard)
Leaderboard of refusal rates across the GPT-4o, o1-mini, o1-preview, and Claude 3.5 Sonnet language models. Comparative analysis of how different models respond to various reasoning tasks.
Read more here: External Link
Leaderboard of refusal rates across the GPT-4o, o1-mini, o1-preview, and Claude 3.5 Sonnet language models. Comparative analysis of how different models respond to various reasoning tasks.
Read more here: External Link