From 6ac268dd9136e546334b51b732020018631fbb85 Mon Sep 17 00:00:00 2001
From: ra1n <162525686+xxraincandyxx@users.noreply.github.com>
Date: Wed, 19 Nov 2025 11:46:13 +0800
Subject: [PATCH] Update link in README to point to live site

---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 100cda4..97b0fe5 100644
--- a/README.md
+++ b/README.md
@@ -2,7 +2,7 @@
 
 EQ-Bench 3 is a multi-turn emotional intelligence benchmark. It assesses active EQ skills, interpersonal skills, psychological insight and analytical depth. It challenges language models with role-play or analysis tasks that require empathy, depth of insight, social dexterity, and more. An auxiliary judge model (by default, Claude Sonnet 3.7) scores or pairwise-compares the outputs.
 
-For full details on the benchmark including methodology, criteria, bias analysis, repeatabilty experiments and more, click [here](http://localhost:8000/about.html#long).
+For full details on the benchmark including methodology, criteria, bias analysis, repeatabilty experiments and more, click [here](https://eqbench.com/about.html#long).
 
 **Features**
 - **Role-Play Scenarios**: The tested model is placed in conversation-based scenarios (e.g., parenting, relationship conflict, workplace tension). It must articulate what it (and others) feel/think before delivering its final response.
@@ -316,4 +316,4 @@ repository and the original EQ-Bench paper.
 archivePrefix= {arXiv},
 primaryClass = {cs.CL}
 }
-```
\ No newline at end of file
+```