diff --git a/README.md b/README.md
index 100cda4..97b0fe5 100644
--- a/README.md
+++ b/README.md
@@ -2,7 +2,7 @@
 
 EQ-Bench 3 is a multi-turn emotional intelligence benchmark. It assesses active EQ skills, interpersonal skills, psychological insight and analytical depth. It challenges language models with role-play or analysis tasks that require empathy, depth of insight, social dexterity, and more. An auxiliary judge model (by default, Claude Sonnet 3.7) scores or pairwise-compares the outputs.
 
-For full details on the benchmark including methodology, criteria, bias analysis, repeatabilty experiments and more, click [here](http://localhost:8000/about.html#long).
+For full details on the benchmark including methodology, criteria, bias analysis, repeatabilty experiments and more, click [here](https://eqbench.com/about.html#long).
 
 **Features**
 - **Role-Play Scenarios**: The tested model is placed in conversation-based scenarios (e.g., parenting, relationship conflict, workplace tension). It must articulate what it (and others) feel/think before delivering its final response.
@@ -316,4 +316,4 @@ repository and the original EQ-Bench paper.
 archivePrefix= {arXiv},
 primaryClass = {cs.CL}
 }
-```
\ No newline at end of file
+```