harmfulness-detection

Here is 1 public repository matching this topic...

raxITlabs / GrayZoneBench

Open-source AI safety benchmark testing how models handle tricky gray-zone requests. CLI for running benchmarks + web dashboard for exploring results.

open-source, benchmark, dashboard, openai, ai-safety, cli-tool, ai-research, llm-evaluation, safety-evaluation, harmfulness-detection, safe-completion

Updated Aug 20, 2025
Python
