docs: add RAIL Score evaluation cookbook — content + agent safety scoring by SumitVermakgp · Pull Request #2766 · langfuse/langfuse-docs

SumitVermakgp · 2026-04-01T22:33:41Z

Summary

Adds a cookbook notebook demonstrating how to evaluate LLM outputs and agent tool calls with RAIL Score and push dimension scores to Langfuse traces.

What this cookbook covers

Content evaluation: inline and batch scoring of LLM outputs across 8 responsible AI dimensions (fairness, safety, reliability, transparency, privacy, accountability, inclusivity, user impact)
Deep mode: per-dimension explanations attached as score comments
Agent tool-call evaluation: pre-execution risk assessment (ALLOW/FLAG/BLOCK) pushed to observation-level scores
Agent session tracking: cumulative risk scores and pattern detection across multi-tool workflows
Human review integration: flagging low-scoring traces with needs_human_review boolean scores for Annotation Queue routing
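
As a rough illustration of the content-evaluation flow in the list above (not the notebook's actual code), the sketch below maps a per-dimension result onto Langfuse-style score payloads and adds a needs_human_review flag. The result shape, the 0-10 scale, the threshold of 7.0, the `rail_` name prefix, and the helper name `build_score_payloads` are all assumptions made for this example; the real rail-score-sdk response format and the notebook's thresholds may differ.

```python
from typing import Any

# The eight dimensions named in this PR description.
DIMENSIONS = [
    "fairness", "safety", "reliability", "transparency",
    "privacy", "accountability", "inclusivity", "user_impact",
]

# Hypothetical threshold: the scale and cut-off are assumptions, not RAIL defaults.
REVIEW_THRESHOLD = 7.0


def build_score_payloads(trace_id: str, result: dict[str, dict[str, Any]]) -> list[dict[str, Any]]:
    """Turn a per-dimension result into one score payload per dimension.

    `result` is an assumed shape: {dimension: {"score": float, "explanation": str}}.
    """
    payloads = []
    for dim in DIMENSIONS:
        if dim not in result:
            continue  # skip dimensions the evaluator did not return
        entry = result[dim]
        payloads.append({
            "trace_id": trace_id,
            "name": f"rail_{dim}",
            "value": float(entry["score"]),
            "data_type": "NUMERIC",
            # Deep-mode explanations, when present, travel as the score comment.
            "comment": entry.get("explanation"),
        })
    # Flag the trace for review if any returned dimension is below the threshold.
    needs_review = any(p["value"] < REVIEW_THRESHOLD for p in payloads)
    payloads.append({
        "trace_id": trace_id,
        "name": "needs_human_review",
        "value": 1 if needs_review else 0,
        "data_type": "BOOLEAN",
        "comment": None,
    })
    return payloads
```

Each payload could then be handed to the Langfuse SDK's score-creation call (for example `create_score` in recent Python SDK versions); the exact method name and accepted arguments should be checked against the installed SDK version.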

Changes

Added cookbook/evaluation_with_rail_score.ipynb
Added route entry in cookbook/_routes.json

Links

GitHub Discussion: https://github.com/orgs/langfuse/discussions/12954
SDK on PyPI: rail-score-sdk
Documentation: docs.responsibleailabs.ai

Add cookbook notebook demonstrating RAIL Score integration with Langfuse for 8-dimension responsible AI evaluation of LLM outputs and agent tool calls. Covers inline scoring, batch evaluation, deep mode explanations, agent tool-call risk assessment, session tracking, and human review queue integration.
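
The agent-side flow (pre-execution ALLOW/FLAG/BLOCK decisions plus session tracking) can be sketched in the same spirit. This is a hypothetical illustration, not code from the notebook: the 0.0-1.0 risk scale, the 0.3 and 0.7 cut-offs, the score name `rail_tool_decision`, and the class name `SessionRiskTracker` are assumptions, and a real integration would obtain the risk assessment from the RAIL Score SDK rather than from a hard-coded number.

```python
from dataclasses import dataclass, field

# Hypothetical cut-offs on an assumed 0.0-1.0 risk scale.
FLAG_AT = 0.3
BLOCK_AT = 0.7


def decide(risk: float) -> str:
    """Map a pre-execution risk estimate to ALLOW, FLAG, or BLOCK."""
    if risk >= BLOCK_AT:
        return "BLOCK"
    if risk >= FLAG_AT:
        return "FLAG"
    return "ALLOW"


@dataclass
class SessionRiskTracker:
    """Accumulates per-tool-call decisions across one agent session."""
    calls: list = field(default_factory=list)

    def record(self, tool_name: str, risk: float) -> dict:
        decision = decide(risk)
        self.calls.append((tool_name, risk, decision))
        # Shaped like an observation-level categorical score payload.
        return {
            "name": "rail_tool_decision",
            "value": decision,
            "data_type": "CATEGORICAL",
            "comment": f"{tool_name}: risk={risk:.2f}",
        }

    @property
    def cumulative_risk(self) -> float:
        """Mean risk over the session so far (0.0 when no calls are recorded)."""
        if not self.calls:
            return 0.0
        return sum(r for _, r, _ in self.calls) / len(self.calls)

    def repeated_flags(self, min_count: int = 2) -> bool:
        """A simple pattern check: several non-ALLOW decisions in one session."""
        return sum(1 for _, _, d in self.calls if d != "ALLOW") >= min_count
```

The mean is only one possible notion of cumulative risk; a maximum or a decayed sum would be equally plausible choices, and the PR description does not say which one the notebook uses.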

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

vercel · 2026-04-01T22:33:46Z

@SumitVermakgp is attempting to deploy a commit to the langfuse Team on Vercel.

A member of the Team first needs to authorize it.

review-notebook-app · 2026-04-01T22:33:47Z

Check out this pull request on ReviewNB.

See visual diffs & provide feedback on Jupyter Notebooks.

CLAassistant · 2026-04-01T22:33:49Z

All committers have signed the CLA.

claude bot reviewed Apr 1, 2026

dosubot bot added the size:XL label (this PR changes 500-999 lines, ignoring generated files) Apr 1, 2026

dosubot bot added the docs label Apr 1, 2026

SumitVermakgp added 2 commits April 2, 2026 04:10

Merge branch 'main' into feat/rail-score-evaluation

0e22604

fix: replace em-dashes with colons in notebook

1180440

Replace all em-dash characters with colons or hyphens for consistency with project style conventions.

jannikmaierhoefer self-requested a review April 2, 2026 14:21

SumitVermakgp wants to merge 3 commits into langfuse:main from SumitVermakgp:feat/rail-score-evaluation
