feat: Add OpenRAG Workbench evaluation framework#1213
Open
matanor wants to merge 34 commits into langflow-ai:main from
Conversation
- Add FilenameExistsResponse model to SDK
- Add filename_exists() async method to DocumentsClient
- Add GET /v1/documents/check-filename endpoint with API key auth
- Export FilenameExistsResponse in SDK __init__.py

This enables SDK users to check if a file exists in the knowledge base before ingestion, avoiding duplicate uploads.
…to sdk_for_ragworkbench
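To illustrate the intended usage described above, here is a hedged sketch of checking a filename before ingestion. The stub client and the `exists` field on `FilenameExistsResponse` are assumptions for illustration; the real `DocumentsClient.filename_exists()` in this PR wraps the authenticated `GET /v1/documents/check-filename` endpoint and its model may differ.

```python
import asyncio
from dataclasses import dataclass

@dataclass
class FilenameExistsResponse:
    # Assumed shape: the actual SDK model in this PR may differ.
    exists: bool

class StubDocumentsClient:
    """Stand-in for the SDK's DocumentsClient, for illustration only."""
    def __init__(self, known):
        self._known = set(known)

    async def filename_exists(self, filename: str) -> FilenameExistsResponse:
        # The real method calls GET /v1/documents/check-filename with API-key auth.
        return FilenameExistsResponse(exists=filename in self._known)

async def should_ingest(client, filename: str) -> bool:
    """Return True when no copy of `filename` exists in the knowledge base."""
    response = await client.filename_exists(filename)
    return not response.exists

client = StubDocumentsClient(known={"report.pdf"})
print(asyncio.run(should_ingest(client, "report.pdf")))  # False: already ingested
print(asyncio.run(should_ingest(client, "notes.md")))    # True: safe to ingest
```

Gating ingestion on this check is what lets callers avoid the duplicate uploads mentioned in the commit message.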
- Add inference and ingest pipelines
- Add create_boards script and utility modules (utils, logging_config)
- Add .env.example for configuration
- Add pyproject.toml
- Add shared conftest.py with environment and logging fixtures
- Enhance inference test with explicit cache hit/miss validation
- Update pytest configuration with pythonpath and strict markers
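The cache hit/miss validation mentioned above can be illustrated with a toy model. This is a hypothetical sketch, not the pipeline's actual code; the class and attribute names are invented for the example.

```python
class CachingInference:
    """Toy inference wrapper that records cache hits and misses."""
    def __init__(self, backend):
        self._backend = backend  # callable that answers a prompt
        self._cache = {}
        self.hits = 0
        self.misses = 0

    def query(self, prompt: str) -> str:
        if prompt in self._cache:
            self.hits += 1           # repeated prompt: served from cache
            return self._cache[prompt]
        self.misses += 1             # new prompt: backend is invoked
        answer = self._backend(prompt)
        self._cache[prompt] = answer
        return answer

pipeline = CachingInference(backend=lambda p: f"answer to {p!r}")
first = pipeline.query("What is RAG?")   # miss
second = pipeline.query("What is RAG?")  # hit
assert first == second
assert (pipeline.hits, pipeline.misses) == (1, 1)
```

A test that asserts on hit/miss counters like these, rather than only on the returned answer, catches regressions where the cache is silently bypassed.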
- Add boards module with table_rich board configuration
- Rename create_boards.py to evaluate.py for clarity
- Enhance .gitignore with additional patterns
- Remove unused imports from pipelines
- Update utility functions
Define a single configuration with multiple options; update screens to use metric names
Summary
This PR adds the OpenRAG Workbench evaluation framework to the repository, providing a comprehensive solution for evaluating RAG (Retrieval-Augmented Generation) systems using the OpenRAG SDK.
Prerequisites
Requires the changes to the SDK from #1086
Changes Made
New Components Added
evaluation/openrag_workbench/ (new framework directory)

Key Files
Pipeline Implementations
- pipelines/ingest.py - Document ingestion pipeline (265 lines)
- pipelines/inference.py - Inference pipeline for RAG queries (254 lines)

Evaluation Entry Point
- evaluate.py - Main evaluation script

Configuration & Utilities
- boards/table_rich/board.yaml - Board configuration for results display
- logging_config.py - Logging configuration
- utils.py - Utility functions

Testing
- tests/pipelines/test_ingest.py - Ingestion pipeline tests (158 lines)
- tests/pipelines/test_inference.py - Inference pipeline tests (232 lines)
- tests/conftest.py - Test fixtures and configuration

Project Configuration
- pyproject.toml - Project dependencies and configuration
- uv.lock - Locked dependencies (3121 lines)
- .env.example - Environment variable template
- .gitignore - Git ignore patterns
- README.md - Documentation

Statistics
Additional Notes
This implementation provides a production-ready evaluation framework that integrates seamlessly with the OpenRAG SDK, enabling comprehensive testing and benchmarking of RAG systems.
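For orientation, an .env.example for a setup like this typically holds the service endpoint and credentials. The variable names below are purely illustrative guesses and are not taken from the PR's actual file.

```shell
# Hypothetical example only — variable names are guesses, not the PR's actual .env.example.
OPENRAG_BASE_URL=http://localhost:8000   # base URL of the OpenRAG service
OPENRAG_API_KEY=replace-with-your-key    # API key used by the SDK clients
```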