Skip to content

Conversation

@Shuyib
Copy link
Owner

@Shuyib Shuyib commented Oct 12, 2025

This pull request introduces significant improvements to the project's AI safety and model management workflows, focusing on integrating a robust Inspect AI safety layer, updating supported models, and enhancing Docker container startup reliability. The most important changes are grouped below by theme.

AI Safety Layer Integration

  • Added comprehensive Inspect AI safety layer guide (INSPECT_SAFETY_GUIDE.md) and integrated safety checks into app.py, evaluating user input for prompt injection, jailbreaking, and other attacks before LLM processing. Safety results are logged, and configuration options for strict/normal modes are provided. [1] [2] [3]
  • Updated README.md to document the Inspect AI safety layer, its features, usage, integration points, and references, making security practices transparent and easy to follow.

Model and Docker Management

  • Updated the base Ollama Docker image from version 0.3.3 to 0.6.8 and switched the default model from qwen2.5:0.5b to qwen3:0.6b. Added a robust entrypoint script to ensure the model is pulled on container startup, improving reliability and startup diagnostics. [1] [2]
  • Changed Ollama service port mappings in docker-compose.yml and docker-compose-all.yml to avoid conflicts and clarify external/internal port usage. [1] [2]

Documentation and Project Structure

  • Expanded documentation in README.md to include new recommended models (e.g., Gemma 27B), added references to the safety guide and implementation summary, and clarified file roles for easier onboarding and maintenance. [1] [2]

Minor Fixes

  • Corrected a typo in Makefile for improved clarity in linting instructions.

These changes collectively strengthen the project's security posture, improve model support, and make deployment and usage more robust and transparent.

Copilot AI and others added 10 commits September 30, 2025 04:48
Co-authored-by: Shuyib <12908522+Shuyib@users.noreply.github.com>
Co-authored-by: Shuyib <12908522+Shuyib@users.noreply.github.com>
Co-authored-by: Shuyib <12908522+Shuyib@users.noreply.github.com>
remove sudo and edit typos
format code with black
Added Gemma model recommendation and clarified VRAM requirements.
…1-f50d49393314

Add Inspect AI Safety Layer for Prompt Injection and Jailbreaking Detection
@Shuyib Shuyib merged commit 1541f5d into main Oct 12, 2025
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants