Dash and Other development by jvachier · Pull Request #3 · jvachier/Personality-classification

jvachier · 2025-07-14T11:19:11Z

Describe your changes

Provide a clear and concise description of the changes made in this pull request. Include any relevant context or background information.

What is the purpose of this pull request?
What problem does it solve?
What functionality does it add or improve?

Issue ticket number and link

Type of Change

Check the type of change your pull request introduces:

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Refactoring (code improvements without changing functionality)

Checklist before requesting a review

Before submitting your pull request, ensure the following:

I have performed a self-review of my code.
I have added tests that prove my fix is effective or that my feature works.
I have added or updated relevant documentation (if applicable).
My changes do not introduce any new warnings or errors.
I have checked for security vulnerabilities in the code.
I have ensured backward compatibility (if applicable).

- Remove MLflow/FastAPI dependencies and resolve Pydantic warnings - Implement comprehensive Dash-based model serving dashboard - Add complete pytest test suite with 100% pass rate - Clean up codebase and remove unused MLOps infrastructure - Update documentation and project structure - Maintain all core ML functionality while simplifying architecture - Fix type annotations and security warnings

- Simplified Dash app UI to only use manual input form with actual personality features - Removed JSON input and file upload tabs for cleaner, focused interface - Enhanced prediction result display with confidence scores, probability bars, and personality descriptions - Updated model loader to properly load ensemble model with feature ordering and metadata - Added visual styling with personality-specific colors and improved formatting - Created trained ensemble model (VotingClassifier) with LR, RF, and LGBM estimators - Added comprehensive metadata for feature mapping and proper prediction decoding - Fixed feature encoding for Stage_fear and Drained_after_socializing categorical variables - Improved error handling and fallback mechanisms in model loading - Increased pre-commit file size limit to 25MB for trained models

- Moved train_and_save_models.py to scripts/ directory for better organization - Simplified Dockerfile by removing unnecessary build dependencies (gcc, g++) - Updated Docker configuration to use ensemble model and copy trained models - Removed MLflow service from docker-compose.yml (no longer needed) - Updated .dockerignore to include trained models while excluding unnecessary files - Added train-models target to Makefile for running model training script - Cleaned up examples directory by removing obsolete mlops_demo.py

- Only runs on main branch pushes and PRs - Triggers only when relevant files are modified (src/, dash_app/, tests/, scripts/, etc.) - Uses single Python version (3.11) instead of matrix - Sets UV_SYSTEM_PYTHON=1 to avoid virtual environment creation - Combines all tests into single job for efficiency - Includes code quality checks, tests, and Dash app validation

- Auto-fix import sorting in scripts/train_and_save_models.py - Auto-fix import sorting in tests/modules/test_model_builders.py - Remove unused import (unittest.mock.patch) from test file - Ensure code passes Ruff checks for CI pipeline

- Fixed formatting issues in 9 files across dash_app/, src/, scripts/, and tests/ - Ensured all code passes ruff format --check for CI pipeline - Maintained consistent code style throughout the project

- Fixed label mapping in model_loader.py to use metadata-driven approach - Added explicit label_mapping to model metadata during training - Fixed Makefile dash target to include --model-name ensemble parameter - Increased pre-commit large file limit to 150MB for ML model files - Added large stack model files to .gitignore (can be regenerated) - Predictions now correctly show 'Extrovert'/'Introvert' instead of 'Class_2' - Probabilities are now properly aligned with prediction classes - Kept ensemble model (20MB) in version control, excluded large stack models

- Auto-formatted scripts/train_and_save_models.py to comply with ruff standards - All linting and formatting checks now pass

Copilot

Pull Request Overview

This PR integrates a Dash-based interactive dashboard, enhances the main pipeline with logging and default handling, introduces a model training script, expands test coverage, and updates documentation and CI configurations.

Added unit and integration tests for model builders and Dash application
Introduced dash_app/ directory with a full Dash app, Docker setup, and CI workflow
Enhanced src/main_modular.py with improved logging, None defaults, and MLOps hooks
Added scripts/train_and_save_models.py to train and persist individual stacks and an ensemble
Updated docs and project configuration for new dependencies and best practices

Reviewed Changes

Copilot reviewed 35 out of 39 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/modules/test_model_builders.py	Added unit tests for model builder functions
tests/modules/init.py	Initialized modules test package
tests/dash_app/test_dash_app.py	Added integration tests for Dash app creation/layout/server
tests/dash_app/init.py	Initialized dash_app test package
tests/conftest.py	Expanded pytest fixtures and configuration
tests/init.py	Initialized root test package
tests/README.md	Added test suite documentation and usage instructions
src/modules/config.py	Refactored configuration docstring
src/main_modular.py	Enhanced main pipeline with logging and default parameter handling
scripts/train_and_save_models.py	Added training and model saving script for stacks and ensemble
pytest.ini	Configured pytest options and markers
pyproject.toml	Updated dependencies (imbalanced-learn, sdv, dash, joblib, bandit, pydocstyle)
docs/technical-guide.md	Added detailed Dash application architecture documentation
docs/mlops-integration-summary.md	Introduced MLOps integration summary
docs/mlops-infrastructure.md	New comprehensive MLOps infrastructure documentation
dash_app/src/model_loader.py	Implemented ModelLoader and LocalModelLoader for Dash
dash_app/src/layout.py	Created Dash UI layout and input components
dash_app/src/callbacks.py	Defined callback functions for predictions and history updates
dash_app/src/app.py	Developed Dash application initialization and run logic
dash_app/src/init.py	Defined dash_app package exports
dash_app/main.py	Entry point for running Dash app via CLI
dash_app/docker-compose.yml	Docker Compose config for Dash service
dash_app/Dockerfile	Dockerfile for building Dash app container
dash_app/.dockerignore	Dockerignore rules for Dash app build context
Makefile	Updated build targets (run, test, dash, train-models, stop-dash)
.pre-commit-config.yaml	Increased file size limit for large model files
.github/workflows/test.yml	CI workflow for tests, lint, type and security checks
README.md	Updated project README to highlight Dash and production infrastructure

Comments suppressed due to low confidence (3)

tests/modules/test_model_builders.py:9

Tests cover build_stack, build_stack_c, and build_sklearn_stack, but do not include build_neural_stack or build_noisy_stack. Consider adding test cases for these builder functions to ensure full coverage.

from modules.model_builders import build_sklearn_stack, build_stack, build_stack_c

tests/modules/test_model_builders.py:25

[nitpick] This fixture name duplicates the sample_data fixture in conftest.py, which may lead to fixture resolution conflicts. Consider renaming it (e.g., model_builders_sample_data).

    def sample_data(self):

dash_app/src/callbacks.py:154

The callback sync_json_input references component IDs json-input and json-input-display which are not defined in the layout, causing runtime errors. Consider removing or updating this callback to match actual component IDs.

    @app.callback(

tests/modules/test_model_builders.py

jvachier added 5 commits July 14, 2025 12:01

Dash app and other DevOps.

9cf8cd7

jvachier self-assigned this Jul 14, 2025

jvachier added the SemVer-Minor label Jul 14, 2025

jvachier added 2 commits July 14, 2025 13:27

Fix code style issues with Ruff

ccb8810

- Auto-fix import sorting in scripts/train_and_save_models.py - Auto-fix import sorting in tests/modules/test_model_builders.py - Remove unused import (unittest.mock.patch) from test file - Ensure code passes Ruff checks for CI pipeline

Auto-format code with Ruff

cd7a62c

- Fixed formatting issues in 9 files across dash_app/, src/, scripts/, and tests/ - Ensured all code passes ruff format --check for CI pipeline - Maintained consistent code style throughout the project

jvachier requested a review from Copilot July 14, 2025 11:35

This comment was marked as outdated.

Sign in to view

jvachier added 5 commits July 14, 2025 13:38

fixe test yml file.

54a489a

removing unnecessary code.

e1930cb

-

76c2656

Simplifying Makefile.

5badb1b

jvachier requested a review from Copilot July 14, 2025 12:49

ruff.

6a6c5df

This comment was marked as outdated.

Sign in to view

jvachier added 2 commits July 14, 2025 14:53

Fix code formatting with ruff

43e092c

- Auto-formatted scripts/train_and_save_models.py to comply with ruff standards - All linting and formatting checks now pass

Adding documentations for the Dash Application.

b1f52ac

jvachier requested a review from Copilot July 14, 2025 13:43

Copilot AI reviewed Jul 14, 2025

View reviewed changes

tests/modules/test_model_builders.py Show resolved Hide resolved

Adding images.

a841b8c

jvachier added SemVer-Major and removed SemVer-Minor labels Jul 14, 2025

jvachier merged commit 58ee1c7 into main Jul 14, 2025
1 check passed

jvachier deleted the jv/experimentation branch July 14, 2025 14:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dash and Other development#3

Dash and Other development#3
jvachier merged 16 commits intomainfrom
jv/experimentation

jvachier commented Jul 14, 2025 •

edited

Loading

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jvachier commented Jul 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe your changes

Issue ticket number and link

Type of Change

Checklist before requesting a review

Uh oh!

This comment was marked as outdated.

Uh oh!

This comment was marked as outdated.

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jvachier commented Jul 14, 2025 •

edited

Loading