HOUND: High-fidelity Optimized Urban Noise Detection

HOUND is an audio classification system for urban noise detection, built on the UrbanSound8K dataset. It includes a custom deep learning model for sound classification, a Gradio web interface for inference, and a CI/CD workflow covering linting, security scans, and artifact uploads.

This project was developed as part of the Software Engineering for Artificial Intelligence course at the University of Salerno.

Authors

Features

Custom CNN model for urban sound classification.
Data augmentation and mel spectrogram extraction for improved accuracy.
Gradio-based web interface for easy inference.
Docker support for containerization.
Comprehensive unit tests with pytest and coverage reports.
CI/CD workflow with linting, security scans, and artifact uploads.

Installation

Prerequisites

Python 3.10+
pip for package management

Setup

Clone the repository:

git clone https://github.com/davidcocc/hound.git
cd hound

Install dependencies:

pip install -r requirements.txt

Note: For testing, additionally install:

pip install pytest pytest-cov pytest-html gradio

(Optional) Set up Docker:
- Build the image: docker build -t hound .
- Run: docker run -p 7860:7860 hound

Usage

Training

To train or retrain the custom model:

python -m src.hound_train --dataset data/archive/ --output custom_model/custom_UrbanSound8K.keras

Use --augment for data augmentation.
Metrics and visualizations are saved in metrics/custom/.
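
The model card lists additive noise, pitch shift, and time stretch as the augmentations used in training. As a rough illustration of the first one only, the sketch below adds Gaussian noise at a chosen signal-to-noise ratio using the standard library. The function name `add_noise`, the `snr_db` parameter, and its 20 dB default are assumptions made for this example; they are not taken from src.hound_train, which may implement augmentation differently.

```python
import math
import random


def add_noise(samples, snr_db=20.0, seed=None):
    """Return a copy of `samples` with Gaussian noise at roughly `snr_db` dB SNR.

    Hypothetical helper for illustration; not the project's own augmentation code.
    """
    if not samples:
        return []
    rng = random.Random(seed)
    # Mean power of the clean signal.
    signal_power = sum(s * s for s in samples) / len(samples)
    # SNR(dB) = 10 * log10(signal_power / noise_power), solved for noise_power.
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [s + rng.gauss(0.0, sigma) for s in samples]
```

A lower `snr_db` produces a noisier clip. Passing a `seed` makes the augmentation reproducible, which helps when comparing training runs.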

Inference

Run inference on a single audio file:

python -m src.hound_inference --file path/to/audio.wav

Use --compare to evaluate the original and custom models on fold 10.
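
UrbanSound8K ships with a metadata CSV that assigns every clip to one of 10 folds, so holding out fold 10 amounts to filtering on the `fold` column. The sketch below shows one way to do that split. The function name `split_by_fold` and the sample rows are made up for illustration, and how src.hound_inference actually selects fold 10 is not shown in this README.

```python
import csv
import io


def split_by_fold(csv_file, test_fold=10):
    """Split UrbanSound8K metadata rows into (train_rows, test_rows) by fold."""
    train_rows, test_rows = [], []
    for row in csv.DictReader(csv_file):
        if int(row["fold"]) == test_fold:
            test_rows.append(row)
        else:
            train_rows.append(row)
    return train_rows, test_rows


# Tiny in-memory stand-in for the metadata file (made-up rows).
SAMPLE_METADATA = """slice_file_name,fold,classID,class
a.wav,1,3,dog_bark
b.wav,10,1,car_horn
c.wav,5,8,siren
d.wav,10,0,air_conditioner
"""

train_rows, test_rows = split_by_fold(io.StringIO(SAMPLE_METADATA))
print(len(train_rows), len(test_rows))  # prints "2 2"
```

Splitting on the predefined folds, rather than shuffling clips at random, keeps slices cut from the same original recording on the same side of the split.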

Gradio Interface

Launch the web UI:

python -m src.interface

Upload an audio file.
Select a model via buttons (defaults to custom).
Click "Classify" to see prediction, spectrogram, and probability pie chart.
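
To make the "prediction" and "probability" outputs concrete, the sketch below turns a 10-element probability vector into a top label plus a per-class score mapping. The class names follow the UrbanSound8K class ID order (0 to 9). That the model's outputs use this same order, and the helper name `summarize_prediction`, are assumptions for this example rather than details taken from src.interface.

```python
# UrbanSound8K class names in class ID order (0 to 9).
CLASS_NAMES = [
    "air_conditioner", "car_horn", "children_playing", "dog_bark", "drilling",
    "engine_idling", "gun_shot", "jackhammer", "siren", "street_music",
]


def summarize_prediction(probabilities, class_names=CLASS_NAMES):
    """Return (top_label, {label: probability}) for one model output vector.

    Hypothetical helper; assumes index i of the output maps to class_names[i].
    """
    if len(probabilities) != len(class_names):
        raise ValueError("expected one probability per class")
    scores = dict(zip(class_names, probabilities))
    top_label = max(scores, key=scores.get)
    return top_label, scores


# Made-up probabilities for illustration only.
example_probs = [0.02, 0.70, 0.03, 0.10, 0.01, 0.04, 0.01, 0.02, 0.05, 0.02]
label, scores = summarize_prediction(example_probs)
print(label)  # prints "car_horn"
```

A label-to-score mapping like `scores` is also a convenient input for plotting a probability chart.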

Testing

Run unit tests with coverage and reports:

pytest --cov=src --cov-report=html --html=report.html

View htmlcov/index.html for coverage.
View report.html for test results.

Model Card

Model Overview

Name: Hound
Version: 1.0
Description: A convolutional neural network fine-tuned on the UrbanSound8K dataset for classifying 10 urban sound classes (e.g., air_conditioner, car_horn).
Architecture: CNN with mel spectrogram inputs (168x168), trained with data augmentation (noise, pitch shift, time stretch).
Training Data: UrbanSound8K (8,732 labeled sound excerpts, each at most 4 s long, organized into 10 folds).
Performance:
- Accuracy: ~0.85 (custom model; see metrics/metrics_custom_val.txt for details).
- Confusion Matrix and ROC: Available in metrics/custom/.
Limitations: Performs best on short urban clips; may struggle with overlapping sounds or non-urban noise.
Ethical Considerations: Designed for urban monitoring; ensure ethical use in surveillance contexts.
Saved Models:
- Original: model/UrbanSound8K.keras
- Custom: custom_model/custom_UrbanSound8K.keras (and best variant)

For more details, refer to the training script and metrics outputs.
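
For readers unfamiliar with the reported metrics, the sketch below shows how accuracy and a confusion matrix can be derived from true and predicted class IDs. It is a plain-Python illustration, not the project's evaluation code. The function names and the convention of rows for true classes and columns for predicted classes are assumptions.

```python
def confusion_matrix(y_true, y_pred, num_classes=10):
    """Count predictions: matrix[t][p] is how often true class t was predicted as p."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


def accuracy(matrix):
    """Fraction of samples on the diagonal (correct predictions)."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total if total else 0.0


# Made-up labels for a 3-class toy example.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
matrix = confusion_matrix(y_true, y_pred, num_classes=3)
print(matrix)                      # prints "[[1, 1, 0], [0, 2, 0], [1, 0, 1]]"
print(round(accuracy(matrix), 3))  # prints "0.667"
```

Off-diagonal cells show which classes get confused with each other, which a single accuracy figure does not reveal.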

License

MIT License.
