Dr. Stéphane DEDIEU DrStef

Machine Learning - Deep Learning Projects

Advanced Projects in DSP & ML/DL

This repository showcases cutting-edge R&D projects in Digital Signal Processing (DSP) and Machine/Deep Learning (ML/DL), focusing on noise reduction and custom signal transforms (e.g., modified CWT/STFT) for detecting anomalies in acoustic and vibration signals. Drawing on expertise in optimization, calculus, and linear algebra, these developments enable real-time feature extraction for applications like environmental sound analysis and classification. They also support industrial sound monitoring, with potential extensions to rotating machinery failure detection (bearings, motors, rotors), HVAC fault detection and diagnosis (pumps, compressors, valves), and drilling telemetry monitoring.

A collection of high-performance pipelines for early fault detection in rotating equipment, validated on the NASA Prognostics Dataset (20 kHz, 984 frames, Bearing 1 failure ~frame 530–540).

What we actually achieve

Classical time-domain indicators (RMS, kurtosis, crest factor) only react around frames 520–540.
Our methods detect the very first fault signatures as early as frames 450–470 — a consistent pre-alarm 60–90 frames (~1–1.5 minutes) ahead, with robust and repeatable results.
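For reference, the classical indicators mentioned above can be computed per frame as in the minimal sketch below. The function name `time_domain_indicators` and the synthetic test signal are illustrative assumptions, not code from the repository.

```python
import numpy as np

def time_domain_indicators(frame):
    """Return (RMS, kurtosis, crest factor) of one vibration frame."""
    x = np.asarray(frame, dtype=float)
    rms = np.sqrt(np.mean(x ** 2))
    centered = x - x.mean()
    # Pearson kurtosis: fourth central moment over squared variance
    kurtosis = np.mean(centered ** 4) / np.mean(centered ** 2) ** 2
    # Crest factor: peak amplitude relative to RMS
    crest_factor = np.max(np.abs(x)) / rms
    return rms, kurtosis, crest_factor

# Synthetic 1-second frame at 20 kHz (illustrative data only)
fs = 20_000
t = np.arange(fs) / fs
healthy = np.sin(2 * np.pi * 50 * t)
faulty = healthy.copy()
faulty[::2000] += 8.0  # sparse impacts as a crude stand-in for a bearing defect

print("healthy:", time_domain_indicators(healthy))
print("faulty: ", time_domain_indicators(faulty))
```

Kurtosis and crest factor rise sharply once impulsive content appears, which is why they respond to developed faults but tend to miss weak, early signatures buried in noise.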

Part I – Initial Research
Time-series analysis, spectral insights (FFT), Wiener denoising, LSTM on STD/kurtosis sequences.

Part II – CNN Autoencoder with bTSTFT
Precomputed custom bTSTFT transforms (magnitude + phase) on Wiener-denoised super-frames (5-frame sliding window) + CNN autoencoder. Reconstruction error (MSE) + CUSUM deliver consistent early detection below frame 450, significantly outperforming traditional metrics.
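The CUSUM step on the reconstruction error can be sketched as follows. This is a generic one-sided CUSUM under stated assumptions: the function name `cusum_alarm`, the baseline length, and the slack `k` and threshold `h` are hypothetical choices, not the values used in the repository, and the MSE series is synthetic.

```python
import numpy as np

def cusum_alarm(errors, n_baseline=100, k=1.0, h=8.0):
    """One-sided CUSUM on a reconstruction-error series.

    Returns the index of the first frame where the cumulative sum exceeds h,
    or None if no alarm is raised. k (slack) and h (threshold) are expressed
    in units of the baseline standard deviation.
    """
    errors = np.asarray(errors, dtype=float)
    mu = errors[:n_baseline].mean()
    sigma = errors[:n_baseline].std() + 1e-12  # guard against a flat baseline
    s = 0.0
    for i in range(n_baseline, errors.size):
        z = (errors[i] - mu) / sigma
        # Accumulate only deviations above the slack; never go below zero
        s = max(0.0, s + z - k)
        if s > h:
            return i
    return None

# Synthetic MSE series: flat noise floor, then a slow upward drift from frame 450
rng = np.random.default_rng(0)
mse = 1.0 + 0.05 * rng.standard_normal(600)
mse[450:] += 0.01 * np.arange(150)
print("alarm frame:", cusum_alarm(mse))
```

Because CUSUM integrates small, persistent deviations, it can flag a slow drift in MSE well before any single frame crosses a fixed threshold.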

Note: The pipeline requires significant compute resources (no real-time edge deployment; runs comfortably on modern laptops/servers with GPU acceleration).

All code and precomputed data (.npy tensors) are open-source and reproducible. For details on the custom bTSTFT method, contact me.

Applications

Rotating machinery (bearings, gearboxes, motors, pumps, rotors)
Oil & Gas drilling telemetry (mud motors, top-drive, drill-string vibration)
Wind-turbine drivetrains
Any system where catching a fault days instead of minutes ahead saves millions


Figures: Magnitude FFT at frame 540 (early degradation), raw vs. Wiener-denoised; bTSTFT transform (magnitude + phase) at frame 510; CUSUM on test MSE (alarm triggered at frame 448).

https://zenodo.org/records/3384388

Applications

Rotating machinery failure detection: bearings, motors, rotors.
HVAC fault detection and diagnosis (FDD): pumps, compressors, valves.


Figures: Novel ACSTFT transform (top: 3x normal, bottom: 3x default); ROC-AUC = 0.99, valve type id_04; Reconstruction error (MSE), valve type id_04; MVDR beamforming beampattern at 1000 Hz; Denoised valve sound signals with VAD decision.

🎙️⚙️ This repository implements a Hybrid Denoising Framework designed to isolate signals from extreme non-stationary backgrounds.
It explores the synergy between Deep Learning (Residual U-Net) and advanced Spectral Analysis.

🚀 Key Innovations & Dual-Domain Strategy

The project is built on a specialized dual-path approach, selecting the optimal transform based on the nature of the noise:

STFT Path (Optimized for Speech & Communication)

Target: Vocal restoration in high-noise environments (Helicopters, Marine, Field recordings).
Residual U-Net (v06d): 5-level deep architecture with 512 filters at the bottleneck.
Complex Masking: Predicts Real/Imaginary masks (clamped at $K=5.0$) to maintain phase coherence and restore critical formants (1 kHz to 3 kHz).
Hybrid Injection: A 15% blend of the original mixture to restore natural "warmth".
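The masking and injection steps above can be illustrated with a minimal NumPy sketch. The function names `apply_complex_mask` and `hybrid_injection` are hypothetical, and the 85/15 weighting is an assumption about how the 15% blend is applied; the network that predicts the masks is not shown.

```python
import numpy as np

K = 5.0       # mask clamp from the description above
BLEND = 0.15  # share of the original mixture re-injected (assumed weighting)

def apply_complex_mask(mix_spec, mask_real, mask_imag, k=K):
    """Clamp real/imaginary masks to [-k, k] and apply them to a complex spectrogram."""
    mask = np.clip(mask_real, -k, k) + 1j * np.clip(mask_imag, -k, k)
    # Complex multiplication rescales magnitude and rotates phase together
    return mask * mix_spec

def hybrid_injection(enhanced, mixture, blend=BLEND):
    """Blend a fraction of the original mixture back into the enhanced signal."""
    return (1.0 - blend) * enhanced + blend * mixture

# Toy example with two time-frequency bins (illustrative values only)
mix = np.array([1 + 1j, 2 - 1j])
est = apply_complex_mask(mix, np.array([10.0, 0.5]), np.array([0.0, -7.0]))
out = hybrid_injection(est, mix)
print(est, out)
```

Clamping bounds the gain any single bin can receive, which limits the damage from outlier mask predictions, while the blend trades a little residual noise for a more natural timbre.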

CWT Path: A Paradigm Shift in Wavelet Denoising

Beyond Thresholding: Unlike traditional wavelet denoising methods based on hard/soft thresholding (which often lose phase information and introduce artifacts), this project implements Complex Masking directly in the CWT domain.
Phase-Preserving Reconstruction: By predicting Real/Imaginary masks on the CWT coefficients, we maintain the integrity of the signal's wavefront, a critical factor for high-fidelity industrial diagnostics.
Why CWT? Unlike the STFT, the Continuous Wavelet Transform provides superior time resolution for high-frequency transients, which suits Machine Health Monitoring and impulsive fault detection.
Target (Industrial Impulses): While our benchmarks are validated on speech for intelligibility metrics, the CWT-UNet path is specifically engineered for Industrial Noise & Transient Impacts (shocks, clicks, mechanical fatigue signatures).
Computational Trade-off: The CWT path is computationally intensive. Its high time-frequency resolution makes it the "surgical" option for high-stakes mechanical diagnostics, where precision outweighs compute cost and STFT-based models fail to capture the "sharpness" of mechanical impacts.

Applications

Telecommunications: Enhanced front-end for VoIP and radio systems in industrial or outdoor settings.
Forensic Audio: Voice extraction from surveillance or emergency recordings with non-stationary interference.
Aviation & Marine: Cockpit and deck communication recovery in high-noise environments.
Industrial Monitoring: Noise and transient-impact analysis (shocks, clicks, mechanical fatigue signatures) via the CWT-UNet path.

Automatic Environmental Sound Classification (ESC) leverages the ESC-50 dataset (and its ESC-10 subset) developed by Karol J. Piczak, as detailed in: Karol J. Piczak. 2015. "ESC: Dataset for Environmental Sound Classification." In Proceedings of the 23rd ACM International Conference on Multimedia (MM '15). Association for Computing Machinery, New York, NY, USA, 1015–1018. https://doi.org/10.1145/2733373.2806390

This dataset serves as a foundation for research in audio event recognition.

Advancements in ESC Using Multi-Feature CNNs:

We propose a two-stage classification approach with multi-feature Convolutional Neural Networks (CNNs) that reaches up to 99% accuracy. This accuracy is attributed to pre-processing that combines mel-spectrograms with complex wavelet transforms (CWT).

Resolution of Remaining Classification Challenges:

A notable challenge in ESC-10 sound classification was the confusion between "sea waves" and "rain" sounds. This issue was addressed by developing an original transformation of the complex CWT, termed aT-CWT. This transformation replaces the phase component of the CWT for stationary and pseudo-stationary sounds with a Gaussian distribution, enhancing the model's ability to differentiate between similar-sounding environmental events.
By integrating the aT-CWT transformation, the multi-feature CNN model has now achieved 100% accuracy in classifying environmental sounds from the ESC-10 dataset.
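The exact aT-CWT definition is not given here, so the sketch below only illustrates the general idea of keeping the magnitude of complex coefficients while substituting Gaussian samples for the phase channel. The function name `replace_phase_with_gaussian`, the mean and standard deviation, and the random stand-in coefficients are all assumptions; how stationary or pseudo-stationary sounds are identified is not shown.

```python
import numpy as np

def replace_phase_with_gaussian(coeffs, rng, mean=0.0, std=1.0):
    """Build a 2-channel feature: true magnitude plus a Gaussian 'phase' channel.

    coeffs: complex array of shape (scales, time), e.g. complex CWT coefficients.
    The mean/std of the Gaussian are placeholder values, not the author's.
    """
    magnitude = np.abs(coeffs)
    synthetic_phase = rng.normal(mean, std, size=coeffs.shape)
    return np.stack([magnitude, synthetic_phase], axis=0)

# Stand-in complex coefficients (random, for shape illustration only)
rng = np.random.default_rng(0)
coeffs = rng.standard_normal((4, 8)) + 1j * rng.standard_normal((4, 8))
features = replace_phase_with_gaussian(coeffs, rng)
print(features.shape)
```

One plausible reading is that the measured phase of noise-like sounds carries little class information, so replacing it removes a source of spurious variation between similar classes.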


Figures: aT-CWT transform, "seawave"; aT-CWT transform, "rain"; Confusion matrix with aT-CWT.

In this project, we are developing effective methods for classifying mitochondrial genomes (DNA sequences) using Digital Signal Processing (DSP), Machine Learning (ML), and Deep Learning (DL). This research is ongoing, and we plan to publish our results regularly. As a starting point, we analyzed the paper titled:
"ML-DSP: Machine Learning with Digital Signal Processing for ultrafast, accurate, and scalable genome classification at all taxonomic levels" by Gurjit S. Randhawa, Kathleen A. Hill and Lila Kari. https://doi.org/10.1186/s12864-019-5571-y

The alignment-free DNA sequence classification approach ML-DSP, proposed by Randhawa et al., has proven to be very effective.
By introducing a simple alignment technique alongside short Fast Fourier Transforms (FFTs), termed ML-FFT + SoftAlign, we have surpassed the performance of ML-DSP, particularly with challenging datasets such as those from Fungi and Insects.
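As background, the alignment-free idea behind ML-DSP can be sketched in a few lines: map nucleotides to numbers, take the FFT magnitude spectrum, and compare spectra with a Pearson-correlation distance. This is a simplified illustration, not the ML-FFT + SoftAlign method, which is not described here. The purine/pyrimidine mapping, zero-padding to a common length, and the function names are assumptions.

```python
import numpy as np

# Purine/pyrimidine mapping (one of several possible numerical representations)
PP_MAP = {"A": -1.0, "G": -1.0, "C": 1.0, "T": 1.0}

def magnitude_spectrum(seq, length):
    """Encode a DNA string numerically, fit it to `length`, return |FFT|."""
    signal = np.array([PP_MAP.get(base, 0.0) for base in seq.upper()[:length]])
    signal = np.pad(signal, (0, length - signal.size))  # zero-pad short sequences
    return np.abs(np.fft.fft(signal))

def pearson_distance(spec_a, spec_b):
    """Dissimilarity in [0, 1] derived from the Pearson correlation coefficient."""
    r = np.corrcoef(spec_a, spec_b)[0, 1]
    return (1.0 - r) / 2.0

# Toy sequences (illustrative only, far shorter than real genomes)
seq_a = "ACGTACGTACGTACGT"
seq_b = "AAAACCCCGGGGTTTT"
n = 16
spec_a = magnitude_spectrum(seq_a, n)
spec_b = magnitude_spectrum(seq_b, n)
print(pearson_distance(spec_a, spec_a), pearson_distance(spec_a, spec_b))
```

A pairwise distance matrix built this way can feed a standard classifier. Using magnitudes makes the comparison insensitive to circular shifts of the encoded signal, which is part of what makes the approach alignment-free.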

Standard projects

This section is a portfolio of Machine Learning projects with Python and various visualization and analysis tools. Most of these projects were carried out within the framework of IBM certifications. They are presented with Jupyter Notebooks.
Some projects have been improved with more in-depth data analysis, better graphs, and advanced ML techniques.

Data Analysis


Figures: Word cloud; Folium with markers; Choropleth.

Digital Signal Processing

Modeling and Scientific Computing


Figures: "Figure 8" toroid; Gyroid; Truncated cuboctahedron; Helicoid-Catenoid.

I'm Dr. Stef (@DrStef)

Crafting DSP & ML/DL innovations that uncover hidden signals in noise, grounded in rigorous numerical analysis and real-world impact.

Audio & Signal Processing scientist blending mathematical precision, custom transform design, and ML-driven anomaly detection to transform chaotic data into actionable insights for industrial and environmental applications.

What drives me most in DSP/ML is the thrill of R&D: Distilling cutting-edge theory (NR gains, STFT, wavelets, synchrosqueezing) into tools that solve tough problems, like detecting valve faults or drilling vibrations before they escalate.

⇒ Signal Analysis Rigor - Advanced time-frequency methods (CWT, STFT, Mel-Spectrograms) for non-stationary phenomena
⇒ Noise Reduction & Enhancement - Beamforming, VAD, and adaptive filters achieving >20 dB SNR gains in multi-sensor setups
⇒ Mathematical Foundations - Expertise in calculus, linear algebra, and optimization for feature extraction and model stability
⇒ ML/DL Innovation - CNN, CNN autoencoders and genetic algorithms for unsupervised anomaly detection in acoustics/vibrations
⇒ Industrial Applications - 25+ years in electroacoustics, computational mathematics, DSP, in Telecom & Consumer Electronics industries, Intelligent Building and Safety Industry. | PhD in Mech. Engineering: Numerical Analysis (BEM/FEM) and Optimization.

I develop open-source pipelines that bridge numerical rigor with practical engineering, specializing in custom transforms for anomaly detection in noisy environments. From environmental sound classification to industrial valve diagnostics, my work advances real-time ML for telemetry and beyond.

▶ Mission: Empowering engineers with math-backed DSP tools that turn signal chaos into predictive power, building the future of acoustic, vibration, and signal intelligence one transform at a time.

🔭 I’m currently working on advanced projects in ML & DL
👯 I’m looking to collaborate on Digital Signal Processing, Machine Learning, Deep Learning
📫 How to reach me: stephane.dedieu@bloo-audio.com

Deep Learning and Digital Signal Processing: Voice Activity Detection (VAD)

Machine Learning and Digital Signal Processing: Sound Source Localization (SSL)

Standard projects

Data Analysis

Databases and SQL for Data Science

Stock extraction & visualization - yFinance, Webscraping

Digital Signal Processing

Microphone Array - Beamforming

6 microphones circular array - Generalized Sidelobe Canceller (GSC) - Validation

A few words about complex wavelet transforms

Modeling and Scientific Computing

Integral equation - Boundary Elements Method (BEM)
