Sound Wave 🎵

A real-time audio graphic equalizer with modern interface and high-performance fullscreen visualization.

🇮🇹 Italiano: Leggi questo README in italiano

📋 Overview

Sound Wave is a desktop application that captures real-time audio from system input devices and visualizes it as a dynamic graphic equalizer. The application offers both an integrated preview mode and a high-performance fullscreen mode for multiple monitors.

✨ Key Features

🎤 Multi-Device Audio Capture: Supports all available audio devices on the system
📊 Real-Time Visualization: Equalizer with colored bars (green, yellow, red)
🖥️ Fullscreen Mode: High-performance visualization with multi-monitor support
⚙️ Advanced Configuration:
- Adjustable noise threshold
- Dynamic frequency bands (1-100)
- Customizable background transparency
- Configurable parallel processing
🎨 Visual Customization:
- Support for custom background images
- Light/dark/automatic themes
- Background alpha control
⚡ Performance Optimizations:
- Multi-core parallel FFT computation
- OpenGL rendering for fullscreen mode
- Adaptive smoothing based on monitor refresh rate

🖼️ Screenshots

Main Interface

The main interface with controls on the left and equalizer preview on the right.

Fullscreen Mode

Immersive full-screen visualization with high-performance OpenGL rendering.

🛠️ Installation

System Requirements

Python: 3.8 or higher
Operating System: Windows, macOS, Linux
Hardware: Audio card with compatible drivers

Dependencies

pip install -r requirements.txt

Main dependencies include:

customtkinter - Modern user interface
soundcard - Cross-platform audio capture
numpy & scipy - Numerical processing and FFT
PyOpenGL - High-performance graphics rendering
Pillow - Image handling
glfw - OpenGL window management

Quick Start

Clone the repository:

git clone https://github.com/yourusername/sound-wave.git
cd sound-wave

Install dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
python main.py
```

🚀 Usage

Initial Setup

Select an Audio Device:
- Double-click on a device from the list on the left
- The "Start/Pause" button will turn green when ready
Start Visualization:
- Click the "Start/Pause" button to begin
- The button will turn red during playback

Controls and Settings

🎛️ Settings Panel

Appearance Mode: Select light, dark, or automatic theme
Background Image Path: Load a custom background image
Alpha Amount: Control background transparency (0-1)
Noise Threshold: Set minimum signal threshold for visualization
Frequency Bands: Number of equalizer bars (1-100)
Number of Processors: Configure parallel processing

🖥️ Fullscreen Mode

Click the "Fullscreen" button to open immersive visualization
Use ESC to exit fullscreen mode
On multi-monitor systems, select preferred display from settings

⌨️ Keyboard Shortcuts

ESC: Exit fullscreen mode
Space: Start/Stop audio capture (when application has focus)

🔧 Advanced Configuration

Performance Optimization

Parallel Processing

The application supports parallel FFT processing to improve performance:

Auto: Automatic calculation based on number of bands
Manual: Manually set number of workers (1-5)

Hardware Recommendations

Number of Bands	Recommended Workers	Notes
1-15	1	Single-thread processing
16-30	2	Optimal CPU balance
31-50	3	Intensive CPU usage
51+	4-5	Fullscreen mode recommended

Background Images

Supported formats: PNG, JPG, JPEG, GIF

For best results:

Minimum resolution: 1920x1080
Preferred format: PNG with alpha channel
Avoid overly detailed images that might interfere with visualization

📁 Project Structure

sound-wave/
├── main.py                          # Application entry point
├── requirements.txt                 # Python dependencies
├── resource_manager.py              # Resource management (images, assets)
│
├── gui/                            # User interface
│   ├── main_window_gui.py          # Main window
│   ├── help_window.py              # Help window
│   ├── slider_frame.py             # Custom slider component
│   ├── optionmenu_frame.py         # Options menu component
│   ├── background_filepicker_frame.py # Image file picker
│   └── processes_number_frame.py   # Processor count control
│
├── thread/                         # Processing threads
│   ├── audioCaptureThread.py       # Background audio capture
│   ├── equalizer_tkinter_thread.py # Equalizer rendering (Tkinter)
│   ├── opengl_thread.py           # Fullscreen rendering (OpenGL)
│   └── AudioBufferAccumulator.py  # Audio buffer with overlap
│
└── resources/                      # Application assets
    ├── bg/                        # Background images
    └── icons/                     # Interface icons

🔬 Technical Details

Audio Processing

Sample Rate: 44.1 kHz
Channels: Stereo (2 channels)
FFT Size: 4096 samples (fullscreen mode) / 8192 (preview mode)
Overlap: 50% between successive windows
Window Function: Blackman (to reduce spectral leakage)

Visualization Algorithms

Frequency Distribution

Frequency bands are distributed on a logarithmic scale from 20 Hz to 20 kHz:

def generate_frequency_bands(self, num_bands: int) -> list[tuple[float, float]]:
    """
    Generate frequency bands on logarithmic scale.
    
    Args:
        num_bands (int): Desired number of bands
        
    Returns:
        list[tuple[float, float]]: List of (freq_min, freq_max) tuples
    """

Adaptive Smoothing

The system implements adaptive temporal smoothing that automatically adjusts based on monitor refresh rate:

60 Hz: Base factor 0.25
120 Hz: Base factor 0.30
144+ Hz: Base factor 0.35

Graphics Rendering

Tkinter Mode (Preview)

Canvas-based rendering
Frame rate: 120 FPS
Colors: Green (0-50%), Yellow (50-80%), Red (80-100%)

OpenGL Mode (Fullscreen)

Hardware-accelerated rendering
VSync enabled for monitor synchronization
Circular buffers for optimal performance
Texture support for custom backgrounds

🐛 Troubleshooting

Common Issues

Application doesn't detect audio devices

Solution:

Verify audio drivers are updated
Check microphone permissions (macOS/Linux)
Restart application as administrator (Windows)

Poor performance with many bands

Solutions:

Reduce number of frequency bands
Use fullscreen mode for better performance
Increase number of workers in settings
Close other CPU-intensive applications

Fullscreen mode won't start

Checks:

Verify graphics drivers are updated
Check system OpenGL support
On Linux, ensure OpenGL libraries are installed

Background image won't load

Solutions:

Verify image format is supported (PNG, JPG, JPEG, GIF)
Check file read permissions
Use absolute paths instead of relative

Logging and Debug

To enable detailed logging, modify variables in thread/opengl_thread.py:

GENERAL_LOG = True   # General logs
FFT_LOGS = True      # FFT processing logs  
PERF_LOGS = True     # Performance logs

🤝 Contributing

Contributions are welcome! To contribute:

Fork the repository
Create a branch for your feature (git checkout -b feature/new-feature)
Commit your changes (git commit -am 'Add new feature')
Push to the branch (git push origin feature/new-feature)
Open a Pull Request

Contribution Guidelines

Follow existing code style
Add docstrings for new functions/classes
Test changes on multiple platforms when possible
Update documentation if necessary

📄 License

This project is released under the MIT License. See the LICENSE file for complete details.

🙏 Acknowledgments

CustomTkinter - For the modern user interface
SoundCard - For cross-platform audio capture
PyOpenGL - For high-performance rendering
GLFW - For OpenGL window management

Developed with ❤️

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
gui		gui
resources		resources
thread		thread
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_IT.md		README_IT.md
glfw3.dll		glfw3.dll
main.py		main.py
py-to-exe-settings.json		py-to-exe-settings.json
requirements.txt		requirements.txt
resource_manager.py		resource_manager.py

Folders and files

Latest commit

History

Repository files navigation

Sound Wave 🎵

📋 Overview

✨ Key Features

🖼️ Screenshots

Main Interface

Fullscreen Mode

🛠️ Installation

System Requirements

Dependencies

Quick Start

🚀 Usage

Initial Setup

Controls and Settings

🎛️ Settings Panel

🖥️ Fullscreen Mode

⌨️ Keyboard Shortcuts

🔧 Advanced Configuration

Performance Optimization

Parallel Processing

Hardware Recommendations

Background Images

📁 Project Structure

🔬 Technical Details

Audio Processing

Visualization Algorithms

Frequency Distribution

Adaptive Smoothing

Graphics Rendering

Tkinter Mode (Preview)

OpenGL Mode (Fullscreen)

🐛 Troubleshooting

Common Issues

Application doesn't detect audio devices

Poor performance with many bands

Fullscreen mode won't start

Background image won't load

Logging and Debug

🤝 Contributing

Contribution Guidelines

📄 License

🙏 Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages