Skip to content

Conversation

@4-alok
Copy link

@4-alok 4-alok commented Nov 11, 2025

🚀 Multilingual TTS Support Enhancement

This PR adds comprehensive multilingual text-to-speech capabilities to the Chatterbox TTS Server, enabling support for 23 languages including Hindi, Arabic, Chinese, French, German, and more.

✨ Key Features Added

🔄 Multilingual Model Integration

  • Upgraded to ChatterboxMultilingualTTS from GitHub repository
  • Supports 23 languages: Arabic (ar), Danish (da), German (de), Greek (el), English (en), Spanish (es), Finnish (fi), French (fr), Hebrew (he), Hindi (hi), Italian (it), Japanese (ja), Korean (ko), Malay (ms), Dutch (nl), Norwegian (no), Polish (pl), Portuguese (pt), Russian (ru), Swedish (sv), Swahili (sw), Turkish (tr), Chinese (zh)

🍎 Apple Silicon Compatibility

  • Fixed MPS device loading issues for Apple Silicon Macs
  • Resolved attention implementation conflicts with SDPA on MPS devices
  • Added proper torch.load map_location handling for non-CUDA devices

🎛️ Enhanced User Interface

  • Added model selection dropdown (Multilingual vs English-only)
  • Integrated language selector with all 23 supported languages
  • Added status badges showing current model type
  • Improved UI state persistence and configuration management

🔧 Technical Improvements

  • Updated server API endpoints (/tts, /v1/audio/speech) to handle language_id parameter
  • Added graceful fallback for missing multilingual support
  • Enhanced configuration system with multilingual settings
  • Fixed various compatibility issues with newer PyTorch versions

📚 Documentation & Setup

  • Added comprehensive documentation for multilingual setup
  • Updated configuration examples and troubleshooting guides
  • Included migration instructions for existing installations

🔧 Configuration Changes

The PR introduces new configuration options:

model:
  use_multilingual: true  # Enable multilingual model
  
generation_defaults:
  language: en  # Default language (English)

- Upgrade to ChatterboxMultilingualTTS from GitHub (supports 23 languages)
- Add MPS device compatibility fixes for Apple Silicon Macs
- Fix attention implementation issues with SDPA on MPS devices
- Update config to enable multilingual support with English as default language
- Enhance UI with model selection and language controls
- Add graceful fallback for missing multilingual support
- Update server API to handle language_id parameter
- Fix torch.load map_location for non-CUDA devices
- Add comprehensive documentation for multilingual setup

Now working with multilingual model on Mac, supporting 23 languages including Hindi, Arabic, Chinese, French, German, etc. Default language set to English while maintaining full multilingual capabilities.
@hadessalvacion2025
Copy link

perfect.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants