This Bangla TTS was trained on a mono (male) speaker using the ViT-TTS model, described in the paper "ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer". We used the coqui-ai 🐸 toolkit for both Bangla text-to-speech training and inference.
N.B.: This pipeline is intended only for inference and endpoint API testing.
Create the environment
conda create -n bn_tts python==3.8
conda activate bn_tts
Install the required modules
pip install -r requirements.txt
The Bangla speech corpus was prepared by the Indic TTS team of IIT Madras. I downsampled the dataset to 22050 Hz and converted the raw IITM annotation format into the LJSpeech format in order to train several TTS models for Bangla. In this dataset, I am sharing the final processed Bangla TTS dataset along with the best trained model weight files. Please cite this paper: https://aclanthology.org/2020.lrec-1.789.pdf if you use the dataset in your research.
Dataset link: https://www.kaggle.com/datasets/mobassir/comprehensive-bangla-tts
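The IITM-to-LJSpeech conversion mentioned above can be sketched roughly as below. The helper and the example pair are hypothetical (the raw IITM annotation layout is not shown here); what is fixed is the LJSpeech metadata.csv target format: one pipe-separated line per clip, file_id|transcription|normalized_transcription.

```python
def to_ljspeech_rows(pairs):
    """Convert (wav_id, transcript) pairs into LJSpeech-style metadata lines.

    LJSpeech metadata.csv format, one pipe-separated line per clip:
        file_id|transcription|normalized_transcription
    Here the raw transcript doubles as the normalized transcription.
    """
    return ["|".join([wav_id, text, text]) for wav_id, text in pairs]


# Hypothetical example pair; real IITM annotations would be parsed first.
rows = to_ljspeech_rows([("bn_0001", "আপনি কেমন আছেন।")])
print(rows[0])
```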
Training code: Jupyter notebook
For a single test run,
python inference.py
or
Inference on a Jupyter notebook
For API testing,
python app.py
Write a .py script with the code below and run it; the audio .wav file will be saved into the logs directory:
import os
import time

import requests

username = "saiful"
text = "আপনি কেমন আছেন।"  # "How are you?"

log_dir = "logs"
filename = "audio_file_" + time.strftime("%Y%m%d-%H%M%S") + ".wav"
os.makedirs(log_dir, exist_ok=True)
file_dir = os.path.join(log_dir, filename)

# Use the host/IP and port of the machine where the API (app.py) is running
url = 'http://192.168.1.154:8009/tts'

payload = {
    "text": text,
    "sender": username,
    "save_dir": file_dir
}
headers = {'content-type': 'application/json'}

result = requests.post(url, json=payload, headers=headers)
print(result)
