Offline TTS API Library

The voice format is <TTS System>:<Voice Name>[#<Speaker ID>. The Speaker ID is optional for multi speakers model only.

The following TTS Engines used:

If your input text begins with a left angle bracket (<) character, it will be interpreted as SSML.

SSML

A subset of SSML is supported:

<speak> - wrap around SSML text
- lang - set language for document
<s> - sentence (disables automatic sentence breaking)
- lang - set language for sentence
<w> / <token> - word (disables automatic tokenization)
<voice name="..."> - set voice of inner text
- voice - name or language of voice
  - Name format is tts:voice (e.g., "glow-speak:en-us_mary_ann") or tts:voice#speaker_id (e.g., "coqui-tts:en_vctk#p228")
  - If one of the supported languages, a preferred voice is used (override with --preferred-voice <lang> <voice>)
<say-as interpret-as=""> - force interpretation of inner text
- interpret-as one of "spell-out", "date", "number", "time", or "currency"
- format - way to format text depending on interpret-as
  - number - one of "cardinal", "ordinal", "digits", "year"
  - date - string with "d" (cardinal day), "o" (ordinal day), "m" (month), or "y" (year)
<break time=""> - Pause for given amount of time
- time - seconds ("123s") or milliseconds ("123ms")
<sub alias=""> - substitute alias for inner text

eg,

<speak>
  <s lang="zh">欢迎使用离线语音合成</s>
  <s lang="en-us">Welcome to Offline Speech Synthesis.</s>
</speak>

Coqui-TTS
ESpeaker
Main Inspired by OpenTTS.
- Great Thanks. Without OpenTTS there would be no Offline TTS.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
tts		tts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
LICENSE.txt		LICENSE.txt
README.md		README.md
VERSION		VERSION
__init__.py		__init__.py
api.py		api.py
logger.py		logger.py
pyproject.toml		pyproject.toml
swagger.yaml		swagger.yaml
to_wav.py		to_wav.py
utils.py		utils.py