Speech-To-Text

A simple open source Speech-To-Text app. Powered by Comtegra GPU Cloud and Whisper.

whisper-large-v3: 5 minutes of audio → 35 seconds of processing
whisper-large-v3-turbo: 5 minutes of audio → 8 seconds of processing

Made with ❤️ by Comtegra S.A.

Tech stack

Streamlit - for the app frontend
Python - for the app backend
Comtegra GPU Cloud (CGC) - for the best cloud compute
Whisper-large-v3, Whisper-large-v3-turbo - for the speech to text models

Demo

Possible future improvements

Improve how timestamps work
Integrate additional/newer STT models
Implement real-time transcription
Option to transcribe only a part of the audio
Add voice recording option

The sky's the limit! With CGC (Comtegra GPU Cloud), we're only limited by our imagination.

Support

Visit our usecase to see how to run the app and how it works
Visit our documentation for more detailed information
For questions or support, please contact: ai@comtegra.pl

License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

About Comtegra S.A.

Comtegra is an IT systems integrator based in Poland, specializing in various aspects of information technology, including data storage and management, information security, and network construction. Founded in 1999, Comtegra has established itself as a significant player in the Polish IT market, providing services such as backup solutions, cybersecurity, and virtualization technologies. The company emphasizes the integration of artificial intelligence within business operations to enhance data management and decision-making processes.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
media		media
LICENSE.md		LICENSE.md
README.md		README.md
softstack.ipynb		softstack.ipynb
speech-to-text.py		speech-to-text.py
usecase.md		usecase.md
usecase_pl.md		usecase_pl.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Speech-To-Text

Tech stack

Demo

Possible future improvements

Support

License

About Comtegra S.A.

About

Uh oh!

Releases

Packages

Languages

License

Comtegra/Speech-To-Text

Folders and files

Latest commit

History

Repository files navigation

Speech-To-Text

Tech stack

Demo

Possible future improvements

Support

License

About Comtegra S.A.

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages