Spiko is a real-time video audio translation browser extension designed to break language barriers in online learning. It captures video audio, translates it into the user's preferred language, and provides live subtitles and voice translation. Spiko enhances accessibility and engagement for global audiences, making online education more inclusive.
You can watch the Live demo of the project here
- Problem Statement
- Proposed Solution
- Features
- Technical Implementation
- Installation
- Usage
- Team
- Conclusion (#Conclusion)
Millions of people watch online videos daily, but language barriers limit access to content. Subtitles are often unavailable, inaccurate, or difficult to follow. Existing solutions lack real-time voice translation, making content less engaging for non-native speakers.
- Limited access to global educational, entertainment, and informational videos.
- Reduced engagement for users who struggle with foreign languages.
- Missed opportunities for creators to reach a wider audience.
Spiko is a browser extension that:
- Captures video audio from any website.
- Uses AI-powered Speech-to-Text, Translation, and Text-to-Speech.
- Provides live voice translation in the user’s preferred language and subtitles.
- Enhances accessibility and engagement for global audiences.
- Real-Time Voice Translation: Translates video audio in real-time (only server wait time).
- Multiple Language Support: Supports a wide range of languages.
- Real-Time Captions: Displays subtitles in the selected language.
- Play/Pause Functionality: Pauses translation when the video is paused.
- Multi-Platform Support: Works on various platforms.
Spiko is built using the following technologies:
-
Capture Video Audio:
- The Web Audio API extracts audio from online videos and sends it to the backend.
-
Speech-to-Text:
- The Flask server receives the audio and uses OpenAI’s Whisper AI to convert speech to text.
-
Text Translation:
- The detected text is translated into the user’s preferred language using the Google Translate API.
-
Text-to-Speech:
- The translated text is converted into speech using gTTS and sent back to the frontend.
-
Play Translated Audio:
- The frontend (built with React) receives the translated speech and plays it in sync with the video.
To install and run Spiko locally, follow these steps:
-
Clone the Repository:
git clone https://github.com/your-username/spiko.git cd spiko -
Install Dependencies Install the required Dependencies
-
Run the Python Server
python server.py
-
Run the WebSite in your local system
-
Uplaod the video file and load the extension tab
-
Click on Translate button and wait for some time
- Open any video on a supported platform.
- Click the Spiko extension icon.
- Select your preferred language.
- Enjoy real-time translation and subtitles!
- Veer Jain
- Vijay Kant
- Ritik Raj
- Vikram Singh Gangwar
Spiko eliminates language barriers, making online education accessible to everyone, everywhere. By combining AI-driven speech-to-text, translation, and accessibility tools, we provide a seamless learning experience for millions of users.