Skip to content

Nayanpawar03/Multilingual-AI-Voice-Assistant

Repository files navigation

Multilingual Assistant

This Streamlit-based application allows you to interact with a multilingual AI assistant powered by Google Gemini Pro. You can communicate with the assistant through voice commands (using speech recognition) and receive responses in both text and spoken form.

Features

  • Voice Input: Speak naturally to the assistant using your microphone.
  • Text Generation: The assistant leverages Google Gemini Pro to generate intelligent and contextually relevant responses.
  • Text-to-Speech: Hear the assistant's responses in a natural voice.
  • Download Speech: Download the assistant's responses as MP3 audio files.

Requirements

  • Python 3.7 or higher
  • A Google Cloud API key with access to the Gemini Pro model
  • The following Python libraries (specified in requirements.txt):
    • SpeechRecognition
    • pipwin
    • pyaudio
    • gTTS
    • google-generativeai
    • python-dotenv
    • streamlit

How to run?

STEPS:

Clone the repository

Project repo: https://github.com/

STEP 01- Create a conda environment after opening the repository

conda create -n llmapp python=3.11 -y
conda activate llmapp

STEP 02- install the requirements

pip install -r requirements.txt
# Finally run the following command
streamlit run app.py

Add your GOOGLE_CLOUD_API_KEY credentials in the Streamlit app. Get your GOOGLE_CLOUD_API_KEY here:

GOOGLE_CLOUD_API_KEY = "xxxxxxxxxxxxxxxxxxxxxxxxxxxxx"

Techstack Used:

  • Python
  • Google API
  • Streamlit
  • Gemini-pro
  • s2t
  • t2s

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •