This Streamlit-based application allows you to interact with a multilingual AI assistant powered by Google Gemini Pro. You can communicate with the assistant through voice commands (using speech recognition) and receive responses in both text and spoken form.
- Voice Input: Speak naturally to the assistant using your microphone.
- Text Generation: The assistant leverages Google Gemini Pro to generate intelligent and contextually relevant responses.
- Text-to-Speech: Hear the assistant's responses in a natural voice.
- Download Speech: Download the assistant's responses as MP3 audio files.
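The features above form one loop: listen, generate, speak, save. The following is a minimal sketch of that loop, not the app's actual code. The function names (`listen_once`, `ask_gemini`, `speak_to_file`, `run_turn`) are hypothetical, and the choice of `recognize_google` as the transcription backend and `"gemini-pro"` as the model name are assumptions. Third-party imports are kept inside the functions so the module loads even where a library is missing.

```python
from typing import Callable, Tuple


def listen_once() -> str:
    """Capture one utterance from the default microphone and transcribe it."""
    import speech_recognition as sr  # SpeechRecognition; the microphone needs PyAudio

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        audio = recognizer.listen(source)
    # Assumption: Google's web speech backend is used for transcription.
    return recognizer.recognize_google(audio)


def ask_gemini(prompt: str, api_key: str) -> str:
    """Send the transcribed text to Gemini and return the reply text."""
    import google.generativeai as genai

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel("gemini-pro")  # assumed model name
    return model.generate_content(prompt).text


def speak_to_file(text: str, path: str = "response.mp3", lang: str = "en") -> str:
    """Synthesize the reply with gTTS and save it as an MP3 file."""
    from gtts import gTTS

    gTTS(text=text, lang=lang).save(path)
    return path


def run_turn(
    listen: Callable[[], str],
    generate: Callable[[str], str],
    synthesize: Callable[[str], str],
) -> Tuple[str, str, str]:
    """Run one assistant turn: heard text, reply text, path to the audio."""
    heard = listen()
    reply = generate(heard)
    audio_path = synthesize(reply)
    return heard, reply, audio_path
```

With a key in hand, a turn could be run as `run_turn(listen_once, lambda p: ask_gemini(p, api_key), speak_to_file)`. Passing the three steps in as callables keeps the loop easy to try without a microphone or network access. In a Streamlit page, the bytes of the saved MP3 could then be handed to `st.download_button` to offer the download.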
- Python 3.7 or higher
- A Google Cloud API key with access to the Gemini Pro model
- The following Python libraries (specified in requirements.txt):
- SpeechRecognition
- pipwin
- pyaudio
- gTTS
- google-generativeai
- python-dotenv
- streamlit
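Several of the libraries listed above are imported under a different name than the one they are installed under. The sketch below checks whether each one is importable before the app is launched. The helper `missing_modules` is hypothetical, and the mapping reflects the usual import names for these packages. pipwin is left out because it is an installer helper, not something the app imports.

```python
import importlib.util
from typing import Dict, List

# Install name -> import name for the libraries listed in requirements.txt.
REQUIRED: Dict[str, str] = {
    "SpeechRecognition": "speech_recognition",
    "pyaudio": "pyaudio",
    "gTTS": "gtts",
    "google-generativeai": "google.generativeai",
    "python-dotenv": "dotenv",
    "streamlit": "streamlit",
}


def missing_modules(required: Dict[str, str] = REQUIRED) -> List[str]:
    """Return the install names whose import name cannot be found."""
    missing = []
    for install_name, import_name in required.items():
        try:
            found = importlib.util.find_spec(import_name) is not None
        except (ImportError, ValueError):
            # find_spec raises if a parent package (e.g. "google") is absent.
            found = False
        if not found:
            missing.append(install_name)
    return missing


if __name__ == "__main__":
    absent = missing_modules()
    print("Missing:", ", ".join(absent) if absent else "none")
```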
Clone the repository
Project repo: https://github.com/

    conda create -n llmapp python=3.11 -y
    conda activate llmapp
    pip install -r requirements.txt
    # Finally run the following command
    streamlit run app.py

Add your GOOGLE_CLOUD_API_KEY credentials in the Streamlit app. Get your GOOGLE_CLOUD_API_KEY here:

    GOOGLE_CLOUD_API_KEY = "xxxxxxxxxxxxxxxxxxxxxxxxxxxxx"

- Python
- Google API
- Streamlit
- Gemini-pro
- Speech-to-text (s2t)
- Text-to-speech (t2s)
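The GOOGLE_CLOUD_API_KEY setting shown earlier can be read at startup instead of being hard-coded. This is a minimal sketch that assumes the key lives in the environment or in a `.env` file next to the app, which is what python-dotenv in the requirements suggests but the source does not confirm. The helper name `load_api_key` is hypothetical.

```python
import os


def load_api_key(var_name: str = "GOOGLE_CLOUD_API_KEY") -> str:
    """Return the API key from the environment, loading .env if possible."""
    try:
        # python-dotenv copies entries from a .env file into os.environ
        # without overriding variables that are already set.
        from dotenv import load_dotenv
        load_dotenv()
    except ImportError:
        pass  # fall back to the plain environment

    key = os.environ.get(var_name, "").strip()
    if not key:
        raise RuntimeError(f"{var_name} is not set; add it to .env or the environment")
    return key
```

Failing early with a clear message avoids a less obvious authentication error on the first request to the model.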