A Multi-Agent AI System for Banking Call Transcript Behavioural Analysis
CallSenseAI is a multi-agent conversational analytics system designed to automatically analyze customer service call transcripts in banking environments. A team of specialized AI agents transcribes, analyzes, and reports on call data, identifying customer intent, sentiment, and root-cause issues to support Quality Assurance teams, surface operational inefficiencies, and detect emerging trends.
The system is built with a modern Streamlit user interface, allowing users to easily upload audio files, view comprehensive analysis reports, and engage in a follow-up chat to ask deeper questions about the results.
- Automated Transcription & Diarization: Uses OpenAI's gpt-4o-transcribe-diarize to generate accurate, speaker-separated transcripts from audio files. An offline alternative using Whisper and pyannote is also available.
- In-Depth Call Analysis: A parallel team of agents works together to identify:
- Intent Recognition: The primary reason for the customer's call.
- Sentiment Analysis: The emotional tone and satisfaction level for each minute of the call.
- Root Cause Identification: The fundamental issue or problem driving the conversation.
- Comprehensive Reporting: An agent synthesizes all analysis into a single, well-structured summary report.
- Interactive Chat: A chat interface allows users to ask follow-up questions based on the generated report.
- Persistent Sessions: All analyses are saved and can be revisited later, including the full report and chat history.
- Modern Web UI: A clean, intuitive, and beautiful user interface built with Streamlit.
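The parallel analysis and synthesis described above can be sketched roughly as follows. The agent functions here are simplified stand-ins with illustrative names and outputs (the real system wires LLM-backed sub-agents together through the Google ADK manager agent), so treat this as a sketch of the orchestration pattern rather than the actual implementation:

```python
from concurrent.futures import ThreadPoolExecutor

# Stand-in analysis agents: in the real system these are LLM-backed
# sub-agents; here they are stubs so the pattern is runnable.
def intent_agent(transcript: str) -> str:
    return "billing dispute"  # illustrative output

def sentiment_agent(transcript: str) -> str:
    return "frustrated, improving toward the end"

def root_cause_agent(transcript: str) -> str:
    return "duplicate charge on statement"

def synthesizer_agent(results: dict) -> str:
    # Combines the parallel findings into a single summary report.
    return "\n".join(f"{name}: {finding}" for name, finding in sorted(results.items()))

def analyze_call(transcript: str) -> str:
    agents = {
        "intent": intent_agent,
        "sentiment": sentiment_agent,
        "root_cause": root_cause_agent,
    }
    # Run the three analysis agents in parallel, then synthesize their output.
    with ThreadPoolExecutor() as pool:
        futures = {name: pool.submit(fn, transcript) for name, fn in agents.items()}
        results = {name: fut.result() for name, fut in futures.items()}
    return synthesizer_agent(results)

print(analyze_call("Customer: I was charged twice this month..."))
```

The key design point mirrored here is that intent, sentiment, and root-cause analysis are independent of each other and can run concurrently, with the synthesizer acting as the single join point.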
- Backend: Python 3.13
- Frontend: Streamlit
- AI Orchestration: Google Agent Development Kit (ADK)
- AI Models:
- OpenAI GPT-4o (for transcription and diarization)
- Google Gemini Flash (for analysis and synthesis)
- Gemma 3 27B (for lightweight tasks)
- Core Libraries:
  - google-adk
  - streamlit
  - openai
  - pyannote
  - open-whisper
  - google-generativeai
  - python-dotenv
Follow these instructions to set up and run the project on your local machine.
- Python 3.10+
- An environment with access to pip.
- Clone the repository:

  ```bash
  git clone <your-repository-url>
  cd SAGE
  ```

- Create and activate a virtual environment:

  ```bash
  python -m venv env
  source env/bin/activate
  ```

- Install the required dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Configure your environment variables: create a file named `.env` in the root directory of the project and add your API keys:

  ```
  OPENAI_API_KEY="your_openai_api_key"
  GOOGLE_API_KEY="your_google_api_key"
  HF_TOKEN="your_hugging_face_api_key"
  ```
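The project lists python-dotenv in its stack for loading these keys at startup. As a rough illustration of what that load does, here is a stdlib-only sketch; it is a simplification (the real library also handles `export` prefixes, quoting edge cases, and variable interpolation), and the function name is ours:

```python
import os

def load_env_file(path: str = ".env") -> None:
    """Minimal stand-in for python-dotenv's load_dotenv():
    reads KEY="value" lines from a file into os.environ."""
    with open(path, encoding="utf-8") as f:
        for raw in f:
            line = raw.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue  # skip blank lines, comments, malformed entries
            key, _, value = line.partition("=")
            # Existing environment variables win, matching load_dotenv's default.
            os.environ.setdefault(key.strip(), value.strip().strip('"'))
```

After the load, code elsewhere can read the keys as usual, e.g. `os.environ["OPENAI_API_KEY"]`.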
- Launch the Streamlit application:

  ```bash
  streamlit run sage/ui.py
  ```

- Your web browser should open with the SAGE home page.
- To start a new analysis:
  - Use the file uploader to select an audio file (`.wav` only).
  - Click the "Analyze File" button.
- To revisit a past analysis:
  - Find the session in the "Previous Wisdom" section.
  - Click the "View Analysis" button.
Alternatively, you can run the application inside a Docker container for better portability and dependency management.
- Build the Docker image: from the root of the project directory, run:

  ```bash
  docker build -t sage-app .
  ```

- Run the Docker container: the following command starts the container, maps the necessary port, passes your API keys from the `.env` file, and mounts local directories for data persistence:

  ```bash
  docker run -p 8501:8501 --env-file .env \
    -v ./my_agent_data.db:/app/my_agent_data.db \
    -v ./sage/uploaded_audio:/app/sage/uploaded_audio \
    sage-app
  ```

- Access the application: open your web browser and navigate to `http://localhost:8501`.
```
/SAGE
├── sage/
│   ├── app.py               # Main Streamlit application UI
│   ├── main.py              # Original CLI application entry point
│   ├── utils.py             # CLI utility functions (logging, colors)
│   ├── manager_agent/       # Contains the main manager agent
│   │   └── agent.py
│   ├── sub_agents/          # Contains all specialized agents
│   │   ├── audio_to_transcript_agent/
│   │   ├── intent_agent/
│   │   ├── root_cause_agent/
│   │   ├── sentiment_agent/
│   │   └── synthesizer_agent/
│   └── uploaded_audio/      # Default directory for uploaded files
├── .env                     # Local environment variables (API keys)
├── requirements.txt         # Project dependencies
└── README.md                # This file
```
