<aside>

An AI-powered voice assistant for healthcare front-desk operations, built with OpenAI GPT-4o, Whisper, and ElevenLabs.

๐Ÿ† Project Highlights


๐ŸŽฏ Overview

This system simulates a real healthcare clinic receptionist that can:

The assistant uses:


๐Ÿ“ Project Structure

healthcare-voice-assistant/
โ”œโ”€โ”€ src/
โ”‚   โ”œโ”€โ”€ conversation_engine.py    # LLM orchestration & function calling
โ”‚   โ”œโ”€โ”€ voice_handler.py          # STT/TTS integration
โ”‚   โ”œโ”€โ”€ appointment_service.py    # Appointment scheduling logic
โ”‚   โ””โ”€โ”€ insurance_service.py      # Insurance verification logic
โ”œโ”€โ”€ data/
โ”‚   โ”œโ”€โ”€ appointments.json         # Mock appointment calendar
โ”‚   โ”œโ”€โ”€ insurance_providers.json  # Accepted insurance providers
โ”‚   โ””โ”€โ”€ clinic_info.json          # Clinic information
โ”œโ”€โ”€ demos/                        # Complete demo recordings (3 MP3s)
โ”œโ”€โ”€ recordings/                   # Individual audio clips from testing
โ”œโ”€โ”€ main.py                       # Application entry point
โ”œโ”€โ”€ requirements.txt              # Python dependencies
โ”œโ”€โ”€ .env.example                  # Environment variables template
โ”œโ”€โ”€ README.md                     # This file
โ””โ”€โ”€ SYSTEM_DESIGN.md              # Architecture documentation


๐Ÿš€ Setup Instructions

1. Prerequisites

2. Installation

# Clone or extract the project
cd healthcare-voice-assistant

# Create virtual environment (recommended)
python -m venv venv
source venv/bin/activate  # On Windows: venv\\Scripts\\activate

# Install dependencies
pip install -r requirements.txt

3. Configure API Keys

# Copy the example env file
cp .env.example .env

# Edit .env and add your API keys
OPENAI_API_KEY=your_openai_api_key_here
ELEVENLABS_API_KEY=your_elevenlabs_api_key_here
ELEVENLABS_VOICE_ID=21m00Tcm4TlvDq8ikWAM  # Optional: Rachel voice (default)

Getting API Keys:


๐ŸŽฎ Usage

Run the Application

python main.py

Available Modes

1. Text Modeย - Interactive chat (no voice)

2. Voice Simulation - Appointment Scheduling

3. Voice Simulation - Insurance Verification

4. Voice Simulation - No Available Slot

5. Process Audio File



๐Ÿ”Š Generated Recordings During Testing

When running voice simulations, individual audio clips are saved in theย recordings/ย folder:

These files show the step-by-step conversation flow.


๐Ÿง  How It Works

Architecture

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚   User      โ”‚
โ”‚   Audio     โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”˜
       โ”‚
       โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Speech-to-Text โ”‚ (OpenAI Whisper)
โ”‚  (STT)          โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
       โ”‚
       โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Conversation Engine        โ”‚
โ”‚  - GPT-4o for dialogue      โ”‚
โ”‚  - Function calling for:    โ”‚
โ”‚    * check_available_slots  โ”‚
โ”‚    * book_appointment       โ”‚
โ”‚    * verify_insurance       โ”‚
โ”‚    * get_clinic_info        โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
       โ”‚
       โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Text-to-Speech โ”‚ (ElevenLabs)
โ”‚  (TTS)          โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
       โ”‚
       โ–ผ
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  Audio      โ”‚
โ”‚  Response   โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Key Components

1. Conversation Engineย (conversation_engine.py)

2. Voice Handlerย (voice_handler.py)

3. Appointment Serviceย (appointment_service.py)

4. Insurance Serviceย (insurance_service.py)

๐Ÿ”‘ Key Technical Decisions

Why GPT-4o over GPT-3.5?

Why ElevenLabs over alternatives?

Architecture Choices:

๐Ÿ“Š Project Stats

ChatGPT Image Nov 4, 2025, 12_51_54 AM.png

View GitHub Repository:

https://github.com/AlvLeoAI/healthcare-ai-voice-assistant