<aside>

Goal:

Build a reusable AI voice assistant that receives voice messages on Telegram, transcribes them, generates a response with an LLM, and replies with natural-sounding audio from ElevenLabs.

Problem:

Most Telegram bots are text-only and lack voice interaction. There's no easy plug-and-play AI voice assistant that supports transcribing, reasoning with context, and replying with human-like speech.

Solution:

A modular and reusable workflow using n8n, OpenRouter, and ElevenLabs with two entry points, a Telegram flow (Steps 1–6) and a webhook flow (Steps 7–10):

🔹 Step 1 – Trigger: Telegram Bot Trigger listens for new voice/audio messages.

🔹 Step 2 – Download the file from Telegram using "Get file".

🔹 Step 3 – Transcribe the audio to text using a speech-to-text node.

🔹 Step 4 – Send the transcription to an AI Agent connected to an OpenRouter chat model (e.g., GPT-4 or Claude). The AI Agent handles the natural-language reasoning and generates a text response.

🔹 Step 5 – Convert the text response to speech using ElevenLabs.

🔹 Step 6 – Send the voice message back to the user.

🔹 Step 7 – Trigger: Webhook (POST) receives user input (e.g., from an ElevenLabs Voice Agent).

🔹 Step 8 – The message is passed to Perplexity.

🔹 Step 9 – The response is routed to an AI Agent that can use memory, context, or tools if needed.

🔹 Step 10 – The final AI-generated response is sent back via Respond to Webhook.
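
For readers who want to see the Telegram flow (Steps 1–6) outside of n8n, here is a minimal Python sketch of the same chain. It is not the workflow's actual implementation: `VoicePipeline` and its field names are hypothetical, and each service (file download, speech-to-text, the OpenRouter model, ElevenLabs, sending the reply) is passed in as a plain callable, because the page does not say which providers or settings each node uses. The update fields (`message.voice.file_id`, `message.chat.id`) and the file download URL follow the Telegram Bot API.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class VoicePipeline:
    """Steps 2-6 as injectable callables, so each service can be swapped (hypothetical names)."""
    download: Callable[[str], bytes]          # Step 2: Telegram file_id -> audio bytes
    transcribe: Callable[[bytes], str]        # Step 3: audio bytes -> transcript
    chat: Callable[[str], str]                # Step 4: transcript -> LLM reply text
    synthesize: Callable[[str], bytes]        # Step 5: reply text -> audio bytes
    send_voice: Callable[[int, bytes], None]  # Step 6: (chat_id, audio) -> Telegram

    def handle(self, update: dict) -> Optional[str]:
        """Step 1: react only to updates that carry a voice or audio message."""
        message = update.get("message", {})
        media = message.get("voice") or message.get("audio")
        if not media:
            return None  # text-only update: nothing to do in this flow
        audio_in = self.download(media["file_id"])
        transcript = self.transcribe(audio_in)
        reply = self.chat(transcript)
        self.send_voice(message["chat"]["id"], self.synthesize(reply))
        return reply


def telegram_file_url(token: str, file_path: str) -> str:
    """Download URL for a file_path returned by the Bot API getFile method."""
    return f"https://api.telegram.org/file/bot{token}/{file_path}"
```

With stand-in callables, the chain can be exercised without any network access. In a real deployment each callable would wrap the corresponding HTTP request.

```python
pipeline = VoicePipeline(
    download=lambda file_id: b"fake-audio",
    transcribe=lambda audio: "what is n8n?",
    chat=lambda text: f"You asked: {text}",
    synthesize=lambda text: text.encode(),
    send_voice=lambda chat_id, audio: None,
)
pipeline.handle({"message": {"chat": {"id": 1}, "voice": {"file_id": "abc"}}})
```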

Tools Used:

Impact:

🧠 Learnings:

</aside>
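
The webhook flow (Steps 7–10) can be sketched the same way. This is an illustration under stated assumptions, not the exported workflow: the JSON field names `message`, `session_id`, and `reply`, the function names, and the 10-turn memory window are all hypothetical. The Perplexity lookup and the AI Agent are passed in as callables, since the page does not describe how either node is configured.

```python
import json
from collections import defaultdict, deque
from typing import Callable, Deque, Dict, List, Tuple

Turn = Tuple[str, str]               # (user message, agent reply)
Memory = Dict[str, Deque[Turn]]      # session id -> rolling history


def new_memory(window: int = 10) -> Memory:
    """Per-session rolling memory; the window size is an arbitrary assumption."""
    return defaultdict(lambda: deque(maxlen=window))


def handle_webhook(
    raw_body: bytes,
    search: Callable[[str], str],
    agent: Callable[[str, str, List[Turn]], str],
    memory: Memory,
) -> Tuple[int, bytes]:
    """Return (HTTP status, JSON body) for one POST to the webhook."""
    # Step 7: parse the incoming POST body.
    try:
        payload = json.loads(raw_body)
        message = payload["message"]
    except (ValueError, KeyError, TypeError):
        error = {"error": "expected JSON with a 'message' field"}
        return 400, json.dumps(error).encode()
    session = str(payload.get("session_id", "default"))

    findings = search(message)                  # Step 8: Perplexity lookup
    history = list(memory[session])
    reply = agent(message, findings, history)   # Step 9: agent with memory
    memory[session].append((message, reply))

    # Step 10: the equivalent of n8n's Respond to Webhook node.
    return 200, json.dumps({"reply": reply}).encode()
```

Keeping the handler a pure function of its inputs makes it easy to mount behind any HTTP server and to test without calling external services.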

Screenshot 2025-08-04 091525.png

📂 Download & Explore the Workflow:

Voice-AI-Assistant-with-Telegram-ElevenLabs---n8n-/AI Agent Into a Voice Assistant.json at main · AlvLeoAI/Voice-AI-Assistant-with-Telegram-ElevenLabs---n8n-