@agentvoiceresponse

Agent Voice Response

Agent Voice Response is an advanced application that uses artificial intelligence to function as a virtual agent.

Agent Voice Response (AVR)


Agent Voice Response (AVR), agentvoiceresponse.com, is an advanced IVR solution that integrates with AI and provides a voicebot interface through Asterisk's AudioSocket application. This architecture lets AI-powered conversational agents replace traditional IVR systems.


Overview

AVR Core manages real-time voice communication between customers and an Asterisk AudioSocket application, and interacts with various AI services:

  1. ASR (Automatic Speech Recognition): Transcribes the incoming audio stream from the customer into text.
  2. LLM (Large Language Model): Interprets the text and generates an appropriate response.
  3. TTS (Text-to-Speech): Converts the generated text response back into speech, which is then played to the customer.
  4. STT (Speech-to-Text): Provides accurate transcription of spoken language into text, supporting multiple languages and dialects.
  5. STS (Speech-to-Speech): Enables direct voice-to-voice communication with AI agents, creating natural and fluid conversations.
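
The audio that reaches AVR Core arrives over Asterisk's AudioSocket connection. As a rough sketch of what that framing looks like, the snippet below splits a byte buffer into AudioSocket messages. It assumes the commonly documented layout (a 1-byte message kind, a 2-byte big-endian payload length, then the payload) and the kind values shown in the constants; check both against your Asterisk version. The names `iter_frames` and `build_frame` are hypothetical helpers, not part of AVR.

```python
import struct
from typing import Iterator, Tuple

# Message kinds as commonly documented for AudioSocket (assumed here;
# verify against the Asterisk version you run).
KIND_HANGUP = 0x00  # terminate the connection
KIND_UUID = 0x01    # payload is the 16-byte call UUID
KIND_AUDIO = 0x10   # payload is signed linear 16-bit, 8 kHz, mono PCM
KIND_ERROR = 0xFF   # an error occurred

# 1-byte kind followed by a 2-byte big-endian payload length.
HEADER = struct.Struct(">BH")


def build_frame(kind: int, payload: bytes = b"") -> bytes:
    """Serialize one frame: header followed by the payload."""
    return HEADER.pack(kind, len(payload)) + payload


def iter_frames(buffer: bytes) -> Iterator[Tuple[int, bytes]]:
    """Yield (kind, payload) for every complete frame found in buffer."""
    offset = 0
    while offset + HEADER.size <= len(buffer):
        kind, length = HEADER.unpack_from(buffer, offset)
        end = offset + HEADER.size + length
        if end > len(buffer):
            break  # incomplete frame: the caller should wait for more bytes
        yield kind, buffer[offset + HEADER.size:end]
        offset = end
```

A real receiver would keep the unparsed tail of the buffer and prepend it to the next read; that bookkeeping is left out here to keep the sketch short.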

AVR Core is designed to be flexible: you can integrate any ASR, LLM, and TTS services, which it reaches via HTTP API streams. This modularity allows you to develop your own middleware between AVR Core and the services of your choice. Recent versions add two further integrations: STT (Speech-to-Text), to support providers that don't yet offer ASR capabilities, and STS (Speech-to-Speech), which connects directly to Conversational AI services like OpenAI Realtime and bypasses the need for separate ASR, LLM, and TTS components.
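
To make the two integration styles concrete, here is a minimal sketch of how a configuration could be classified as either a single speech-to-speech service or the three-stage ASR, LLM, and TTS chain. The variable names `ASR_URL`, `LLM_URL`, and `TTS_URL` appear in this document; `STS_URL` and the function `select_mode` are assumptions made for illustration, and the STT variant is left out.

```python
from typing import Mapping

# The three services that make up the classic chain.
PIPELINE_KEYS = ("ASR_URL", "LLM_URL", "TTS_URL")


def select_mode(env: Mapping[str, str]) -> str:
    """Return 'sts' or 'pipeline' depending on which URLs are configured."""
    # STS_URL is an assumed variable name for the speech-to-speech service.
    if env.get("STS_URL"):
        return "sts"
    missing = [key for key in PIPELINE_KEYS if not env.get(key)]
    if missing:
        raise ValueError("missing configuration: " + ", ".join(missing))
    return "pipeline"
```

Passing `os.environ` as `env` would apply the same check to the process environment.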

Example Flow with ASR, LLM and TTS:

  • Asterisk sends the audio stream to the AVR Core.
  • AVR Core forwards the audio to an ASR service for transcription (e.g., ASR_URL=http://localhost:6001/speech-to-text-stream).
  • Once transcription is received, AVR Core sends the text to an LLM service (e.g., LLM_URL=http://localhost:6005/prompt-stream).
  • The LLM generates a response, which is sent to a TTS service for voice synthesis (e.g., TTS_URL=http://localhost:6003/text-to-speech-stream).
  • The synthesized voice is played back to the customer via Asterisk.
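
The flow above can be sketched as three stages chained together. This is only an illustration of the ordering, not AVR Core's implementation: the real services are reached over HTTP streams, while here each stage is a plain Python callable so the example runs on its own. `handle_turn` and the three `fake_*` functions are hypothetical stand-ins.

```python
from typing import Callable, Iterable, Iterator

Transcriber = Callable[[Iterable[bytes]], str]
Responder = Callable[[str], str]
Synthesizer = Callable[[str], Iterator[bytes]]


def handle_turn(
    audio_chunks: Iterable[bytes],
    transcribe: Transcriber,
    respond: Responder,
    synthesize: Synthesizer,
) -> Iterator[bytes]:
    """Run one caller turn: audio in, transcript, reply text, audio out."""
    transcript = transcribe(audio_chunks)  # ASR stage
    reply = respond(transcript)            # LLM stage
    yield from synthesize(reply)           # TTS stage, streamed chunk by chunk


# Stand-ins so the sketch runs without any external service.
def fake_asr(chunks: Iterable[bytes]) -> str:
    return f"{sum(len(chunk) for chunk in chunks)} bytes heard"


def fake_llm(text: str) -> str:
    return f"You said: {text}"


def fake_tts(text: str) -> Iterator[bytes]:
    data = text.encode("utf-8")
    for start in range(0, len(data), 8):
        yield data[start:start + 8]
```

Keeping each stage behind a callable mirrors the plug-and-play idea: swapping a provider means swapping one function, and the chain itself stays unchanged.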

Features

  • Plug-and-play architecture: Easily swap out different ASR, LLM, and TTS services.
  • Real-time voice-to-text and text-to-voice streaming: Handles customer interactions seamlessly via HTTP API streams.
  • Scalable design: Integrate your own AI services using custom middleware.
  • Multi-language support: Handle conversations in multiple languages.
  • Customizable voice personalities: Configure different voice characteristics for your AI agents.
  • Detailed analytics: Monitor and analyze call metrics and performance.
  • Secure communication: End-to-end encryption for all voice and data streams.

For a list of available integrations, check Agent Voice Response Integrations.

Prerequisites

Before installing AVR, ensure you have the following components:

  • Docker and Docker Compose
  • An Asterisk server with the AudioSocket module enabled
  • Access to, and credentials for, your ASR, LLM, and TTS services

Installation

  1. Clone the AVR Infrastructure
    Clone the avr-infra repository from GitHub:

    git clone https://github.com/agentvoiceresponse/avr-infra.git
    cd avr-infra
    
  2. Follow the Instructions in the README
    Inside the cloned repository, follow the setup and configuration steps described in the README.md file to launch your AVR agent with the desired ASR, LLM, and TTS providers.

Troubleshooting

If you encounter issues during installation or usage:

  1. Connection Issues:

    • Ensure all services are running with docker-compose ps
    • Check logs for specific errors: docker-compose logs avr-core
    • Verify network connectivity between services
  2. Audio Quality Issues:

    • Verify the audio codec settings in Asterisk
    • Check the ASR service compatibility with your audio format
    • Ensure proper audio device configuration
  3. Performance Issues:

    • Consider scaling resources for components handling high traffic
    • Optimize LLM prompts for faster response times
    • Monitor system resource usage
  4. Common Solutions:
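
For the network connectivity check under Connection Issues, a small probe can confirm whether a configured service URL accepts TCP connections at all. The sketch below is an assumption-laden helper, not an AVR tool: `endpoint` and `is_reachable` are hypothetical names, and a successful connection only shows that something is listening on the port, not that the service behind it is healthy.

```python
import socket
from typing import Tuple
from urllib.parse import urlsplit


def endpoint(url: str) -> Tuple[str, int]:
    """Extract (host, port) from a URL, defaulting the port by scheme."""
    parts = urlsplit(url)
    port = parts.port or (443 if parts.scheme == "https" else 80)
    return parts.hostname or "", port


def is_reachable(url: str, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to the URL's host and port succeeds."""
    host, port = endpoint(url)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `is_reachable("http://localhost:6001/speech-to-text-stream")` would test the ASR address used earlier in this document. When the services run in containers, run the probe from a place that shares their network, since `localhost` inside a container is not the host machine.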

Community

Join our growing community of developers and users to share ideas, get help, and collaborate on projects:

License

Copyright (c) 2024 Agent Voice Response

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Pinned

  1. avr-infra (Public)

    The AVR Infrastructure project is designed to launch the Agent Voice Response application, which will start the Core, ASR, LLM, and TTS services integrated with Asterisk Audiosocket.

  2. avr-asr-google-cloud-speech (Public, JavaScript)

    This repository provides a real-time speech-to-text transcription service using Google Cloud Speech-to-Text API integrated with the Agent Voice Response system.

  3. avr-llm-openai (Public, JavaScript)

  4. avr-tts-google-speech-tts (Public, JavaScript)

    This project demonstrates the integration of Agent Voice Response with Google Cloud Text-to-Speech (TTS).

  5. avr-llm-openai-assistant (Public, JavaScript)

    This project integrates Agent Voice Response with OpenAI Assistant, enabling the application to handle dynamic conversations in real-time using OpenAI's API

  6. avr-llm-typebot (Public, JavaScript)

Repositories

Showing 10 of 28 repositories
  • avr-docs Public

    Updated Aug 12, 2025
  • avr-infra Public

    The AVR Infrastructure project is designed to launch the Agent Voice Response application, which will start the Core, ASR, LLM, and TTS services integrated with Asterisk Audiosocket.

    Updated Aug 12, 2025
  • avr-sts-deepgram Public

    This repository showcases the integration between Agent Voice Response and Deepgram's Speech-to-Speech API. The application leverages Deepgram's powerful speech processing capabilities to provide intelligent, context-aware responses in real-time audio format.

    JavaScript, MIT, Updated Aug 11, 2025
  • avr-sts-ultravox Public

    This repository showcases the integration between Agent Voice Response and Ultravox's Real-time Speech-to-Speech API. The application leverages Ultravox's powerful language model to process audio input from users, providing intelligent, context-aware responses in real-time audio format.

    JavaScript, MIT, Updated Aug 8, 2025
  • avr-sts-openai Public

    This repository showcases the integration between Agent Voice Response and OpenAI's Real-time Speech-to-Speech API

    JavaScript, MIT, Updated Aug 6, 2025
  • avr-resampler Public

    A TypeScript npm package for audio resampling between different sample rates, specifically designed for AVR STS or TTS service integration.

    TypeScript, MIT, Updated Aug 6, 2025
  • avr-ami Public

    JavaScript, MIT, Updated Jul 11, 2025
  • avr-tts-coquitts Public

    This project demonstrates the integration of Agent Voice Response with CoquiTTS Text-to-Speech (TTS).

    JavaScript, MIT, Updated Jun 27, 2025
  • avr-llm-openai Public

    JavaScript, MIT, Updated Jun 11, 2025
  • avr-vad Public

    A Node.js library for Voice Activity Detection using the Silero VAD model.

    TypeScript, MIT, Updated Jun 11, 2025
