Skip to main content

Conversations AI Bots: Respond to Audio

Conversation AI bots can transcribe, understand, and respond to voice messages automatically, enabling natural, hands-free customer interactions across supported messaging channels.

Updated over 2 months ago

Who This Is For / When to Use

This feature is for teams using Conversation AI bots where customers frequently send voice notes instead of typing.

Use this when:

  • Customers prefer speaking over typing

  • You support WhatsApp, Instagram, or Messenger conversations

  • You want AI to handle audio-based booking, inquiries, or updates

What Audio Response Does

When audio responses are enabled, the AI bot can:

  • Transcribe incoming voice messages into text

  • Understand the intent and context of the audio

  • Trigger actions such as appointment booking or contact updates

  • Respond naturally using the bot’s existing prompt and knowledge base

No additional training or configuration is required beyond enabling the setting.

Supported Audio Sources

Voice Notes from Platforms

  • WhatsApp

  • Facebook Messenger

  • Instagram DMs

Supported Audio Formats

  • OGG

  • MP3

  • MP4 (audio)

  • AAC

  • M4A

  • MPEG

How to Enable Audio Responses

Step 1: Enable Voice Notes in Bot Settings

Open your Conversation AI bot and go to Bot Settings.

Under Auto-pilot mode, enable Also allow this bot to respond to: Voice Notes.

Once enabled, the bot can process and respond to incoming audio messages.

Step 2: Test Audio Conversations

Send a voice note to a conversation handled by the AI bot (for example, asking to book an appointment).

The bot will transcribe the audio, understand the request, and respond accordingly.

How Audio Processing Works

  • Audio is converted to text automatically

  • The bot evaluates intent using the same logic as text messages

  • Actions (such as booking or contact updates) are executed if applicable

  • The response is generated and logged in AI Response Info

This allows audio messages to fully participate in workflows and automation.

Common Issues and Fixes

The bot does not respond to voice notes

  • Confirm Voice Notes is enabled in Bot Settings

  • Ensure the bot is running in Auto-pilot mode

  • Verify the channel supports voice messaging

The transcription is inaccurate

  • Ask the contact to speak clearly and avoid background noise

  • Ensure the audio file is not truncated or corrupted

Frequently Asked Questions

Can the AI respond to audio files sent as attachments?

Yes. Any supported audio format (such as MP3 or M4A) sent as an attachment is transcribed and processed.

What if multiple audio clips are sent together?

All audio clips are transcribed and analyzed together. The bot responds using the combined context.

Do I need separate training for audio messages?

No. Audio is transcribed into text and handled using your existing bot prompt and knowledge base.

Does the AI respond in real time?

Yes. Audio processing is optimized for fast transcription and response with minimal delay.

How do I enable audio responses?

Go to Conversations AI → Bot Settings and enable Also allow this bot to respond to: Voice Notes.

Did this answer your question?