Using Voice Mode

Agent One allows your agents to interact with users through natural voice conversations. With Voice Mode enabled, users can speak to your agent instead of typing, and the agent can respond with spoken audio.

What is Voice Mode?

Voice Mode adds two key capabilities to your agent:

  1. Voice Input (Speech-to-Text): Users can speak their message, and it will be transcribed into text for the agent to understand.
  2. Voice Output (Text-to-Speech): The agent's text responses can be read aloud to the user.

This feature is excellent for accessibility, hands-free usage, and creating more engaging, human-like experiences.

Prerequisites

Voice Mode is a premium feature available on:

  • Basic Plan
  • Standard Plan

If you are on a Free plan, you will need to upgrade to enable this feature.

Enabling Voice Mode

To turn on Voice Mode for your agent:

  1. Navigate to your Agent Dashboard.
  2. Click on the Agent Editor tab.
  3. Scroll down to the bottom of the editor panel.
  4. Click on More settings to expand the advanced configuration options.
  5. Locate the Enable Voice Mode (Input & Output) checkbox.
  6. Toggle it to ON.
  7. Click Save (or use CMD+S) to apply your changes.

Using Voice Features

Once enabled, your chat interface will be updated with voice controls.

Speaking to Your Agent (Voice Input)

  1. Look for the Microphone icon in the message input bar (usually on the right side).
  2. Click the microphone to start recording.
  3. Speak your message clearly.
  4. Click the Stop button (square icon) when you are finished speaking.
  5. Your speech will be automatically transcribed and sent to the agent.

Listening to Responses (Voice Output)

  1. When the agent responds, look for a Speaker/Play icon next to the message bubble.
  2. Click the icon to hear the message read aloud.
  3. Click it again to pause or stop the audio.

Best Practices & Troubleshooting

  • Browser Permissions: The first time a user tries to use the microphone, their browser will ask for permission to access the audio input. They must click "Allow" for voice input to work.
  • Clear Environment: For best transcription results, speak clearly and try to minimize background noise.
  • Supported Browsers: Voice features work on most modern browsers (Chrome, Edge, Safari, Firefox). Ensure your browser is up to date.
  • Mobile Support: Voice Mode is fully compatible with mobile browsers, making it great for on-the-go interactions.

Frequently Asked Questions

Q: Can I change the voice of the agent? A: Currently, the agent uses a standard, high-quality voice. We are working on adding more voice options in future updates.

Q: Does it work in all languages? A: Voice Mode works best with major languages. The transcription and text-to-speech engines will generally detect and support the language being spoken, but performance may vary for less common dialects.