Voice Input

Aori can transcribe your voice messages and treat them as text input, allowing you to interact with your agent hands-free.

  • In-app Chat: Hold the microphone button in the Aori chat screen.
  • Messaging Channels: Send a voice message through Telegram or WhatsApp, and Aori will transcribe it automatically.

Aori sends the audio to a speech-to-text (STT) backend for transcription. In the in-app chat, the resulting text appears in the input field so you can review and edit it before sending.

To use voice input, you must configure a speech-to-text provider in Settings > Connections > Voice > STT Backend.

  • Groq Whisper (Default): Fast, high quality, and free. Uses your existing Groq API key.
  • OpenAI Whisper: An alternative using your OpenAI API key.
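
As a rough illustration of what selecting a backend amounts to, the sketch below builds a transcription request for either provider using only the Python standard library. The endpoint URLs and model names come from the providers' public HTTP APIs and are assumptions about what Aori uses; the names `STT_BACKENDS`, `build_transcription_request`, and `transcribe` are hypothetical and are not part of Aori.

```python
import json
import urllib.request
import uuid

# Assumed endpoints and models, taken from the providers' public APIs.
# How Aori calls these backends internally is not documented here.
STT_BACKENDS = {
    "groq": {
        "url": "https://api.groq.com/openai/v1/audio/transcriptions",
        "model": "whisper-large-v3",
    },
    "openai": {
        "url": "https://api.openai.com/v1/audio/transcriptions",
        "model": "whisper-1",
    },
}


def build_transcription_request(backend, api_key, audio, filename="voice.ogg"):
    """Build (but do not send) a multipart POST for the chosen STT backend."""
    cfg = STT_BACKENDS[backend]  # raises KeyError for an unknown backend
    boundary = uuid.uuid4().hex
    # Multipart body: a "model" text field followed by the audio "file" field.
    head = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{cfg['model']}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return urllib.request.Request(
        cfg["url"],
        data=head + audio + tail,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
        method="POST",
    )


def transcribe(backend, api_key, audio):
    """Send the audio and return the transcript (needs network and a valid key)."""
    req = build_transcription_request(backend, api_key, audio)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["text"]
```

Under these assumptions, switching backends changes only the URL, the model name, and which API key is sent; the audio payload and the shape of the response stay the same.
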

To use voice input in the in-app chat:

  1. Open the Chat screen in the Aori app.
  2. Hold the microphone button next to the text input.
  3. Speak your request, then release the button to send the recording for transcription.
  4. Review the transcribed text, edit it if needed, and tap Send to submit it to Aori.

If you have connected a messaging channel such as Telegram or WhatsApp, send a voice message to your bot or linked session. Aori will transcribe it and respond as if you had typed the message.

  • Microphone Not Showing: Ensure that Aori has permission to access your device’s microphone in your OS settings.
  • Poor Quality: If transcription is inaccurate, try switching your STT backend in Settings > Connections > Voice.