Gemini API Integration

Ayumi uses the Google Gemini API for voice-to-text transcription. You bring your own API key (BYOK), so you have full control over costs and usage.

Getting a Gemini API Key

The API key is stored securely in your device’s Keychain.

Model	Description
gemini-2.5-flash	Fast and capable, good balance of speed and quality
gemini-2.5-flash-lite	Lighter model, faster processing
gemini-3-flash-preview	Latest preview model
Custom	Enter any Gemini model ID
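
As a rough illustration of how a model choice (including a Custom ID) could map to a Gemini API endpoint, here is a minimal Python sketch. The helper names `resolve_model_id` and `generate_content_url` are hypothetical and are not part of Ayumi. The URL pattern follows the public Gemini REST API (`v1beta`, `generateContent`). How Ayumi builds its requests internally is not documented here.

```python
from urllib.parse import quote

# Public REST base for the Gemini API (v1beta).
GEMINI_BASE_URL = "https://generativelanguage.googleapis.com/v1beta"

# Model IDs listed in the table above.
BUILT_IN_MODELS = (
    "gemini-2.5-flash",
    "gemini-2.5-flash-lite",
    "gemini-3-flash-preview",
)


def resolve_model_id(selection: str, custom_id: str = "") -> str:
    """Return the model ID for a picker selection (hypothetical helper)."""
    if selection == "Custom":
        model_id = custom_id.strip()
        if not model_id:
            raise ValueError("A custom selection needs a non-empty model ID")
        return model_id
    if selection not in BUILT_IN_MODELS:
        raise ValueError(f"Unknown model selection: {selection!r}")
    return selection


def generate_content_url(model_id: str) -> str:
    """Build the generateContent endpoint URL for a model ID."""
    # Percent-encode the ID so a mistyped custom value cannot alter the path.
    return f"{GEMINI_BASE_URL}/models/{quote(model_id, safe='')}:generateContent"


print(generate_content_url(resolve_model_id("gemini-2.5-flash")))
```

An unrecognized custom ID would still produce a well-formed URL in this sketch. Whether the model exists is only known once Google responds to the request.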

You can create custom prompts to control how audio is transcribed and analyzed.

Custom presets appear in the recording view alongside the built-in options.
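
To make the idea of a custom prompt more concrete, the sketch below shows one way a preset could be paired with a recording in a Gemini `generateContent` request body. The `PromptPreset` class, the preset text, and the placeholder audio bytes are assumptions for illustration only, not Ayumi's actual data model. The `contents` / `parts` / `inline_data` layout follows the public Gemini REST API.

```python
import base64
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class PromptPreset:
    """A named prompt (hypothetical structure, not Ayumi's actual model)."""
    name: str
    prompt: str


def build_request_body(preset: PromptPreset, audio: bytes, mime_type: str) -> dict:
    """Pair the preset's prompt with inline, base64-encoded audio."""
    return {
        "contents": [
            {
                "parts": [
                    {"text": preset.prompt},
                    {
                        "inline_data": {
                            "mime_type": mime_type,
                            "data": base64.b64encode(audio).decode("ascii"),
                        }
                    },
                ]
            }
        ]
    }


# Example preset; the wording is made up for this sketch.
meeting_notes = PromptPreset(
    name="Meeting notes",
    prompt="Transcribe this recording, then list the action items.",
)

# Placeholder bytes stand in for a real audio file.
body = build_request_body(meeting_notes, b"\x00\x01fake-audio", "audio/wav")
print(json.dumps(body, indent=2))
```

In this sketch, changing the preset changes only the text part of the request. The audio part stays the same, which is why one recording can be transcribed or analyzed in different ways.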

When using the Gemini API:

- Audio data is sent to Google’s servers for processing.
- Ayumi shows a consent dialog before the first API call.
- No data is stored on Ayumi’s servers; the API call goes directly from your device to Google.
- See Google’s AI terms for details on how Google handles API data.
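
The points above can be sketched as a consent check followed by a request addressed straight to Google's API host, authenticated with your own key. This is a minimal illustration under stated assumptions: `prepare_request`, `ConsentRequired`, and the `has_consented` flag are hypothetical names, and Ayumi's real implementation may differ. The `x-goog-api-key` header and the endpoint follow the public Gemini REST API.

```python
import json
import urllib.request

# The only host involved: Google's Gemini API endpoint.
GEMINI_HOST = "generativelanguage.googleapis.com"


class ConsentRequired(Exception):
    """Raised when a request is attempted before the user has consented."""


def prepare_request(model_id: str, body: dict, api_key: str,
                    has_consented: bool) -> urllib.request.Request:
    """Build (but do not send) a generateContent request addressed to Google."""
    if not has_consented:
        # An app would show its consent dialog here instead of proceeding.
        raise ConsentRequired("User consent is needed before the first API call")
    return urllib.request.Request(
        url=f"https://{GEMINI_HOST}/v1beta/models/{model_id}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "x-goog-api-key": api_key,  # the user's own key (BYOK)
        },
        method="POST",
    )


request = prepare_request(
    "gemini-2.5-flash",
    {"contents": [{"parts": [{"text": "Transcribe this audio."}]}]},
    api_key="YOUR_API_KEY",  # placeholder, not a real key
    has_consented=True,
)
print(request.get_method(), request.full_url)
# Sending it would be: urllib.request.urlopen(request)
```

The request is only constructed, never sent, so the sketch runs without a real key or network access. No intermediate server appears in the URL, which reflects the device-to-Google path described above.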