How Tourist Translator works
Learn how Tourist Translator enables real-time two-way voice conversations across 64+ languages — ideal for travel.
Written By Umakhan Magomedov
Last updated About 13 hours ago
Tourist Translator is a real-time two-way voice interpreter. Tap to speak in your language and hear the translation instantly in the other person's language, with no typing required.
When to use
Communicate with locals while traveling: hotels, restaurants, markets, transport
Have a quick back-and-forth conversation with someone who speaks a different language
Understand announcements, directions or instructions abroad
Help in any situation where typing is inconvenient or slow
How to start
Open Tourist Translator from the Tools tab.
Select a language for each side at the top of each panel: your language on one side, the other person's language on the other.
Tap the Start button on your side and speak. The app recognizes your speech in real time and streams the translation to the opposite panel.
Hand the phone to the other person (or turn it around). They tap Start on their side and speak.
If Autoplay is on, the translation is read aloud automatically after each phrase.
ℹ️ The app recognizes speech as you speak — the transcript and translation appear progressively, so you see the result before you finish talking.
What you get
Recognized text displayed in your language panel as you speak
Translated text streamed to the opposite panel in real time
Voice playback of the translation via ElevenLabs (if Autoplay is on or you tap the listen button)
Session history: all exchanges in the current conversation are saved and grouped by session
Settings
Tap the settings icon in the top right to adjust the tool's options.
Supported languages
Tourist Translator supports 64+ languages, including Arabic, Chinese, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Malay, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swedish, Thai, Turkish, Ukrainian, Vietnamese, and more.
How much it costs
Each exchange uses tokens for speech recognition, text translation, and (if voice playback is used) audio generation. The default model (Gemini 2.5 Flash Lite) keeps costs low for frequent use. For full pricing details, see Token pricing for each tool.
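To make the cost structure concrete, here is a minimal sketch of how the token use of one exchange adds up from its three parts. The names `ExchangeUsage`, `exchange_total`, and `session_total` are hypothetical and not part of the app, and the token counts in the usage note are placeholders rather than real prices.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class ExchangeUsage:
    """Token use of a single exchange (illustrative model, not the app's own)."""
    recognition_tokens: int   # speech recognition
    translation_tokens: int   # text translation
    audio_tokens: int = 0     # audio generation; stays 0 if voice playback is not used


def exchange_total(usage: ExchangeUsage) -> int:
    """Sum the three cost components of one exchange."""
    return usage.recognition_tokens + usage.translation_tokens + usage.audio_tokens


def session_total(exchanges: Iterable[ExchangeUsage]) -> int:
    """Sum the totals of every exchange in a session."""
    return sum(exchange_total(u) for u in exchanges)
```

For example, with placeholder numbers, `session_total([ExchangeUsage(10, 20), ExchangeUsage(5, 5, 40)])` adds one silent exchange and one exchange with voice playback. The sketch only shows that playback adds a third component on top of recognition and translation.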