{"site":{"name":"Koji","description":"AI-native customer research platform that helps teams conduct, analyze, and synthesize customer interviews at scale.","url":"https://www.koji.so","contentTypes":["blog","documentation"],"lastUpdated":"2026-05-18T12:58:19.209Z"},"content":[{"type":"documentation","id":"003ebeaf-01f4-45e2-9673-8d471b5cc6b4","slug":"voice-interview-experience","title":"Voice Interview Experience","url":"https://www.koji.so/docs/voice-interview-experience","summary":"Voice interviews let participants speak naturally with an AI interviewer powered by ElevenLabs. The microphone is always on, messages are visible on screen, and participants can switch to text mode and back at any time. This guide covers the full experience from start to completion.","content":"Voice interviews produce richer, more natural responses because participants can speak freely rather than typing. Here is a complete walkthrough of what participants experience from the moment they choose voice mode to the final thank-you screen.\n\n## Starting a Voice Interview\n\nWhen a participant arrives at your [interview landing page](/docs/interview-landing-page), they see two separate start buttons: **Start Voice Chat** and **Start Text Chat**. The participant clicks **Start Voice Chat** to begin.\n\nIf you have configured your project to offer only voice mode, the text option does not appear and participants go straight to the next step.\n\n## Microphone Permission\n\nBefore the conversation begins, the browser prompts the participant to grant microphone access. This is a standard browser permission dialog that looks slightly different on each browser and operating system.\n\nA few things to note:\n\n- **First-time visitors** will always see this prompt. Once they grant permission, most browsers remember the choice for future visits, though some may ask again.\n- **HTTPS is required.** Microphone access only works on pages served over a secure connection. 
If your interview link uses HTTP, the browser will block the request.\n- **If permission is denied,** the participant is offered the option to switch to text mode instead.\n\n## The Conversation\n\nOnce microphone access is granted, the interview begins immediately. Here is what the participant experiences:\n\n### The AI Interviewer Speaks First\n\nThe conversation opens with a warm greeting from the AI interviewer. The participant hears a natural-sounding voice that introduces the topic and asks the first question. The AI uses ElevenLabs Conversational AI for lifelike, expressive speech.\n\n### Always-On Microphone\n\nThere is no push-to-talk button. The microphone is always listening, and the system automatically detects when the participant is speaking and when they have finished. It feels like a real phone call or video chat — just talk, pause, and the interviewer responds.\n\n### Real-Time Follow-Up Questions\n\nThe AI interviewer listens carefully and asks follow-up questions based on what the participant says. If someone mentions an interesting detail, the interviewer probes deeper. If a response is vague, the interviewer asks for clarification. This dynamic back-and-forth is what makes voice interviews so effective for qualitative research.\n\n### Visual Feedback\n\nWhile the conversation is happening, the participant sees:\n\n- **An animated orb** that responds to audio state — it pulses and moves when the interviewer or participant is speaking\n- **A mute button** for moments when the participant needs to cough, speak to someone else, or take a brief pause\n- **Conversation messages** displayed on screen alongside the orb, so participants can follow along with the text of the conversation as it happens\n\n### Messages Are Visible\n\nUnlike a phone call, voice interviews in Koji display the conversation messages on screen in real time. 
Participants can see both what the AI interviewer said and what they said, which provides helpful context and makes it easy to reference earlier parts of the conversation.\n\n## Switching Between Modes\n\nAt any point during a voice interview, participants can switch to text mode. This is helpful if:\n\n- Their microphone stops working mid-conversation\n- They move to a noisy environment\n- They simply prefer typing for a particular answer\n\nThe switch is seamless — the conversation history carries over, and the AI interviewer picks up right where it left off, now in text form. Participants can also switch back to voice mode if they prefer. See [Text Interview Experience](/docs/text-interview-experience) for details on the text interface.\n\n## Interview Duration\n\nVoice interviews tend to run faster than text interviews because speaking is quicker than typing. A typical voice interview takes around 10 minutes, though this depends on the complexity of your research questions and how much the participant has to share.\n\nThe AI interviewer manages the pacing automatically. It will cover all the key topics in your research brief and naturally wind down the conversation when enough ground has been covered.\n\n## Ending the Interview\n\n### Automatic Completion\n\nWhen the AI interviewer determines that it has gathered sufficient responses across all the topics in your research brief, it wraps up the conversation naturally — thanking the participant and saying goodbye. The AI can also trigger an automatic end-of-interview signal when all research questions have been covered.\n\n### Manual Completion\n\nParticipants can end the interview at any time by clicking the **Done** button in the header. Either way, the conversation moves to the [completion flow](/docs/interview-completion-flow).\n\n## Audio Quality Tips\n\nTo help your participants have the best experience, consider sharing these tips in your outreach:\n\n1. 
**Use headphones.** This prevents echo and improves audio clarity for the AI interviewer.\n2. **Find a quiet space.** Background noise can interfere with speech detection.\n3. **Use a stable internet connection.** Voice interviews require real-time audio streaming, so a strong connection helps avoid interruptions.\n4. **Use Chrome or a Chromium-based browser.** These tend to have the best support for real-time audio features.\n\n## What Researchers See\n\nAs the study owner, you do not hear the interview in real time. Instead, you see completed interviews in your project dashboard, each with:\n\n- A full text transcript of the conversation\n- A quality score assigned by Koji's analysis\n- AI-generated insights and themes\n\nVoice and text interviews produce identical output in your dashboard — the transcript, score, and insights look the same regardless of which mode the participant used.\n\n## Next Steps\n\n- [Text Interview Experience](/docs/text-interview-experience) — how text mode works, including structured question widgets\n- [Interview Landing Page](/docs/interview-landing-page) — what participants see before the interview starts\n- [Interview Completion Flow](/docs/interview-completion-flow) — what happens when the interview ends\n\n## Further reading on the blog\n\n- [AI-Moderated Interview Platforms Compared: Which One Actually Works? (2026)](/blog/ai-moderated-interview-platforms-2026) — Not all AI interview platforms deliver real qualitative depth. This guide compares the top AI-moderated interview platforms in 2026 — Koji, \n- [Best AI Customer Interview Tools in 2026: The Complete Buyer's Guide](/blog/best-ai-customer-interview-tools-2026) — AI has fundamentally changed how product teams conduct customer research. 
Here are the best AI customer interview tools in 2026 — ranked by \n- [Best Customer Churn Interview Tools (2026): The Top 8 Compared](/blog/best-customer-churn-interview-tools-2026) — A 2024 study found exit surveys match the real churn driver in only 31% of cases. The right interview tool fixes that. Here are the 8 best c\n\n<!-- further-reading:blog -->\n","category":"Interview Experience","lastModified":"2026-05-13T00:26:36.807295+00:00","metaTitle":"Voice Interview Experience — Koji Docs","metaDescription":"Understand what participants experience during a Koji voice interview, from microphone setup to natural AI conversation.","keywords":["voice interview","participant experience","microphone permission","AI conversation","voice mode","real-time interview"],"aiSummary":"Voice interviews let participants speak naturally with an AI interviewer powered by ElevenLabs. The microphone is always on, messages are visible on screen, and participants can switch to text mode and back at any time. This guide covers the full experience from start to completion.","aiPrerequisites":["interview-landing-page"],"aiLearningOutcomes":["Understand the voice interview participant flow","Know what visual elements participants see","Advise participants on audio quality","Explain how switching to text mode works"],"aiDifficulty":"beginner","aiEstimatedTime":"6 min read"}],"pagination":{"total":1,"returned":1,"offset":0}}