New

Now in Claude, ChatGPT, Cursor & more with our MCP server

Back to docs
Research Methods

How to Write Unbiased Survey Questions: Avoiding Leading, Loaded & Double-Barreled Questions

A practical guide to question wording — the biggest hidden source of bad data. Learn to spot and fix leading, loaded, double-barreled, and assumptive questions, with real research examples and a pre-launch checklist.

Short answer: The fastest way to ruin a survey is not a bad sample or a low response rate — it is bad wording. A single biased phrase can swing results by double digits. The fixes are concrete: ask about one thing at a time (no double-barreled questions), use neutral language (no leading or loaded questions), avoid hidden assumptions, balance your answer options, and always pilot before you launch. This guide shows the four most damaging wording mistakes, how to spot them, and how an AI interviewer like Koji sidesteps the worst of them by asking adaptive, neutral follow-ups instead of locking everyone into one rigid script.

Why wording is the highest-leverage thing in survey design

Researchers obsess over sample size, but question wording often does more damage. The evidence is striking. In a Pew Research Center experiment on government surveillance, respondents who heard the program described as happening "with court approval" expressed support 12 percentage points higher than those who heard no mention of courts; describing it as part of "anti-terrorism efforts" added another 9 points (Pew Research Center). Same underlying issue, same population — only the words changed, and the "finding" flipped.

Pew is blunt about the stakes: "Even small wording differences can substantially affect the answers people provide" (Pew Research Center). If a phrase can move a national poll by 12 points, it can certainly mislead your product or pricing decision. As Qualtrics notes, biased questions "produce results that simply reinforce the researcher's own assumptions" — the most expensive kind of data, because it looks valid.

The four most damaging wording mistakes

1. Double-barreled questions (asking two things at once)

A double-barreled question asks about two topics but allows only one answer. "How satisfied are you with our price and customer service?" A respondent who loves the price but hates support has no honest answer. They are forced to average, pick one barrel silently, or default to neutral — and you can never tell which (Scribbr).

The tell: the words "and" or "or" joining two distinct concepts.

The fix: split it. Ask about price in one question and support in another. One idea per question, always.

2. Leading questions (steering the answer)

A leading question nudges the respondent toward a particular response through its phrasing. "How much did you enjoy our award-winning onboarding?" presumes enjoyment and primes a positive answer. "Don't you agree that the new design is better?" invites agreement.

The fix: strip evaluative and presumptive language. "How would you describe your onboarding experience?" lets the answer go either way.

3. Loaded & assumptive questions (smuggling in a premise)

A loaded question contains an assumption the respondent may not share — the classic "Have you stopped overspending on software?" traps anyone who never overspent. Assumptive questions presume a behavior happened at all: "What did you like about the new feature?" assumes they used it and liked something.

The fix: verify the premise first with a screening or yes/no question, then branch. Never bake the conclusion into the question.

4. Absolutes, jargon, and unbalanced scales

Words like "always" and "never" force false precision. Internal jargon ("How useful is the SKU-level RFM dashboard?") gets guessed at, not answered. And an unbalanced scale — four positive options and one negative — quietly inflates your numbers. Balanced scales and plain language are non-negotiable.

The acquiescence trap

Beyond specific question types, humans tend toward acquiescence bias — a habit of agreeing with statements regardless of content, especially when tired or unsure. Agree/disagree batteries are particularly vulnerable. Prefer item-specific scales ("How would you rate X? Poor → Excellent") over "Do you agree that X is good?" and vary the direction of your items so respondents cannot autopilot. (For scale design specifics, see the Likert scale research guide.)

A pre-launch wording checklist

Before any survey goes live, run every question through this list:

  1. One idea per question? No "and"/"or" joining two concepts.
  2. Neutral phrasing? No adjectives ("award-winning," "easy") that pre-judge the answer.
  3. No hidden assumptions? The question works even if the respondent has never done the thing.
  4. Plain language? No jargon, acronyms, or terms a customer wouldn't use.
  5. Balanced options? Equal positive and negative choices; a genuine "neutral"/"N/A" where appropriate.
  6. No absolutes? Avoid "always," "never," "all."
  7. Piloted? You have tested it on 3–5 real people and watched where they hesitated.

That last step is the cheapest insurance in research. As Pew demonstrates, you cannot reliably predict wording effects from the armchair — you have to test. Survey experiments and cognitive pretests exist precisely because intuition is unreliable here.

How Koji reduces wording bias by design

Static surveys are fragile because every respondent gets the exact same rigid wording, so any flaw is amplified across your whole sample. An AI interviewer changes the dynamics in three ways:

  • Adaptive neutral follow-ups. Instead of cramming two ideas into one leading question, Koji's AI moderator asks a clean opener and then probes — "You mentioned price; tell me more about that" — capturing the nuance a double-barreled question would have destroyed.
  • Six structured question types, used correctly. Koji supports open_ended, scale, single_choice, multiple_choice, ranking, and yes_no. Pairing a clean scale question with an open_ended "why" gets you the number and the unbiased reasoning, instead of forcing a loaded single question to do both jobs. See the structured questions guide.
  • Premise-checking branches. Rather than assuming a behavior, Koji can confirm it ("Have you used the new feature?") and only then explore — eliminating assumptive-question bias automatically.

The result is data that reflects what respondents actually think, not what your phrasing nudged them toward.

The modern approach with AI

Traditional survey design puts the entire burden of neutrality on the question author, then locks that wording for thousands of respondents. AI-moderated research distributes that burden: clean, simple prompts up front, with intelligent follow-up doing the work that a clever-but-biased question used to attempt. You still need to apply the principles above — but you are no longer one bad adjective away from a misleading dataset. Combine disciplined wording with adaptive AI moderation, and you get the rarest thing in research: answers you can trust.

Before-and-after rewrites

Principles stick when you see them applied. Each row takes a flawed question and fixes it:

ProblemFlawed questionFixed question
Double-barreled"How satisfied are you with our price and quality?"Two questions: one on price, one on quality.
Leading"How much did you love our fast, intuitive checkout?""How would you describe your checkout experience?"
Loaded / assumptive"What do you like most about the new dashboard?""Have you used the new dashboard?" → if yes → "What stood out, good or bad?"
Absolutes"Do you always read our emails?""In the last month, how often did you read our emails?" (with frequency options)
Jargon"How useful is the RFM segmentation view?""How useful is the view that groups customers by how recently and often they buy?"
Unbalanced scaleExcellent / Very good / Good / FairExcellent / Good / Fair / Poor / Very poor (balanced)

Question order and context effects

Wording is not the only place bias hides — order matters too. An early question can prime how respondents answer later ones, a phenomenon Pew and other methodologists document repeatedly. Asking "How satisfied are you overall?" right after a battery of complaints will drag the overall score down; asking it first gives a cleaner read. Two practical rules: put general questions before specific ones when you want an uncontaminated overall judgment, and randomize the order of items in a list so no single option benefits from always appearing first (primacy) or last (recency).

Open-ended questions: the bias relief valve

Closed questions can only return the options you imagined — and any bias in those options is baked into every response. A well-placed open-ended question lets respondents answer in their own words and surfaces the framing you missed. The historical trade-off was analysis cost: open text is slow to code by hand. This is exactly where AI analysis changes the economics — tools that auto-cluster open-ended responses make it practical to ask "why" of thousands of people, not dozens. (See open-ended interview questions.)

Why this matters more than ever

As surveys get easier to send, the marginal cost of a biased survey approaches zero — and so does the value of its data. Discipline in wording is what separates a survey that informs a decision from one that merely launders an assumption. Get the words right, pilot before you launch, and let adaptive follow-up do the work a clever-but-leading question used to attempt.

Related Resources

Related Articles