New

Now in Claude, ChatGPT, Cursor & more with our MCP server

Back to docs
Research Methods

Generative vs. Evaluative Research: When to Use Each Method

Understand the difference between generative and evaluative research, when to use each, and how combining both leads to better product decisions. Includes a comparison table and decision framework.

Every research study fits into one of two categories: generative or evaluative. Mixing them up is one of the most common mistakes in product development — and it leads to research that produces the wrong kind of insight at the wrong time.

Understanding the difference helps you choose the right method every time.

What Is Generative Research?

Generative research explores and discovers. It helps you understand problems, user needs, and contexts that you do not yet fully comprehend. The goal is to generate new understanding — which is where the name comes from.

Generative research answers questions like:

  • "What problems do our users have that we have not thought about?"
  • "What is the context in which people make this decision?"
  • "Why do customers churn after the first month?"
  • "What does success look like for our target users?"

Common generative methods:

  • In-depth interviews — the workhorse of generative research
  • Contextual inquiry — observing users in their natural environment
  • Diary studies — capturing experiences as they happen over time
  • Ethnographic research — deep immersion in user contexts
  • Jobs-to-Be-Done interviews — understanding underlying motivations and goals

Generative research is inherently open-ended. You do not know exactly what you will find — that is the point. The goal is to discover, not confirm.

What Is Evaluative Research?

Evaluative research tests and validates. It assesses a specific solution, design, or hypothesis against clear criteria. The goal is to evaluate something you have already created or hypothesized.

Evaluative research answers questions like:

  • "Can users complete this task with the new interface?"
  • "Do users understand what this feature does?"
  • "Which version of this landing page converts better?"
  • "Is this feature actually solving the problem we intended?"

Common evaluative methods:

  • Usability testing — observing users attempting specific tasks
  • A/B testing — comparing two versions quantitatively
  • Surveys — measuring attitudes or satisfaction at scale
  • Card sorting and tree testing — evaluating information architecture
  • Concept testing — testing whether a proposed solution resonates

Evaluative research is inherently convergent. You are testing against specific criteria, not exploring freely.

The Key Differences

DimensionGenerativeEvaluative
GoalDiscover new understandingTest existing hypotheses
StageEarly — discovery, definitionLater — validation, iteration
OutputInsights, themes, frameworksRatings, pass/fail, comparisons
Typical methodsInterviews, ethnography, diary studiesUsability tests, surveys, A/B tests
Sample sizeSmaller — 5 to 20 participantsLarger — 20 to 100+ participants
ModerationUsually moderated, in-depthOften unmoderated
When to use"We need to understand X""We need to know if Y works"

According to Nielsen Norman Group, research teams most commonly over-index on evaluative research — testing what they have built rather than discovering what to build. This leads to well-executed products that solve the wrong problems.

When to Use Generative Research

Use generative research when:

  • You are entering a new market or domain and need foundational understanding
  • You are struggling to understand why users behave in a certain way
  • Your product decisions keep being questioned internally — "how do we know users want this?"
  • You are designing a new feature and do not have validated user needs
  • You want to find opportunities you have not thought of yet
  • You are trying to understand a problem space before committing to a solution

Pro tip: Generative research does not require a large budget or a research team. Platforms like Koji allow you to conduct 10–20 in-depth AI interviews in a few days — giving you the depth of qualitative research without weeks of scheduling and analysis.

When to Use Evaluative Research

Use evaluative research when:

  • You have a specific design, prototype, or feature ready to test
  • You need to decide between two or more competing approaches
  • You want to measure whether a recent change achieved its intended effect
  • You need to validate a hypothesis before committing to full development
  • You are preparing for a major release and want to find critical usability issues

The Danger of Skipping Generative Research

"Let us just test what we built" is a phrase that leads to products optimized for the wrong problem. Evaluative research can tell you if your solution works — but it cannot tell you if it is solving the right problem.

Research from Nielsen Norman Group found that most product failures are failures of problem framing, not execution. The team built something that worked well — it just was not what users actually needed.

Generative research prevents this by investing time upfront to understand the problem before committing to solutions. Even a small round of 8–10 customer interviews — which Koji can run for you in under a week — can prevent months of wasted engineering effort.

How AI-Powered Interviews Transform Generative Research

Historically, the biggest barrier to generative research was time. Conducting 20 in-depth interviews typically required:

  • Three to four weeks of scheduling
  • A skilled researcher to moderate each session
  • Hours of analysis per transcript
  • Manual synthesis across all conversations

AI interview platforms like Koji compress this timeline dramatically. Koji's AI conducts naturalistic, adaptive conversations with participants — asking follow-up questions, exploring interesting threads, and probing for deeper context automatically. Then Koji analyzes all interviews together, surfacing themes, recurring patterns, and key quotes.

What used to take a research team several weeks now takes days. This makes continuous generative research feasible — rather than a one-time discovery phase, teams can run regular generative studies alongside product development.

Teams that practice continuous generative research make higher-quality product decisions and ship fewer features that need to be rolled back or redesigned.

The Research Cycle

Effective research programs use both generative and evaluative research in sequence:

Generative → Design → Evaluative → Ship → Generative → ...
  1. Generative phase: Understand the problem space and user needs
  2. Design phase: Create solutions based on generative insights
  3. Evaluative phase: Test the solution against specific criteria
  4. Ship phase: Release and measure outcomes
  5. Generative phase: Understand what happened and why — then repeat

This cycle creates a feedback loop that continuously grounds product decisions in real user understanding. Many teams run evaluative research but skip the generative phases — resulting in well-executed products solving the wrong problems.

Choosing the Right Method: A Quick Framework

A simple heuristic for deciding which type of research you need:

  • If you are asking "why?" → generative method
  • If you are asking "does it work?" → evaluative method
  • If you are asking "which one is better?" → evaluative method
  • If you are asking "what should we build?" → generative method

When in doubt, ask: "What decision will this research inform?" If the decision is about problem framing or opportunity identification, use generative research. If the decision is about validating a specific solution, use evaluative research.

Mixing Methods

Some research studies blend generative and evaluative elements — but do this intentionally, not by accident.

A usability test that includes open-ended exploration questions at the end is doing both: evaluating task completion (evaluative) while also discovering new user needs (generative). This can be efficient, but be careful not to let one goal undermine the other.

The risk: a study designed primarily for evaluation that tries to also be generative usually produces shallow insights on both fronts. Better to run separate, focused studies than one study trying to do everything.

Key Things to Know

  • Most teams under-invest in generative research: Evaluative research feels more concrete — "test this design" — but generative research prevents the more costly mistake of building the wrong thing entirely
  • Small samples are fine for generative research: 5–8 participants is usually enough for qualitative saturation; you do not need statistical significance when the goal is discovery
  • Generative research should inform your evaluation criteria: The metrics you evaluate against should be grounded in what generative research revealed matters to users
  • AI tools are most impactful for generative research: The depth and breadth of conversational data matters most in generative work, which is where platforms like Koji provide the greatest acceleration

Tips & Best Practices

  • Do not mix generative and evaluative goals in one study: Studies trying to be both usually do both poorly — keep the goals separate
  • Budget time for both types throughout your product lifecycle: A well-rounded research program allocates time to both discovery and validation, not just pre-launch testing
  • Document your research type upfront: Brief stakeholders on whether you are exploring or validating — it sets appropriate expectations for the output
  • Generative insights have a shelf life: If you did generative research 18 months ago, your insights may be stale. Run refresh studies before making major product bets.

Related Articles

Frequently Asked Questions

Q: Can the same research method be both generative and evaluative? A: Yes — interviews can be either, depending on how they are designed. An open-ended interview exploring user workflows is generative. An interview testing whether users understand a new concept is evaluative. The method is less important than the research goal.

Q: Which type of research should early-stage startups do first? A: Generative. Before building, founders need deep understanding of the problem space. Many early-stage companies fail not because their product is bad, but because they are solving a problem people do not care enough about. Generative customer discovery interviews — which Koji makes easy to run at scale — should come before significant engineering investment.

Q: Do I need a dedicated research team for generative research? A: Not anymore. AI interview platforms like Koji allow product managers, founders, and small teams to conduct scalable generative research. Koji handles moderation and analysis — you focus on interpreting insights and making decisions.

Q: How do I convince stakeholders to invest in generative research? A: Frame it in terms of risk reduction. Generative research reduces the risk of building the wrong thing. A single round of 15 customer interviews — which Koji can run in under a week — can prevent months of wasted engineering effort.

Q: What is the right balance between generative and evaluative research? A: It depends on your product stage. Early-stage teams should heavily favor generative research. As you scale and establish a product, the balance shifts toward more evaluative work — though generative research should never be eliminated entirely. The right cadence is continuous generative work alongside periodic evaluative studies at key decision points.

Related Articles

The Definitive Guide to User Interviews

Everything you need to plan, conduct, and analyze user interviews that produce actionable research insights.

Qualitative vs. Quantitative Research: When to Use Each Method

A clear breakdown of qualitative and quantitative research — what each method reveals, when to use each, and how to combine them for the most complete picture of your users.

Customer Discovery Interviews: The Complete Guide

Learn how to conduct customer discovery interviews to validate your product ideas before building. Covers Steve Blank methodology, question frameworks, sample sizes, and common mistakes.

How to Conduct Usability Testing: The Complete Guide

A comprehensive guide to usability testing for UX researchers and product managers. Covers types of testing, participant numbers, step-by-step facilitation, and the most common mistakes to avoid.

Surveys vs. Interviews: How to Choose the Right Research Method

A comprehensive comparison of surveys and interviews as research methods. Understand when to use each, the key trade-offs, how to combine them in mixed-methods studies, and why the choice matters for research quality.

Diary Studies: The Complete Guide to Longitudinal User Research

Learn how to design, run, and analyze diary studies that capture real user experiences in context. Includes how AI interviews complement diary research at scale.

Card Sorting: The Complete Guide to Information Architecture Research

Everything you need to run effective card sorting studies — open, closed, and hybrid variants. Includes sample sizes, analysis techniques, and how to combine card sorting with qualitative interviews.