The Complete Guide to Thematic Analysis

You've conducted your interviews. You have hours of recordings and hundreds of pages of transcripts. Now what?

This is the moment where many research projects stall. The gap between raw data and actionable insights is where thematic analysis comes in — it's the most widely used method for making sense of qualitative data, and this guide will walk you through it step by step.

What Is Thematic Analysis?

Thematic analysis is a method for identifying, analyzing, and reporting patterns (themes) within qualitative data. It was formalized by Virginia Braun and Victoria Clarke in their landmark 2006 paper, which has been cited over 200,000 times according to Google Scholar — making it one of the most influential methodology papers in social science.

Unlike content analysis (which counts occurrences) or grounded theory (which builds theory from data), thematic analysis offers a flexible approach that works across different epistemological positions. You don't need to subscribe to a specific theoretical framework to use it effectively.

At its core, thematic analysis answers the question: What are the recurring patterns across my data, and what do they mean?

When to Use Thematic Analysis

Thematic analysis is appropriate when you:

Have qualitative data from user interviews, focus groups, open-ended surveys, or similar sources
Want to identify patterns across multiple participants or data sources
Need to produce findings that are accessible to non-researcher stakeholders
Want a flexible method that doesn't require deep expertise in a specific theoretical tradition

It's less suitable when you need to preserve individual narratives (use narrative analysis), when you're building theory from scratch (use grounded theory), or when you need to analyze language use itself (use discourse analysis).

Braun and Clarke's Six-Phase Framework

Braun and Clarke's framework provides a clear, systematic process. These phases are sequential but not strictly linear — you'll often move back and forth between them as your understanding deepens.

Phase 1: Familiarization with the Data

Goal: Immerse yourself in the data so deeply that you begin noticing patterns intuitively.

This phase is about reading and re-reading your data. If you conducted the interviews yourself, you already have a head start, but don't skip the careful re-reading — you'll notice things you missed in the moment.

What to do:

Transcribe all audio/video recordings if you haven't already
Read through every transcript at least twice
Take initial notes — jot down things that strike you as interesting, surprising, or recurring
Don't start coding yet; stay open and curious

According to a 2020 study published in the International Journal of Qualitative Methods, researchers who spend more time on familiarization produce significantly more nuanced and well-supported themes. Rushing this phase is the most common cause of shallow analysis.

Time investment: Plan for 1–2 hours per transcript during familiarization. For a study of 10 interviews, that's 10–20 hours before you even begin formal coding.

Phase 2: Generating Initial Codes

Goal: Systematically label meaningful segments of your data.

Coding is the process of attaching short labels to segments of text that capture something relevant to your research questions. A single passage might receive multiple codes.

What to do:

Work through each transcript systematically
Highlight data segments that relate to your research questions
Assign descriptive codes to each segment
Code for as many potential themes as possible — you can always merge or discard later
Keep a codebook that defines each code

Example of coding in practice:

Data Excerpt	Codes
"I spend half my Monday just trying to figure out what everyone else did last week."	Time waste, Status visibility gap, Monday friction
"The dashboard shows numbers but I don't know what to actually do with them."	Data without context, Actionability gap, Dashboard frustration
"I just ask Sarah because she always knows what's going on."	Informal information networks, Key-person dependency, Workaround behavior

Coding approaches:

Approach	Description	Best For
Inductive	Codes emerge from the data itself	Exploratory research, new domains
Deductive	Codes come from existing theory or a predefined framework	Testing hypotheses, building on prior research
Hybrid	Start with some deductive codes, allow inductive codes to emerge	Most product research scenarios

Phase 3: Searching for Themes

Goal: Group your codes into potential themes — broader patterns that tell a story about your data.

A theme captures something important about the data in relation to your research question. It's not just a topic (like "onboarding") — it's a pattern with a point (like "users feel abandoned after initial setup because documentation assumes prior expertise").

What to do:

Gather all codes and their associated data into one view
Look for codes that cluster together or relate to a similar concept
Create candidate themes by grouping related codes
Some codes won't fit anywhere — that's fine; set them aside
Consider both semantic themes (surface-level) and latent themes (underlying assumptions or ideologies)

This is where affinity mapping can be incredibly valuable — physically or digitally grouping coded data to see patterns emerge.

Phase 4: Reviewing Themes

Goal: Refine your themes so they are coherent, distinct, and well-supported by data.

This is a quality-control phase. You're checking that your themes actually work — that they hold together internally and are clearly distinguishable from each other.

Two levels of review:

Level 1 — Review coded extracts. Read all the data segments assigned to each theme. Do they form a coherent pattern? If not, consider splitting the theme, moving some codes to another theme, or discarding codes that don't fit.
Level 2 — Review against the full dataset. Re-read the entire dataset with your theme map in mind. Do the themes accurately represent the data as a whole? Are there patterns you missed?

Signs a theme needs work:

It's too broad (captures everything but says nothing)
It overlaps significantly with another theme
It's supported by only one or two data points
You can't explain it in one or two sentences

Phase 5: Defining and Naming Themes

Goal: Write clear, concise definitions and give each theme a compelling name.

Each theme should have:

A name that captures the essence (avoid generic labels like "Communication Issues" — try "The Information Black Hole Between Teams" instead)
A definition of what the theme captures and what it doesn't
A narrative that explains the story this theme tells, with supporting data

Good theme names are specific, evocative, and instantly understandable. They help stakeholders grasp the insight without reading the full analysis.

Phase 6: Producing the Report

Goal: Tell a compelling, evidence-based story that answers your research questions.

Your report isn't just a list of themes — it's a narrative that weaves together your findings into a coherent answer to your research questions. Each theme should be illustrated with vivid data extracts that bring the pattern to life.

A strong thematic analysis report includes:

An overview of the research questions and methodology
A presentation of each theme with supporting evidence
An analysis of how themes relate to each other
Implications for design, product, or business decisions
Limitations and areas for further research

The Time Problem — And How to Solve It

Let's be honest: traditional thematic analysis is slow. Research published in Qualitative Research journal estimated that a full thematic analysis of 10 interviews takes between 60 and 120 hours when done manually — from transcription through reporting.

For product teams operating on sprint timelines, that's often not feasible. This is where the analysis process has evolved significantly:

AI-assisted analysis can reduce the mechanical parts of the process — transcription, initial coding, and code clustering — from days to hours. A 2023 study in the Journal of Medical Internet Research found that AI-assisted qualitative coding achieved 78% agreement with expert human coders, and that human-AI collaborative coding was faster than either alone.

Koji applies this principle to research analysis: after your interviews are complete, AI generates initial codes and theme suggestions that you can review, refine, and build upon. This keeps the researcher's judgment at the center while eliminating the repetitive mechanical work. The result is thematic analysis in hours rather than weeks — without sacrificing rigor.

Inductive vs. Deductive Thematic Analysis

Dimension	Inductive	Deductive
Starting point	The data itself	A pre-existing framework or theory
Code generation	Codes emerge from reading data	Codes derived from theory before analysis
Flexibility	High — follows wherever data leads	Lower — constrained by the framework
Best for	Exploratory research, new domains	Testing specific hypotheses, building on prior studies
Risk	May miss connections to existing knowledge	May force data into ill-fitting categories

Most product research benefits from a hybrid approach: start with a few deductive codes based on your research questions, but remain open to inductive codes that emerge from the data.

Quality Criteria for Thematic Analysis

How do you know if your analysis is good? Braun and Clarke outlined a 15-point checklist, but here are the essentials:

Transcriptions are accurate and retain enough detail for analysis
Each data item has been given equal attention during coding
Themes are not just paraphrases of questions — they capture patterns
Data has been analyzed, not just described — you've interpreted what patterns mean
There is a good balance between analytic narrative and data extracts
Enough time has been allocated to complete all phases adequately

Common Pitfalls

Using data collection questions as themes. "What participants said about onboarding" is a topic, not a theme. A theme captures a pattern with a point.
Weak or unconvincing themes. Every theme needs substantial supporting evidence from multiple participants.
Mismatch between data and claims. Don't claim a theme is prevalent if only two people mentioned it.
Not going beyond description. Description says what the data contains. Analysis says what the data means.

Getting Started with Thematic Analysis

If you're new to thematic analysis, here's a practical starting point:

Start with a small dataset — 5 to 6 interviews from a focused study
Use a hybrid coding approach with 3–5 deductive codes and room for inductive codes
Keep a reflexive journal noting your interpretive decisions
Discuss emerging themes with a colleague — external perspective improves quality
Practice naming themes with specificity

For related techniques, explore our guide to affinity mapping, which provides a complementary approach to organizing qualitative data into themes.

Next Steps

User Interview Guide — ensure your data collection supports strong analysis
Affinity Mapping — a visual approach to theme identification
Writing Interview Questions — better questions lead to richer data for analysis

Product & Research

Revenue & Growth

Advisory & Services

The Complete Guide to Thematic Analysis

The Complete Guide to Thematic Analysis

What Is Thematic Analysis?

When to Use Thematic Analysis

Braun and Clarke's Six-Phase Framework

Phase 1: Familiarization with the Data

Phase 2: Generating Initial Codes

Phase 3: Searching for Themes

Phase 4: Reviewing Themes

Phase 5: Defining and Naming Themes

Phase 6: Producing the Report

The Time Problem — And How to Solve It

Inductive vs. Deductive Thematic Analysis

Quality Criteria for Thematic Analysis

Common Pitfalls

Getting Started with Thematic Analysis

Next Steps

Related Articles

Turning Interviews Into Insights: From Raw Data to Action

How to Analyze Qualitative Data: From Raw Interviews to Actionable Insights

How to Code Qualitative Data: A Step-by-Step Guide

MCP Workflow Guide for UX Researchers

The Definitive Guide to User Interviews

How to Write Great Interview Questions

Affinity Mapping: Organize Qualitative Data Into Themes

Qualitative vs. Quantitative Research: When to Use Each Method

Surveys vs. Interviews: How to Choose the Right Research Method