The Complete Guide to Thematic Analysis
Learn how to systematically analyze qualitative data using Braun and Clarke's six-phase thematic analysis framework.
The Complete Guide to Thematic Analysis
You've conducted your interviews. You have hours of recordings and hundreds of pages of transcripts. Now what?
This is the moment where many research projects stall. The gap between raw data and actionable insights is where thematic analysis comes in — it's the most widely used method for making sense of qualitative data, and this guide will walk you through it step by step.
What Is Thematic Analysis?
Thematic analysis is a method for identifying, analyzing, and reporting patterns (themes) within qualitative data. It was formalized by Virginia Braun and Victoria Clarke in their landmark 2006 paper, which has been cited over 200,000 times according to Google Scholar — making it one of the most influential methodology papers in social science.
Unlike content analysis (which counts occurrences) or grounded theory (which builds theory from data), thematic analysis offers a flexible approach that works across different epistemological positions. You don't need to subscribe to a specific theoretical framework to use it effectively.
At its core, thematic analysis answers the question: What are the recurring patterns across my data, and what do they mean?
When to Use Thematic Analysis
Thematic analysis is appropriate when you:
- Have qualitative data from user interviews, focus groups, open-ended surveys, or similar sources
- Want to identify patterns across multiple participants or data sources
- Need to produce findings that are accessible to non-researcher stakeholders
- Want a flexible method that doesn't require deep expertise in a specific theoretical tradition
It's less suitable when you need to preserve individual narratives (use narrative analysis), when you're building theory from scratch (use grounded theory), or when you need to analyze language use itself (use discourse analysis).
Braun and Clarke's Six-Phase Framework
Braun and Clarke's framework provides a clear, systematic process. These phases are sequential but not strictly linear — you'll often move back and forth between them as your understanding deepens.
Phase 1: Familiarization with the Data
Goal: Immerse yourself in the data so deeply that you begin noticing patterns intuitively.
This phase is about reading and re-reading your data. If you conducted the interviews yourself, you already have a head start, but don't skip the careful re-reading — you'll notice things you missed in the moment.
What to do:
- Transcribe all audio/video recordings if you haven't already
- Read through every transcript at least twice
- Take initial notes — jot down things that strike you as interesting, surprising, or recurring
- Don't start coding yet; stay open and curious
According to a 2020 study published in the International Journal of Qualitative Methods, researchers who spend more time on familiarization produce significantly more nuanced and well-supported themes. Rushing this phase is the most common cause of shallow analysis.
Time investment: Plan for 1–2 hours per transcript during familiarization. For a study of 10 interviews, that's 10–20 hours before you even begin formal coding.
Phase 2: Generating Initial Codes
Goal: Systematically label meaningful segments of your data.
Coding is the process of attaching short labels to segments of text that capture something relevant to your research questions. A single passage might receive multiple codes.
What to do:
- Work through each transcript systematically
- Highlight data segments that relate to your research questions
- Assign descriptive codes to each segment
- Code for as many potential themes as possible — you can always merge or discard later
- Keep a codebook that defines each code
Example of coding in practice:
| Data Excerpt | Codes |
|---|---|
| "I spend half my Monday just trying to figure out what everyone else did last week." | Time waste, Status visibility gap, Monday friction |
| "The dashboard shows numbers but I don't know what to actually do with them." | Data without context, Actionability gap, Dashboard frustration |
| "I just ask Sarah because she always knows what's going on." | Informal information networks, Key-person dependency, Workaround behavior |
Coding approaches:
| Approach | Description | Best For |
|---|---|---|
| Inductive | Codes emerge from the data itself | Exploratory research, new domains |
| Deductive | Codes come from existing theory or a predefined framework | Testing hypotheses, building on prior research |
| Hybrid | Start with some deductive codes, allow inductive codes to emerge | Most product research scenarios |
Phase 3: Searching for Themes
Goal: Group your codes into potential themes — broader patterns that tell a story about your data.
A theme captures something important about the data in relation to your research question. It's not just a topic (like "onboarding") — it's a pattern with a point (like "users feel abandoned after initial setup because documentation assumes prior expertise").
What to do:
- Gather all codes and their associated data into one view
- Look for codes that cluster together or relate to a similar concept
- Create candidate themes by grouping related codes
- Some codes won't fit anywhere — that's fine; set them aside
- Consider both semantic themes (surface-level) and latent themes (underlying assumptions or ideologies)
This is where affinity mapping can be incredibly valuable — physically or digitally grouping coded data to see patterns emerge.
Phase 4: Reviewing Themes
Goal: Refine your themes so they are coherent, distinct, and well-supported by data.
This is a quality-control phase. You're checking that your themes actually work — that they hold together internally and are clearly distinguishable from each other.
Two levels of review:
-
Level 1 — Review coded extracts. Read all the data segments assigned to each theme. Do they form a coherent pattern? If not, consider splitting the theme, moving some codes to another theme, or discarding codes that don't fit.
-
Level 2 — Review against the full dataset. Re-read the entire dataset with your theme map in mind. Do the themes accurately represent the data as a whole? Are there patterns you missed?
Signs a theme needs work:
- It's too broad (captures everything but says nothing)
- It overlaps significantly with another theme
- It's supported by only one or two data points
- You can't explain it in one or two sentences
Phase 5: Defining and Naming Themes
Goal: Write clear, concise definitions and give each theme a compelling name.
Each theme should have:
- A name that captures the essence (avoid generic labels like "Communication Issues" — try "The Information Black Hole Between Teams" instead)
- A definition of what the theme captures and what it doesn't
- A narrative that explains the story this theme tells, with supporting data
Good theme names are specific, evocative, and instantly understandable. They help stakeholders grasp the insight without reading the full analysis.
Phase 6: Producing the Report
Goal: Tell a compelling, evidence-based story that answers your research questions.
Your report isn't just a list of themes — it's a narrative that weaves together your findings into a coherent answer to your research questions. Each theme should be illustrated with vivid data extracts that bring the pattern to life.
A strong thematic analysis report includes:
- An overview of the research questions and methodology
- A presentation of each theme with supporting evidence
- An analysis of how themes relate to each other
- Implications for design, product, or business decisions
- Limitations and areas for further research
The Time Problem — And How to Solve It
Let's be honest: traditional thematic analysis is slow. Research published in Qualitative Research journal estimated that a full thematic analysis of 10 interviews takes between 60 and 120 hours when done manually — from transcription through reporting.
For product teams operating on sprint timelines, that's often not feasible. This is where the analysis process has evolved significantly:
AI-assisted analysis can reduce the mechanical parts of the process — transcription, initial coding, and code clustering — from days to hours. A 2023 study in the Journal of Medical Internet Research found that AI-assisted qualitative coding achieved 78% agreement with expert human coders, and that human-AI collaborative coding was faster than either alone.
Koji applies this principle to research analysis: after your interviews are complete, AI generates initial codes and theme suggestions that you can review, refine, and build upon. This keeps the researcher's judgment at the center while eliminating the repetitive mechanical work. The result is thematic analysis in hours rather than weeks — without sacrificing rigor.
Inductive vs. Deductive Thematic Analysis
| Dimension | Inductive | Deductive |
|---|---|---|
| Starting point | The data itself | A pre-existing framework or theory |
| Code generation | Codes emerge from reading data | Codes derived from theory before analysis |
| Flexibility | High — follows wherever data leads | Lower — constrained by the framework |
| Best for | Exploratory research, new domains | Testing specific hypotheses, building on prior studies |
| Risk | May miss connections to existing knowledge | May force data into ill-fitting categories |
Most product research benefits from a hybrid approach: start with a few deductive codes based on your research questions, but remain open to inductive codes that emerge from the data.
Quality Criteria for Thematic Analysis
How do you know if your analysis is good? Braun and Clarke outlined a 15-point checklist, but here are the essentials:
- Transcriptions are accurate and retain enough detail for analysis
- Each data item has been given equal attention during coding
- Themes are not just paraphrases of questions — they capture patterns
- Data has been analyzed, not just described — you've interpreted what patterns mean
- There is a good balance between analytic narrative and data extracts
- Enough time has been allocated to complete all phases adequately
Common Pitfalls
- Using data collection questions as themes. "What participants said about onboarding" is a topic, not a theme. A theme captures a pattern with a point.
- Weak or unconvincing themes. Every theme needs substantial supporting evidence from multiple participants.
- Mismatch between data and claims. Don't claim a theme is prevalent if only two people mentioned it.
- Not going beyond description. Description says what the data contains. Analysis says what the data means.
Getting Started with Thematic Analysis
If you're new to thematic analysis, here's a practical starting point:
- Start with a small dataset — 5 to 6 interviews from a focused study
- Use a hybrid coding approach with 3–5 deductive codes and room for inductive codes
- Keep a reflexive journal noting your interpretive decisions
- Discuss emerging themes with a colleague — external perspective improves quality
- Practice naming themes with specificity
For related techniques, explore our guide to affinity mapping, which provides a complementary approach to organizing qualitative data into themes.
Next Steps
- User Interview Guide — ensure your data collection supports strong analysis
- Affinity Mapping — a visual approach to theme identification
- Writing Interview Questions — better questions lead to richer data for analysis
Related Articles
Turning Interviews Into Insights: From Raw Data to Action
A complete guide to transforming raw interview transcripts into structured, actionable insights — covering manual analysis, AI-assisted workflows, and frameworks for prioritizing findings.
How to Analyze Qualitative Data: From Raw Interviews to Actionable Insights
A step-by-step guide to qualitative data analysis — from reviewing raw transcripts to synthesizing themes, generating insights, and presenting findings that teams act on.
How to Code Qualitative Data: A Step-by-Step Guide
Learn the complete process of qualitative coding — from building a codebook to identifying themes — and how AI tools like Koji automate the most time-consuming parts.
MCP Workflow Guide for UX Researchers
How UX researchers use Koji MCP with Claude to scale qualitative research. Manage multiple studies, analyze transcripts across projects, generate themed reports, and maintain a living research repository.
The Definitive Guide to User Interviews
Everything you need to plan, conduct, and analyze user interviews that produce actionable research insights.
How to Write Great Interview Questions
Learn to craft open-ended, neutral interview questions that surface genuine user insights instead of confirmation bias.
Affinity Mapping: Organize Qualitative Data Into Themes
Learn how to use affinity mapping to group qualitative research data into meaningful clusters and uncover actionable patterns.
Qualitative vs. Quantitative Research: When to Use Each Method
A clear breakdown of qualitative and quantitative research — what each method reveals, when to use each, and how to combine them for the most complete picture of your users.
Surveys vs. Interviews: How to Choose the Right Research Method
A comprehensive comparison of surveys and interviews as research methods. Understand when to use each, the key trade-offs, how to combine them in mixed-methods studies, and why the choice matters for research quality.