{"site":{"name":"Koji","description":"AI-native customer research platform that helps teams conduct, analyze, and synthesize customer interviews at scale.","url":"https://www.koji.so","contentTypes":["blog","documentation"],"lastUpdated":"2026-06-28T12:46:35.756Z"},"content":[{"type":"documentation","id":"40e12c3c-a1ef-4677-a170-5a6dd2491a73","slug":"supr-q-guide","title":"SUPR-Q: The Standardized Questionnaire for Measuring Website Quality, Trust & Loyalty (2026 Guide)","url":"https://www.koji.so/docs/supr-q-guide","summary":"SUPR-Q (Standardized User Experience Percentile Rank Questionnaire) is an 8-item survey that scores a website or app on four factors — usability, trust and credibility, appearance, and loyalty — and converts the result to a percentile rank against a normative database of 150+ sites and 5,000+ users. Use SUPR-Q to benchmark whole web experiences (where trust and aesthetics drive conversion); use SUS for usability-only task evaluation. Report the four sub-scores, not just the overall, field it after a representative task, and never modify the wording. Koji runs SUPR-Q as scale-type structured questions plus AI open-ended follow-up on low Trust/Appearance scores, producing the standardized percentile and the qualitative reason behind each score automatically.","content":"## What is the SUPR-Q?\n\nThe **SUPR-Q (Standardized User Experience Percentile Rank Questionnaire)** is an 8-item survey that measures the overall quality of a website or app and expresses the result as a **percentile rank** — a single number from 0 to 100% that tells you how your experience compares to hundreds of other digital products.\n\n> **The bottom line:** A SUPR-Q score at the 50th percentile is exactly average. Above the 90th percentile means your site outperforms 90% of the products in the benchmark database. The questionnaire rolls up four sub-dimensions — **Usability, Trust & Credibility, Appearance, and Loyalty** — into one comparable score, which is what makes it more complete than a usability-only metric like SUS.\n\nSUPR-Q was developed by Jeff Sauro and the team at MeasuringU and validated against a normative database of **150+ websites and over 5,000 users**. That normative database is the magic: unlike a raw average, a percentile rank is only meaningful because it is anchored to real-world data from e-commerce, B2B, travel, finance, and SaaS sites. When you report \"our checkout flow is at the 72nd percentile,\" everyone in the room instantly understands whether that is good.\n\n## The 8 SUPR-Q questions and their four factors\n\nSUPR-Q uses eight statements rated on a 5-point agreement scale (1 = Strongly disagree, 5 = Strongly agree), plus one 11-point likelihood-to-recommend item. The items load onto four factors, two items each:\n\n**Usability**\n1. The website is easy to use.\n2. It is easy to navigate within the website.\n\n**Trust & Credibility**\n3. The information on the website is trustworthy.\n4. The website is trustworthy.\n\n**Appearance**\n5. I find the website to be attractive.\n6. The website has a clean and simple presentation.\n\n**Loyalty**\n7. How likely are you to recommend this website to a friend or colleague? (0–10, the Net Promoter item)\n8. I will likely return to the website in the future.\n\nTwo design choices make SUPR-Q efficient. First, it is short — eight items take respondents under two minutes. Second, it bakes loyalty (including the NPS question) directly into the instrument, so you capture an attitude metric and a behavioral-intent metric in one pass instead of fielding two separate surveys.\n\n## How SUPR-Q scoring works\n\nThere are three layers to a SUPR-Q result:\n\n1. **The raw score.** Average the eight items (after converting the 0–10 NPS item to the same 1–5 footing using the published transformation) to get a mean from 1 to 5.\n2. **The percentile rank.** Convert that raw mean to a percentile against the normative database. A raw score around 3.9 typically lands near the 50th percentile; the relationship is non-linear, which is exactly why the lookup against the benchmark matters more than the raw average.\n3. **The sub-scores.** Report each of the four factors separately. A site can sit at the 80th percentile on Appearance but the 30th percentile on Trust — and that gap is the most actionable thing the instrument gives you.\n\nThe percentile framing is the whole point: it turns an abstract 1–5 average into a competitive statement (\"we beat 72% of sites\") that executives and designers both understand without a statistics lesson.\n\n## SUPR-Q vs SUS vs NPS vs CES\n\nThese instruments are complementary, not interchangeable:\n\n- **SUS (System Usability Scale)** measures perceived usability only, on a 0–100 scale. It is the gold standard for *usability*, but says nothing about trust or visual appeal.\n- **SUPR-Q** measures the *broader experience* — usability plus trust, appearance, and loyalty — and is purpose-built for websites and web apps.\n- **NPS** measures loyalty alone. SUPR-Q includes the NPS item but contextualizes it inside the full experience.\n- **CES (Customer Effort Score)** measures friction on a single task.\n\nRule of thumb: use **SUS** when you are evaluating a tool or task flow, and **SUPR-Q** when you are benchmarking a whole website or marketing/commerce experience where trust and aesthetics drive conversion. Many mature teams track SUPR-Q quarterly as a site-health benchmark and SUS per release.\n\n## How many participants do you need?\n\nSUPR-Q is a quantitative instrument, so sample size matters more than it does for a formative usability test:\n\n- **30–50 respondents** for a directional internal read.\n- **75–100 respondents** for a stable benchmark you will track over time.\n- **100+ per segment** if you want to compare percentile ranks between audiences (e.g., new vs. returning visitors) with confidence.\n\nBecause the score is anchored to an external normative database, you do not need thousands of responses — you need enough to make your own mean stable, then the benchmark does the comparative work.\n\n## Common SUPR-Q pitfalls\n\n- **Reporting only the overall score.** The four sub-scores are where the insight lives. A strong overall percentile can hide a Trust problem that is quietly killing conversion.\n- **Modifying the wording.** Like SUS, the normative database is built on the exact published items. Rewrite them and you forfeit the percentile comparison.\n- **Surveying the wrong moment.** Field SUPR-Q *after* a representative task, not on arrival — you are measuring the experience, not first impressions (use a 5-second test for that).\n- **Ignoring the \"why.\"** A percentile rank tells you *where* you stand, never *why*. Pair every quantitative score with open-ended follow-up.\n\n## How to run SUPR-Q faster with Koji\n\nTraditional SUPR-Q studies mean building a survey, recruiting a panel, exporting to a stats tool, and manually computing percentile ranks — usually a one-to-two-week cycle. **Platforms like Koji collapse that into an afternoon** by treating SUPR-Q as a set of structured questions inside an AI-moderated interview.\n\nKoji supports six structured question types — **open_ended, scale, single_choice, multiple_choice, ranking, and yes_no** (see the [structured questions guide](/docs/structured-questions-guide)). For SUPR-Q you map the eight items to:\n\n- **Scale questions** for the six 1–5 agreement statements and the 0–10 NPS item (Koji captures the exact ground-truth value via the response widget, so scoring is deterministic — no transcription guesswork).\n- **An open_ended probe** layered on the Trust and Appearance factors. This is the part traditional surveys cannot do: when a respondent rates Trust a 2, Koji's AI interviewer automatically asks a follow-up — *\"What made the site feel less trustworthy to you?\"* — and keeps probing up to your configured depth.\n\nThe result is a study that delivers the standardized percentile score **and** the qualitative reason behind every low sub-score, with the per-question distribution charts and themed open-text findings generated automatically in a real-time report. No moderator, no manual coding, voice or text, running 24/7. That is the difference between knowing you are at the 35th percentile on Trust and knowing *exactly which three things to fix.*\n\n## Worked example: reading a SUPR-Q result\n\nImagine you run SUPR-Q on your checkout flow with 90 shoppers and get an overall result at the **68th percentile**. On its own that looks fine — above average. But break out the four factors and the story changes:\n\n- **Usability — 81st percentile.** The flow is easy to use.\n- **Appearance — 74th percentile.** It looks clean and credible.\n- **Trust & Credibility — 38th percentile.** A clear weakness.\n- **Loyalty — 55th percentile.** Middling intent to return and recommend.\n\nThe overall percentile hid the real problem. Shoppers can complete checkout easily and find it attractive, but they do not fully trust it — which on a payment page directly suppresses conversion. The action is obvious: invest in trust signals (security badges, clearer policies, social proof), not in visual polish or flow simplification. This is why reporting the sub-scores is non-negotiable, and why pairing each score with an open-ended \"why\" question — automatically, on the low factors — turns a benchmark into a roadmap.\n\n## When to re-field SUPR-Q\n\nTreat SUPR-Q as a tracking benchmark, not a one-off. Re-field it after any significant redesign, and on a fixed quarterly cadence even when nothing changes, so you can see whether competitors, expectations, or your own iterations have moved your percentile. Keep the audience, task, and wording identical between waves — the only variable you want to change is the experience itself.\n\n## Related Resources\n\n- [Structured Questions Guide](/docs/structured-questions-guide) — the six question types that power quantitative-plus-qualitative studies in Koji\n- [System Usability Scale (SUS): Complete Guide](/docs/system-usability-scale-guide) — the usability-only companion benchmark\n- [HEART Framework: Google's 5-Metric UX Model](/docs/heart-framework-ux-metrics) — for behavioral UX measurement at scale\n- [Likert Scale Questions in User Research](/docs/likert-scale-research-guide) — how to design the agreement scales SUPR-Q relies on\n- [CSAT vs NPS vs CES](/docs/csat-vs-nps-vs-ces) — choosing the right experience metric\n- [Usability Benchmarking Guide](/docs/usability-benchmarking-guide) — how to track UX metrics over time","category":"Research Methods","lastModified":"2026-06-28T03:18:25.813216+00:00","metaTitle":"SUPR-Q Guide: Score Your Website on Usability, Trust & Loyalty (2026)","metaDescription":"SUPR-Q is an 8-item questionnaire that benchmarks website quality across usability, trust, appearance, and loyalty as a percentile rank. Learn the questions, scoring, sample size, and how to run it faster with AI.","keywords":["SUPR-Q","SUPR-Q questionnaire","SUPR-Q score","website quality questionnaire","percentile rank UX","SUPR-Q vs SUS","MeasuringU SUPR-Q","UX benchmark questionnaire","trust and credibility survey","website usability metric"],"aiSummary":"SUPR-Q (Standardized User Experience Percentile Rank Questionnaire) is an 8-item survey that scores a website or app on four factors — usability, trust and credibility, appearance, and loyalty — and converts the result to a percentile rank against a normative database of 150+ sites and 5,000+ users. Use SUPR-Q to benchmark whole web experiences (where trust and aesthetics drive conversion); use SUS for usability-only task evaluation. Report the four sub-scores, not just the overall, field it after a representative task, and never modify the wording. Koji runs SUPR-Q as scale-type structured questions plus AI open-ended follow-up on low Trust/Appearance scores, producing the standardized percentile and the qualitative reason behind each score automatically.","aiPrerequisites":["Basic understanding of Likert and rating scales","Familiarity with website or app UX evaluation","A live website or prototype to measure"],"aiLearningOutcomes":["Identify the 8 SUPR-Q items and the four factors they measure","Convert raw SUPR-Q responses into a percentile rank","Choose between SUPR-Q, SUS, NPS, and CES for a given goal","Pick a defensible sample size for a SUPR-Q benchmark","Run a SUPR-Q study in Koji using scale questions plus AI follow-up"],"aiDifficulty":"intermediate","aiEstimatedTime":"11 min read"}],"pagination":{"total":1,"returned":1,"offset":0}}