Case Study · E-Commerce · Product Discovery & Trust
A 3-phase product exercise for Amazon - from identifying the review trust crisis to designing a full Review Integrity System (ARIS) with a PRD, wireframes, RICE prioritisation, and rollout plan.
Phase 1 · Step 01
The starting point was a product I use often and have both appreciation for and small frustrations with. Amazon. Below is the friction map across its 4 core flows.
Phase 2 · Step 02
3 out of 5 interviewees said some version of "I don't fully trust Amazon reviews." What surprised me was the emotion behind it - not just frustration, but feeling cheated.
I wasted almost 25 minutes comparing identical mobile holders. Every listing had 4+ stars and the reviews looked copy-pasted or copied from ChatGPT.
The product pictures looked amazing. The real thing that arrived looked cheap. I honestly felt cheated.
For my kid's lunchbox, I was genuinely stressed. I don't want to rely on random reviews when it's for my child.
Most users described the same pattern: search, open 5-10 tabs of similar listings, scroll ratings, skim reviews, zoom user photos, then cross-check on YouTube.
By the time I reach a decision, I am just mentally done. I click and hope for the best.
Phase 2 · Step 03
Phase 2 · Step 04
Users are struggling to confidently evaluate product quality during discovery because the current review experience feels noisy, repetitive, and hard to trust. This leads to longer decision times, mental fatigue, and drop-offs before purchase - especially in high-anxiety categories like fashion, kids' products, and home/kitchen.
| Problem | Population | Intensity | Frequency | Total |
|---|---|---|---|---|
| P1: Low trust in reviews during discovery | 5 - cuts across almost all shoppers | 5 - directly hits confidence and mental effort | 5 - shows up in most medium+ consideration purchases | 15 |
| P2: Confusing deals and sponsored results | 4 - strongest for price-sensitive users | 4 - leads to regret and mistrust | 4 - spikes around sales, not every purchase | 12 |
| P3: Inconsistent repeat orders | 3 - mostly repeat buyers in certain categories | 3 - annoying but usually fixable via returns | 3 - occasionally, not every reorder | 9 |
Phase 3 · Step 05
11 solutions were brainstormed across four categories, then scored using RICE to identify the top 3 for the MVP.
| Solution | Reach | Impact | Confidence | Effort | Score |
|---|---|---|---|---|---|
| Review Integrity Check + Pattern Detection | 3 | 3 | 3 | 3 | 9 |
| Confidence Meter (Trust Score) | 3 | 3 | 2 | 2 | 9 |
| Customizable Quick Compare | 3 | 3 | 3 | 3 | 9 |
| Long-term Use Review Badge | 2 | 2 | 2 | 1 | 8 |
| Theme Keyword Clusters | 3 | 2 | 2 | 2 | 6 |
| "Why different" listing highlight | 2 | 2 | 2 | 2 | 4 |
| Category Trust Signals | 2 | 2 | 1 | 2 | 4 |
Ideas not scored but kept as moonshots: Social proof layer, Cross-platform sentiment summary (YouTube/Instagram), AR try-out expansion, Audio reviews.
Phase 3 · Step 06
The three winning solutions are unified under a single system: ARIS. Together they address review noise, trust signalling, and comparison fatigue.
Search → Product Page → Read dozens of reviews → Open 5-10 tabs → Cross-check on YouTube → Maybe buy
Search → Product Page → Confidence Meter → Review Quality Snapshot → Quick Compare → Buy with confidence
Displayed as a percentage bar below the product price. Example: "Overall Trust Confidence: 82%". A simple label accompanies it: "Trusted Seller" or "New Seller".
QC auto-selects the top 3 most similar listings using: image similarity, product attributes matching, title/model matching, price range proximity, and category signals.
Users can then: remove any column, add a product manually, or add from the Related Products / Customers Also Viewed sections via the Quick Compare icon on each card.
| Row | Description |
|---|---|
| Product Image | With 'x' remove button per column |
| Product Name | Full name as per listing |
| Seller | Seller name, clickable link to seller page |
| Price | With any discounts |
| Rating | Stars + number of ratings |
| Confidence Meter | Trust score bar + label |
| Return Rate | Recent return rate for this product |
| Key Differences | Side-by-side seller/product comparison and differences |
| Warning Badge | e.g. "Higher number of potential fake reviews detected" |
| Shipping Details | Expected delivery date |
| Actions | Add to Cart + Buy Now buttons |
Horizontal scroll appears if more than 3 items are compared.
Phase 3 · Step 07
| Type | Metric | Definition | Why it matters |
|---|---|---|---|
| Key | Product Page Conversion Rate | % of users who buy after viewing product page | Direct measure of reduced decision fatigue |
| Secondary | Time to Purchase | Average time between first view and purchase | Should decrease with summary and trust signals |
| Secondary | Review Scroll Depth | How deep users scroll into reviews | Should reduce significantly with AI summary |
| Secondary | Compare Tab Open Count | Number of similar product tabs opened | Less confusion = fewer tabs needed |
| Guardrail | Return Rate | % of returns post purchase | Ensures trust improvements are not misleading users |
| Guardrail | Customer Complaints | Category-specific issue tickets | Monitors user sentiment after changes |
| Assumption | Failure Mode | Impact | Likelihood | Mitigation | Rollback |
|---|---|---|---|---|---|
| AI correctly identifies low-credibility reviews | High-credibility reviews flagged OR fake reviews missed | High | Medium | Conservative scoring; manual review of edge cases; supervised fine-tuning; user challenge option (high-friction) | Temporarily relax pattern detection; reprocess flagged reviews |
| AI Quick Summary is perceived as neutral and helpful | Summary feels biased, oversimplified, or incorrect - users lose trust | Medium | Low-Medium | Transparency disclaimer; "Was this summary helpful?" feedback | Pause AI summary; fallback to review highlights |
| AI selects truly comparable products for Quick Compare | Quick Compare shows mismatched or irrelevant items | High | Medium | Multi-signal similarity (image + attributes + title matching); internal validation set | Lower AI similarity threshold; fallback to Amazon's existing matching logic |
Phase 3 · Step 08