IDEA FORGE LABS · RUBRIC-FLIP STUDY

yOptions — meta self-audit

We ran yOptions through yOptions. Same product, two goal profiles, two honest verdicts.

Confidence delta: 39 pts

THE IDEA · INPUT TO BOTH AUDITS

Idea Forge: a $29 honest-verdict consumer AI service for solo founders. The wedge: ChatGPT cheerleads, friends won't tell the truth, Reddit has no signal. Founders submit an idea and receive one cited verdict: confidence number, flip-conditions, real fetched competitor pricing pages, hard questions, and a goal-aware rubric (lifestyle vs. venture vs. replacement-income). The brand promise is honesty under fire, demonstrated through a published eval harness, visible system prompts, and citation receipts. The service is currently in calibration mode (the operator writes verdicts manually); Tier 2 with a real LLM is planned next, followed by Vercel AI Gateway for multi-model resilience. Built solo by Derek Johnson at Ekow Solutions Group.

Two rubrics. Two verdicts.

Venture rubric

Large TAM. Moats. Scaling unit economics.

PURSUE w/ caveats

61%

The honesty positioning is the most genuinely differentiated feature in a crowded category, but the path to $50k+/mo MRR runs through distribution channels that haven't been built yet — the product moat is real, the funnel is the open question.

Real wedge, real moat, unsolved distribution.

Idea Forge has three structural strengths that competitors don't match. First, the goal-aware rubric is the sharpest moat. None of DimeADozen, ValidatorAI, IdeaProof, GooValid, or ChatGPT ask the founder what success means before evaluating. A stay-at-home parent submitting a $2k/mo info-product idea to DimeADozen gets a venture-rubric verdict and walks away thinking the idea is bad. Idea Forge correctly evaluates against their stated goal. This is a defensible position no one else has, not because they couldn't add it, but because the eval harness investment behind it (rubric specs, scored eval …

Read full venture rubric report →

Lifestyle rubric

$1–5k/month. Under 10 hours/week.

GOAL MISMATCH

22%

Idea Forge is venture-shaped by design: multi-rubric eval engine, prompt iteration loop, agent runner, schema migrations, ongoing prompt-drift management, customer support, refund handling. None of that fits a goal of $1–5k/month on a budget of under 10 hours per week. The audit isn't grading the concept; it's grading whether the vehicle matches a lifestyle goal.

Genuinely good venture project. Structurally not a lifestyle business.

Three structural problems for the lifestyle rubric — none of which are problems under the venture rubric. First, build cost vs time budget. The architecture investment is already substantial: multi-rubric eval engine, prompt iteration to ≥80% accuracy on a 30-case eval set, pack-router branching on goalProfile, agent runner with cost-bomb defense, citation validator, ConciergeRequest schema, /labs comparison pages, /admin queues. That's 60+ hours of architecture done. Phase 2 (real LLM, Vercel AI Gateway, prompt iteration) is another 30+ hours documented in docs/external-facing-readiness.md. …

Read full lifestyle rubric report →

HOW IS THIS HONEST? · WHY THE SAME IDEA GETS TWO VERDICTS

The audit didn't change its mind. It answered a different question.

Venture rubric asks

Could this be a fundable, scaling business?

Lifestyle rubric asks

Could one person at <10h/week reach $1–5k/month?

Same facts. Inverted signal weights. The audit doesn't reconsider the evidence — it reweights it. What counts as a positive signal under one rubric can be a fatal negative under the other:

Signal | Venture rubric | Lifestyle rubric
Small TAM | concern (no path to scale) | fine (only ~100 customers needed at $20/mo)
No moat | concern (incumbents will copy) | fine (organic discovery + niche knowledge IS the moat)
6–12 month sales cycle | acceptable for B2B SaaS | fatal (no revenue within time budget)
Ops linear to revenue | fixable with team at scale | fatal (no time budget for support)
Security-review burden | amortize over many customers | fatal (same friction at any scale)
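
The reweighting above can be sketched as a plain lookup keyed by goal profile. This is a minimal illustration, not the service's actual engine: the key names, the READINGS structure, and both helper functions are hypothetical, and only the reading strings are taken from the comparison above. The point it shows is that the observed signals stay fixed while the rubric chooses how each one is read.

```python
from __future__ import annotations

# Readings copied from the comparison above. The dictionary layout and the
# snake_case key names are assumptions made for this sketch.
READINGS = {
    "small_tam": {
        "venture": "concern (no path to scale)",
        "lifestyle": "fine (only ~100 customers needed at $20/mo)",
    },
    "no_moat": {
        "venture": "concern (incumbents will copy)",
        "lifestyle": "fine (organic discovery + niche knowledge IS the moat)",
    },
    "long_sales_cycle": {
        "venture": "acceptable for B2B SaaS",
        "lifestyle": "fatal (no revenue within time budget)",
    },
    "ops_linear_to_revenue": {
        "venture": "fixable with team at scale",
        "lifestyle": "fatal (no time budget for support)",
    },
    "security_review_burden": {
        "venture": "amortize over many customers",
        "lifestyle": "fatal (same friction at any scale)",
    },
}


def read_signals(observed: list[str], goal_profile: str) -> dict[str, str]:
    """Return each observed signal's reading under the chosen rubric."""
    return {signal: READINGS[signal][goal_profile] for signal in observed}


def fatal_signals(observed: list[str], goal_profile: str) -> list[str]:
    """List the observed signals that the chosen rubric reads as fatal."""
    return [
        signal
        for signal in observed
        if READINGS[signal][goal_profile].startswith("fatal")
    ]


if __name__ == "__main__":
    # Same facts for both audits; only the goal profile changes.
    observed = ["small_tam", "long_sales_cycle"]
    for profile in ("venture", "lifestyle"):
        print(profile, read_signals(observed, profile))
        print(profile, "fatal:", fatal_signals(observed, profile))
```

Keeping the evidence list separate from the rubric lookup mirrors the claim that the audit reweights rather than reconsiders: swapping the goal profile changes the readings without touching the observed signals.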

Why this matters for honesty. A single-rubric service that defaults to venture framing would tell a stay-at-home parent or a side-hustler that their idea has “no moat” or “small TAM” — technically correct, but irrelevant to their actual goal. That's being right inside the wrong question. We'd rather ask the question first.

Read the methodology → · See all 8 ideas →

Run a Reality Check on your idea · $29 →

We'll ask your goal profile up front and evaluate against it specifically.