/labs/idea-forge
← Back to /labs
IDEA FORGE LABS · RUBRIC-FLIP STUDY

yOptions — meta self-audit

We ran yOptions through yOptions. Same product, two goal profiles, two honest verdicts.

Confidence delta: 39 pts
THE IDEA · INPUT TO BOTH AUDITS

Idea Forge — a $29 honest-verdict consumer AI service for solo founders. The wedge: ChatGPT cheerleads, friends won't tell the truth, Reddit has no signal. Founders submit an idea and receive one cited verdict — confidence number, flip-conditions, real fetched competitor pricing pages, hard questions, and a goal-aware rubric (lifestyle vs. venture vs. replacement-income). The brand promise is honesty under fire, demonstrated through a published eval harness, visible system prompts, and citation receipts. Currently calibration mode (operator writes verdicts manually); planning Tier 2 with real LLM, then Vercel AI Gateway for multi-model resilience. Built solo by Derek Johnson at Ekow Solutions Group.

Two rubrics. Two verdicts.

Venture rubric
Large TAM. Moats. Scaling unit economics.
PURSUE w/ caveats
61%
The honesty positioning is the most genuinely differentiated feature in a crowded category, but the path to $50k+/mo MRR runs through distribution channels that haven't been built yet — the product moat is real, the funnel is the open question.
Real wedge, real moat, unsolved distribution.
Three structural strengths in Idea Forge that competitors don't match. First, goal-aware rubric is the sharpest moat. None of DimeADozen, ValidatorAI, IdeaProof, GooValid, or ChatGPT ask the founder what success means before evaluating. A stay-at-home parent submitting a $2k/mo info-product idea to DimeADozen gets a venture-rubric verdict and walks away thinking the idea is bad. Idea Forge correctly evaluates against their stated goal. This is a defensible position no one else has — not because they couldn't add it, but because the eval harness investment behind it (rubric specs, scored eval …
Read full venture rubric report →
Lifestyle rubric
$1–5k/month. Under 10 hours/week.
GOAL MISMATCH
22%
Idea Forge is venture-shaped by design — multi-rubric eval engine, prompt iteration loop, agent runner, schema migrations, ongoing prompt-drift management, customer support, refund handling. None of that fits a $1–5k/month with under 10 hours per week budget. The audit isn't grading the concept, it's grading whether the vehicle matches a lifestyle goal.
Genuinely good venture project. Structurally not a lifestyle business.
Three structural problems for the lifestyle rubric — none of which are problems under the venture rubric. First, build cost vs time budget. The architecture investment is already substantial: multi-rubric eval engine, prompt iteration to ≥80% accuracy on a 30-case eval set, pack-router branching on goalProfile, agent runner with cost-bomb defense, citation validator, ConciergeRequest schema, /labs comparison pages, /admin queues. That's 60+ hours of architecture done. Phase 2 (real LLM, Vercel AI Gateway, prompt iteration) is another 30+ hours documented in docs/external-facing-readiness.md. …
Read full lifestyle rubric report →
HOW IS THIS HONEST? · WHY THE SAME IDEA GETS TWO VERDICTS

The audit didn't change its mind. It answered a different question.

Venture rubric asks

Could this be a fundable, scaling business?

Lifestyle rubric asks

Could one person at <10h/week reach $1–5k/month?

Same facts. Inverted signal weights. The audit doesn't reconsider the evidence — it reweights it. What counts as a positive signal under one rubric can be a fatal negative under the other:

  • Small TAMconcern (no path to scale)fine (only ~100 customers needed at $20/mo)
  • No moatconcern (incumbents will copy)fine (organic discovery + niche knowledge IS the moat)
  • 6–12 month sales cycleacceptable for B2B SaaSfatal (no revenue within time budget)
  • Ops linear to revenuefixable with team at scalefatal (no time budget for support)
  • Security-review burdenamortize over many customersfatal (same friction at any scale)

Why this matters for honesty. A single-rubric service that defaults to venture framing would tell a stay-at-home parent or a side-hustler that their idea has “no moat” or “small TAM” — technically correct, but irrelevant to their actual goal. That's being right inside the wrong question. We'd rather ask the question first.

Read the methodology → · See all 8 ideas →

Run a Reality Check on your idea · $29 We'll ask your goal profile up front and evaluate against it specifically.