ThinkFirst
Eval-first startup validation workbench. Turns vague founder ideas into testable assumptions, evidence thresholds, and decision-grade briefs. Built with Cursor, Claude Code, and Braintrust. The golden dataset covers 8 builder archetypes graded on 5 dimensions, with evals run on Gemini 2.5 Flash before any engineering investment. V2 adds multi-pass extraction and Proceed / Investigate / Park / Revise decision logic.
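To make the four-way decision logic concrete, here is a minimal Python sketch of how extracted assumptions and their evidence thresholds might map to Proceed / Investigate / Park / Revise. The description above does not specify the rules, so everything here is an assumption for illustration: the `Assumption` fields, the 0-to-1 evidence scale, the `INVESTIGATE_GAP` cutoff, and the meaning given to each outcome are hypothetical, not ThinkFirst's actual implementation.

```python
from dataclasses import dataclass
from enum import Enum


class Decision(Enum):
    PROCEED = "Proceed"
    INVESTIGATE = "Investigate"
    PARK = "Park"
    REVISE = "Revise"


@dataclass(frozen=True)
class Assumption:
    """One testable claim extracted from a founder idea (hypothetical shape)."""
    statement: str
    evidence_score: float  # assumed scale: 0.0 (no evidence) to 1.0 (strong evidence)
    threshold: float       # evidence level needed to treat the claim as validated
    testable: bool = True  # False when the claim is too vague to test


# Hypothetical cutoff: a shortfall up to this size is worth more research.
INVESTIGATE_GAP = 0.3


def decide(assumptions: list[Assumption]) -> Decision:
    """Map a set of assumptions to one of the four outcomes (illustrative rules)."""
    # Assumed meaning of Revise: the idea must be restated before it can be tested.
    if not assumptions or any(not a.testable for a in assumptions):
        return Decision.REVISE

    # Largest shortfall between required and observed evidence.
    worst_gap = max(a.threshold - a.evidence_score for a in assumptions)

    if worst_gap <= 0:
        return Decision.PROCEED       # every assumption meets its threshold
    if worst_gap <= INVESTIGATE_GAP:
        return Decision.INVESTIGATE   # close enough to justify gathering evidence
    return Decision.PARK              # too far from the threshold for now


if __name__ == "__main__":
    idea = [
        Assumption("Freelancers will pay for automated invoicing", 0.8, 0.6),
        Assumption("Users will connect their bank account", 0.5, 0.6),
    ]
    print(decide(idea).value)
```

Driving the decision from the single worst shortfall is a deliberately conservative choice for this sketch: one weakly supported assumption is enough to hold back a Proceed.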