Ideas in motion

Lab

A working matrix of 12 product ideas I've scoped - each one a hypothetical build that maps a real PM skill to a real technical challenge. Most will stay ideas. A few will become projects. All of them are me thinking in public about what's worth making.

By domain

AI Infra

2 ideas

AI Product

1 idea

AI Tooling

1 idea

Career

1 idea

Data Product

1 idea

Developer Tooling

1 idea

Finance Tooling

1 idea

HR Tech

1 idea

Personal Tooling

1 idea

Productivity

1 idea

Vertical AI

1 idea

The matrix - sorted by difficulty

Aviation Ops Anomaly Feed · Expert

Aviation ops teams drown in green-dashboard fatigue. A feed that only surfaces anomalies (delay drivers, crew-conflict clusters, gate-change cascades) with root-cause breadcrumbs.

Vertical AI · Domain modelling, alert-fatigue management

Eval Harness as a Service · Hard

Small AI-PM teams keep rebuilding the same golden-set → re-score → diff pipeline. Offer it as a hosted tool with a drop-in SDK, shadow-traffic mode, and per-prompt-version accuracy deltas.

AI Infra · Developer-tool PM, eval design
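To make the golden-set → re-score → diff loop concrete, here is a minimal sketch, not the hosted tool itself. It assumes exact-match scoring and treats a "prompt version" as any callable from input to output; `score`, `accuracy`, `diff`, and the sample data are all hypothetical names invented for illustration.

```python
from typing import Callable, Dict, List, Tuple

GoldenSet = List[Tuple[str, str]]  # (input, expected output) pairs


def score(model: Callable[[str], str], golden: GoldenSet) -> Dict[str, bool]:
    """Re-score one prompt version: exact-match pass/fail per golden case."""
    return {case: model(case) == expected for case, expected in golden}


def accuracy(results: Dict[str, bool]) -> float:
    """Share of golden cases that passed (0.0 for an empty run)."""
    return sum(results.values()) / len(results) if results else 0.0


def diff(old: Dict[str, bool], new: Dict[str, bool]) -> dict:
    """Accuracy delta between two runs, plus the cases that flipped."""
    return {
        "accuracy_delta": accuracy(new) - accuracy(old),
        "regressions": sorted(c for c in old if old[c] and not new.get(c, False)),
        "fixes": sorted(c for c in old if not old[c] and new.get(c, False)),
    }


def as_model(answers: Dict[str, str]) -> Callable[[str], str]:
    """Stand-in for a real model call: looks answers up in a table."""
    return lambda case: answers.get(case, "")


# Hypothetical golden set and two stand-in prompt versions.
golden = [("2+2", "4"), ("capital of France", "Paris"), ("3*3", "9"), ("1+1", "2")]
v1_answers = {"2+2": "4", "capital of France": "Lyon", "3*3": "9", "1+1": "2"}
v2_answers = {"2+2": "4", "capital of France": "Paris", "3*3": "6", "1+1": "2"}

report = diff(score(as_model(v1_answers), golden), score(as_model(v2_answers), golden))
print(report)
```

In this sample the two versions have the same accuracy, yet one case regressed and another was fixed, which is why the diff lists flipped cases and not just the headline delta.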

PRD → Eval Set Converter · Hard

PMs write PRDs; engineers ship without measurable acceptance. Parse a PRD's success criteria into a structured eval set (golden cases, scoring rubric, pass/fail gate) before a single line of code.

AI Tooling · Structured thinking, acceptance criteria

Open-Source Contribution Scout · Hard

First-time contributors can't find tractable issues. Given a GitHub profile + stack, surface good-first-issues across repos filtered by real complexity (not just the label).

Developer Tooling · Funnel design, trust metrics

Cost-Per-Answer Calculator · Medium

Every AI team underestimates unit economics until an invoice lands. A live calculator that takes model, tokens-in, tokens-out, cache hit rate, and batch size → returns $/request + $/user/month + envelope check.

AI Infra · Unit-economics thinking, pricing
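A rough sketch of the arithmetic behind such a calculator, under stated assumptions: prices are placeholder USD-per-million-token figures (not real vendor rates), cached input tokens are billed at a separate discounted rate, and the envelope is a per-user monthly budget. Batch size is left out because its effect on price depends on the provider. `ModelPrice`, `cost_per_request`, and `cost_report` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    """Placeholder prices in USD per 1M tokens; not real vendor rates."""
    input_per_m: float
    cached_input_per_m: float
    output_per_m: float


def cost_per_request(price: ModelPrice, tokens_in: int, tokens_out: int,
                     cache_hit_rate: float) -> float:
    """USD for one request, splitting input tokens into cached and fresh."""
    cached = tokens_in * cache_hit_rate
    fresh = tokens_in - cached
    total = (fresh * price.input_per_m
             + cached * price.cached_input_per_m
             + tokens_out * price.output_per_m)
    return total / 1_000_000


def cost_report(price: ModelPrice, tokens_in: int, tokens_out: int,
                cache_hit_rate: float, requests_per_user_month: int,
                envelope_per_user_month: float) -> dict:
    """$/request, $/user/month, and whether that fits the budget envelope."""
    per_request = cost_per_request(price, tokens_in, tokens_out, cache_hit_rate)
    per_user = per_request * requests_per_user_month
    return {
        "usd_per_request": per_request,
        "usd_per_user_month": per_user,
        "within_envelope": per_user <= envelope_per_user_month,
    }


# Assumed inputs for illustration only.
example_price = ModelPrice(input_per_m=2.0, cached_input_per_m=0.5, output_per_m=8.0)
report = cost_report(example_price, tokens_in=1000, tokens_out=500,
                     cache_hit_rate=0.5, requests_per_user_month=200,
                     envelope_per_user_month=1.00)
print(report)
```

With these made-up numbers a request costs about half a cent, and 200 requests per user per month lands just over a $1.00 envelope, so the check fails.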

Churn-Risk Explainer · Medium

Churn models output a score; CS teams need a *reason*. Produce a one-sentence, feature-level explanation per at-risk user, ranked by recoverability.

Data Product · Data-driven retention, narrative from numbers

Plant-Health Citizen Dataset · Medium

Aarkid's golden set is 50 photos. Crowdsource a labelled dataset (10k+ photos across failure modes) with contributor attribution and licence clarity.

AI Product · Marketplace liquidity, contributor incentives

Meeting → Decision Log · Medium

Transcripts are noise. Extract *decisions made*, *blockers raised*, and *owners assigned* into a searchable log that survives the meeting.

Productivity · Information architecture, decision hygiene

Interview-Loop Feedback Aggregator · Easy

Hiring loops collect 4 independent writeups, then drown in a debrief Slack thread. Aggregate them into a structured rubric summary + dissenting-opinion highlight before the debrief.

HR Tech · Hiring process design, signal vs noise
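One possible shape for the rubric summary and dissent highlight, as a sketch only. It assumes each writeup carries numeric scores per competency on a 1-4 scale and flags an interviewer as dissenting when their score sits at least `dissent_gap` points from the median; the data layout, the threshold, and the interviewer names are all invented for illustration.

```python
from statistics import mean, median
from typing import Dict, List


def aggregate(writeups: List[dict], dissent_gap: int = 2) -> Dict[str, dict]:
    """Per-competency mean and median, plus interviewers far from the median."""
    competencies = sorted({c for w in writeups for c in w["scores"]})
    summary = {}
    for competency in competencies:
        scored = [(w["interviewer"], w["scores"][competency])
                  for w in writeups if competency in w["scores"]]
        values = [s for _, s in scored]
        mid = median(values)
        summary[competency] = {
            "mean": mean(values),
            "median": mid,
            # Dissent: anyone whose score is dissent_gap or more from the median.
            "dissent": [name for name, s in scored if abs(s - mid) >= dissent_gap],
        }
    return summary


# Hypothetical loop of four independent writeups.
writeups = [
    {"interviewer": "ali", "scores": {"product sense": 3, "execution": 3}},
    {"interviewer": "bea", "scores": {"product sense": 3, "execution": 3}},
    {"interviewer": "cam", "scores": {"product sense": 4, "execution": 3}},
    {"interviewer": "dana", "scores": {"product sense": 1, "execution": 3}},
]

summary = aggregate(writeups)
for competency, row in summary.items():
    print(competency, row)
```

Surfacing the dissenter by name before the debrief gives the outlier view airtime instead of letting the average bury it.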

Reading-Log → Idea Graph · Medium

Highlights sit in Readwise; ideas sit in notes; the two never talk to each other. Build a graph view where highlights cluster by concept and surface adjacency between books.

Personal Tooling · Information design, spaced repetition

Micro-SaaS Unit-Economics Sim · Easy

Solo founders need a back-of-envelope model that takes an MRR curve, CAC, churn, and infra cost/user, and returns month-by-month cash and a break-even date without a spreadsheet.

Finance Tooling · Business modelling, scenario planning
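A back-of-envelope version of that simulation might look like the sketch below. Several simplifications are assumptions of this example: the MRR curve is derived from monthly signups times a flat price, churn is a constant monthly rate, CAC is paid in the signup month, and "break-even" means the first month in which cumulative cash is back at or above the starting balance. `simulate` and its parameters are hypothetical names.

```python
from typing import List, Optional, Tuple


def simulate(new_customers: List[int], price: float, cac: float,
             churn: float, infra_cost_per_user: float,
             starting_cash: float = 0.0) -> Tuple[List[dict], Optional[int]]:
    """Return month-by-month rows and the break-even month (None if never)."""
    cash = starting_cash
    active = 0.0
    rows = []
    break_even = None
    for month, signups in enumerate(new_customers, start=1):
        active = active * (1 - churn) + signups       # churn, then add signups
        mrr = active * price                          # revenue this month
        spend = signups * cac + active * infra_cost_per_user
        cash += mrr - spend
        rows.append({"month": month, "active": active, "mrr": mrr, "cash": cash})
        if break_even is None and cash >= starting_cash:
            break_even = month
    return rows, break_even


# Assumed scenario: three months of acquisition, then none.
rows, break_even = simulate(
    new_customers=[10, 10, 10, 0, 0, 0],
    price=30.0, cac=60.0, churn=0.1, infra_cost_per_user=5.0,
)
for row in rows:
    print(f"month {row['month']}: cash {row['cash']:.2f}")
print("break-even month:", break_even)
```

In this scenario cash goes negative while CAC is being spent and recovers once acquisition stops, which is the kind of shape a founder would want to see before committing to a paid channel.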

AI-PM Interview Prep Kit · Medium

AI-PM interview loops are new enough that nobody has a clean rubric. Ship a structured practice kit: problem framings, eval-design exercises, cost-envelope drills, mock transcripts with feedback.

Career · Pedagogy, interview-loop design

Source data: ideas.json. Want to build one of these together? Let's talk.