Deep analysis of 793 Y Combinator companies across 5 batches (W25-W26). NLP clustering, founder archetypes, competitive overlap, AI vs deep-tech breakdown, partner preferences, and industry trends. Data from 1,625 founder bios.
Hey PH 👋
I kept hearing "YC only funds AI wrappers now" and wanted to check if it's actually true. So I scraped every company from the last 5 YC batches.
793 companies. 1,625 founder bios. Every tag, industry, and partner assignment.
Some findings that surprised me:
🔸 Only 15% are thin wrappers - and it's declining
🔸 Deep tech jumped to 29% of the latest batch
🔸 SF startups hire LESS than remote ones
🔸 YC funds near-identical competitors in the same batch - on purpose
🔸 Each partner has a distinct type (Diana Hu = infra, Garry Tan = contrarian bets)
Built with Python (scraping + NLP analysis) and Chart.js. Single HTML file, no backend.
This is a side project - not a product, not a startup. Just a weekend rabbit hole that got out of hand.
Would love feedback, what patterns do you find most interesting?
— Krishna
Replies
What YC Is Really Betting On?