Deep analysis of 793 Y Combinator companies across 5 batches (W25-W26). NLP clustering, founder archetypes, competitive overlap, AI vs deep-tech breakdown, partner preferences, and industry trends. Data from 1,625 founder bios.
Hey PH đ
I kept hearing "YC only funds AI wrappers now" and wanted to check if it's actually true. So I scraped every company from the last 5 YC batches.
793 companies. 1,625 founder bios. Every tag, industry, and partner assignment.
Some findings that surprised me:
đ¸ Only 15% are thin wrappers - and it's declining
đ¸ Deep tech jumped to 29% of the latest batch
đ¸ SF startups hire LESS than remote ones
đ¸ YC funds near-identical competitors in the same batch - on purpose
đ¸ Each partner has a distinct type (Diana Hu = infra, Garry Tan = contrarian bets)
Built with Python (scraping + NLP analysis) and Chart.js. Single HTML file, no backend.
This is a side project - not a product, not a startup. Just a weekend rabbit hole that got out of hand.
Would love feedback, what patterns do you find most interesting?
â Krishna
Replies
What YC Is Really Betting On?