Aryan Chauhan

Student

About

Full-stack developer building AI infrastructure tools. Currently working on RAG optimization, prompt compression, and production ML systems. Delhi-based. Open source contributor. Always shipping.

Badges

Tastemaker
Gone streaking

Forums

Aryan Chauhan

1d ago

Winnow - Keep the signal. Drop the noise.

Winnow compresses RAG prompts before they hit your LLM, cutting token costs 50%+ while preserving meaning. Uses question-guided filtering + LLMLingua-2 for semantic accuracy.

Key features:
• FastAPI server with OpenAI-compatible proxy
• Batch compression API
• Question-aware filtering keeps answer-relevant tokens
• Docker self-hosting, pip-installable SDK
• MIT licensed
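To give a rough feel for what question-aware filtering means, here is a minimal sketch. It is not Winnow's code and does not use LLMLingua-2: it swaps in plain word overlap as a stand-in relevance score, and the names `tokenize` and `filter_context` are hypothetical, made up for this illustration.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def filter_context(question, passages, keep=2):
    """Keep the `keep` passages sharing the most words with the question.

    Assumption: word overlap stands in for a real relevance model.
    The surviving passages are returned in their original order.
    """
    question_words = tokenize(question)

    # Score each passage by how many distinct question words it contains.
    scored = [
        (len(question_words & tokenize(passage)), index)
        for index, passage in enumerate(passages)
    ]

    # Highest overlap first; ties go to the earlier passage.
    top = sorted(scored, key=lambda item: (-item[0], item[1]))[:keep]

    # Restore document order so the compressed prompt still reads naturally.
    kept_indices = sorted(index for _, index in top)
    return [passages[i] for i in kept_indices]


if __name__ == "__main__":
    passages = [
        "Winnow is MIT licensed.",
        "The weather in Delhi is hot.",
        "Winnow ships a FastAPI server.",
        "Bananas are yellow.",
    ]
    for passage in filter_context("What license does Winnow use?", passages):
        print(passage)
```

A real compressor would likely score at the token or sentence level with a learned model rather than counting shared words, but the shape is the same: rank the retrieved context against the question, then drop what ranks low before the prompt is sent.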