OpenMark - Benchmark AI models for YOUR use case
Test ~100 AI models against YOUR specific prompts. Get deterministic scores, real API costs, and stability metrics.
Built this after discovering that the "best" model for my RAG pipeline was one that performed better AND cost 10x less.
No LLM-as-judge. No voting. Just reproducible results for your actual use case.
• 18 scoring modes
• Real cost/efficiency calculations from API pricing
• Vision & document support
• Beginner-friendly yet capable of deep, complex use
Free tier available
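
For readers wondering what a deterministic score and a cost/efficiency figure can look like in practice, here is a minimal sketch. It is not OpenMark's actual code: the `Run` class, the function names, the exact-match rule, and the per-million-token prices are all assumptions made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Run:
    """One model response plus its token usage (hypothetical structure)."""
    output: str
    input_tokens: int
    output_tokens: int


def exact_match_score(output: str, expected: str) -> float:
    """Deterministic score: 1.0 if the normalized strings match, else 0.0.

    No LLM-as-judge is involved, so the same inputs always give the same score.
    """
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def run_cost_usd(run: Run, input_price_per_mtok: float,
                 output_price_per_mtok: float) -> float:
    """Cost of one run in USD, given assumed prices per million tokens."""
    total = (run.input_tokens * input_price_per_mtok
             + run.output_tokens * output_price_per_mtok)
    return total / 1_000_000


def score_per_dollar(score: float, cost_usd: float) -> float:
    """Simple efficiency metric: score earned per dollar spent."""
    return score / cost_usd if cost_usd > 0 else float("inf")


# Example with made-up numbers: 1000 input tokens, 500 output tokens,
# priced at $2 and $8 per million tokens (illustrative prices only).
run = Run(output="Paris ", input_tokens=1000, output_tokens=500)
score = exact_match_score(run.output, "paris")
cost = run_cost_usd(run, input_price_per_mtok=2.0, output_price_per_mtok=8.0)
efficiency = score_per_dollar(score, cost)
```

Exact match is only one possible scoring rule. The point of the sketch is that scoring by comparison against a known answer is reproducible, and that dividing score by cost lets a cheaper model beat a pricier one on efficiency.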
