Ankit Sharma

Mistral OCR - Introducing the world’s best document understanding API

by
Introducing Mistral OCR – a cutting-edge, lightweight Optical Character Recognition model designed for speed, accuracy, and efficiency. Whether extracting text from images or digitizing documents, it delivers state-of-the-art performance with ease.

Add a comment

Replies

Best
Fiona Bao

This is such a powerful tool for anyone working with text extraction!

Egor Martynov

Looks solid! Have you tested Mistral OCR on dev-focused use cases like extracting code snippets or structured data from API docs? We’ve been tinkering with an open-source CLI tool for debugging and automation, and good OCR could be a game-changer for handling unstructured input. Would love to hear if it performs well on that!

Willem van den Eijkel

This looks very interesting! Looking forward to trying this soon!

Brett Hibbler

This is amazing. I have, I kid you not, been working on my own homegrown app to solve this ridiculous problem of OCR document "transforming" that leaves you with a live text but still wonky, sideways, ugly looking document. Based on the examples I'm seeing, I feel slightly better at how getting it to recognize columns and layouts wasn't just my problem, haha.

Couple clarifying questions and forgive me... I have looked but not deeply at your documentation so this may be discussed there:
1. The price is 1000 per dollar...? All your prices have a number next to the $ sign except this one. (So $0.50 for batch processing).
2. It says available via api, cloud coming soon, but then says only selective self-hosting is allowed... so could I use it via api and not self host as lay person at this stage? Or is that still to come? Forgive my ignorance on how Mistral is set up. I've mostly dealt with Anthropic .
3. Is this the correct documentation? OCR and Document Understanding | Mistral AI Large Language Models If so, It looks like a request is returned with markdown? Is there a way to change what it sends back or is markdown the only output option? And how do images get returned accurately per your examples?

Thanks again. Hats off to the crew for this!

Brett

kaylani dulce

hi everyone

Jayanth Neelakanta

Whether extracting text from images or digitizing documents

Would help to know if this is the best at both. Documents can be classified into scans of offline documents, and digitally created documents. Is it the best at both?

Saleh
you guys know what SPEED means
Mounir Mouawad
We can't wait to give it a go over at Portia AI! Well done Mistral team!
Bayram Eker

I really like it

Bayes T

It's good~ We are choosing some document understanding service to support our knowledge base tool R&D.


It's not determined yet. But check our official website: https://www.remio.ai/ if you interested in