Mistral OCR - Introducing the world’s best document understanding API
Introducing Mistral OCR – a cutting-edge, lightweight Optical Character Recognition model designed for speed, accuracy, and efficiency. Whether extracting text from images or digitizing documents, it delivers state-of-the-art performance with ease.
Replies
This is such a powerful tool for anyone working with text extraction!
Looks solid! Have you tested Mistral OCR on dev-focused use cases like extracting code snippets or structured data from API docs? We’ve been tinkering with an open-source CLI tool for debugging and automation, and good OCR could be a game-changer for handling unstructured input. Would love to hear if it performs well on that!
This looks very interesting! Looking forward to trying this soon!
This is amazing. I have, I kid you not, been working on my own homegrown app to solve this ridiculous problem of OCR document "transforming" that leaves you with live text but a still wonky, sideways, ugly-looking document. Based on the examples I'm seeing, I feel slightly better knowing that getting it to recognize columns and layouts wasn't just my problem, haha.
A couple of clarifying questions, and forgive me: I have looked at your documentation, but not deeply, so this may be covered there.
1. The price is 1000 per dollar...? All your other prices have a number next to the $ sign except this one. (So $0.50 for batch processing?)
2. It says it's available via API, with cloud coming soon, but then says only selective self-hosting is allowed. So could I use it via the API, without self-hosting, as a layperson at this stage? Or is that still to come? Forgive my ignorance of how Mistral is set up. I've mostly dealt with Anthropic.
3. Is this the correct documentation? OCR and Document Understanding | Mistral AI Large Language Models. If so, it looks like a request returns markdown? Is there a way to change what it sends back, or is markdown the only output option? And how do images get returned accurately, as in your examples?
Thanks again. Hats off to the crew for this!
Brett
hi everyone
Equip AI Interview
"Whether extracting text from images or digitizing documents": it would help to know if this is the best at both. Documents can be classified into scans of offline documents and digitally created documents. Is it the best at both?
Portia AI
Python Package Index
I really like it
It's good~ We are choosing a document understanding service to support R&D on our knowledge base tool.
It's not determined yet. But check our official website, https://www.remio.ai/, if you're interested.