Open and portable generative AI for devs and businesses

Mistral OCR - Introducing the world’s best document understanding API

by•1yr ago

Introducing Mistral OCR – a cutting-edge, lightweight Optical Character Recognition model designed for speed, accuracy, and efficiency. Whether extracting text from images or digitizing documents, it delivers state-of-the-art performance with ease.

Replies

Best

This is such a powerful tool for anyone working with text extraction!

Report

1yr ago

Looks solid! Have you tested Mistral OCR on dev-focused use cases like extracting code snippets or structured data from API docs? We’ve been tinkering with an open-source CLI tool for debugging and automation, and good OCR could be a game-changer for handling unstructured input. Would love to hear if it performs well on that!

Report

1yr ago

This looks very interesting! Looking forward to trying this soon!

Report

1yr ago

This is amazing. I have, I kid you not, been working on my own homegrown app to solve this ridiculous problem of OCR document "transforming" that leaves you with a live text but still wonky, sideways, ugly looking document. Based on the examples I'm seeing, I feel slightly better at how getting it to recognize columns and layouts wasn't just my problem, haha.

Couple clarifying questions and forgive me... I have looked but not deeply at your documentation so this may be discussed there:
1. The price is 1000 per dollar...? All your prices have a number next to the $ sign except this one. (So $0.50 for batch processing).
2. It says available via api, cloud coming soon, but then says only selective self-hosting is allowed... so could I use it via api and not self host as lay person at this stage? Or is that still to come? Forgive my ignorance on how Mistral is set up. I've mostly dealt with Anthropic .
3. Is this the correct documentation? OCR and Document Understanding | Mistral AI Large Language Models If so, It looks like a request is returned with markdown? Is there a way to change what it sends back or is markdown the only output option? And how do images get returned accurately per your examples?

Thanks again. Hats off to the crew for this!

Brett

Report

1yr ago

hi everyone

Report

1yr ago

Equip AI Interview

Whether extracting text from images or digitizing documents

Would help to know if this is the best at both. Documents can be classified into scans of offline documents, and digitally created documents. Is it the best at both?

Report

1yr ago