We just shipped GPU-powered OCR inside Docpose.cloud API
Today we launched integrated OCR for paid API users at Docpose.cloud.
If you’ve ever converted a scanned PDF and realized it’s still not searchable, you know the pain.
Now OCR runs directly inside our conversion pipeline.
What’s new:
GPU-accelerated processing
100+ languages supported
Automatic language detection
Optional manual language override
Table detection with structured layout preservation
Complex document handling (multi-column, forms, mixed layouts)
Searchable PDF output
Supported OCR output formats include:
TXT, RTF, DOCX, XLSX, XLS, CSV, PPTX, searchable PDF, EPUB, DJVU, FB2, and more supported by the pipeline.
Examples:
Scanned legal agreement → searchable PDF
Scanned contract → editable DOCX
Scanned invoice → structured XLSX
Scanned book → EPUB
Image → clean TXT or CSV
Why we built this:
Many SaaS teams using our API were handling invoices, compliance files, contracts, and legal documents. Converting format wasn’t enough, they needed structured, machine-readable content.
Instead of forcing a separate OCR workflow, we integrated it directly into the job pipeline.
Architecture notes for builders:
Files are processed via our S3-based system
No local persistence
Source files can be deleted instantly
Converted files follow 24-hour retention
Logs remain scoped for analytics and debugging
This is available now for paid API users.
If you’re building document-heavy workflows, I’d genuinely love feedback, especially around edge cases like multilingual docs or complex legal formatting.

Replies