We just shipped GPU-powered OCR inside Docpose.cloud API

Today we launched integrated OCR for paid API users at Docpose.cloud.

If you’ve ever converted a scanned PDF and realized it’s still not searchable, you know the pain.

Now OCR runs directly inside our conversion pipeline.

What’s new:

GPU-accelerated processing
100+ languages supported
Automatic language detection
Optional manual language override
Table detection with structured layout preservation
Complex document handling (multi-column, forms, mixed layouts)
Searchable PDF output

Supported OCR output formats include:

TXT, RTF, DOCX, XLSX, XLS, CSV, PPTX, searchable PDF, EPUB, DJVU, FB2, and more supported by the pipeline.

Examples:

Scanned legal agreement → searchable PDF
Scanned contract → editable DOCX
Scanned invoice → structured XLSX
Scanned book → EPUB
Image → clean TXT or CSV

Why we built this:

Many SaaS teams using our API were handling invoices, compliance files, contracts, and legal documents. Converting format wasn’t enough, they needed structured, machine-readable content.

Instead of forcing a separate OCR workflow, we integrated it directly into the job pipeline.

Architecture notes for builders:

Files are processed via our S3-based system
No local persistence
Source files can be deleted instantly
Converted files follow 24-hour retention
Logs remain scoped for analytics and debugging

This is available now for paid API users.

If you’re building document-heavy workflows, I’d genuinely love feedback, especially around edge cases like multilingual docs or complex legal formatting.

22 views

We just shipped GPU-powered OCR inside Docpose.cloud API

Replies