Why This Matters
Unlimited OCR introduces a groundbreaking one-shot long-horizon parsing model that significantly enhances the ability to process complex, multi-page documents and images efficiently. Its deployment on popular frameworks like Huggingface and compatibility with NVIDIA GPUs makes it accessible for a wide range of applications, promising to advance document analysis and automation in the tech industry.
Key Takeaways
- Supports multi-page and PDF parsing with high accuracy and efficiency.
- Built on transformer architecture, optimized for NVIDIA GPUs and popular AI frameworks.
- Available on ModelScope, enabling easy integration into existing workflows.
Unlimited OCR Works
Welcome the Era of One-shot Long-horizon Parsing.
Release
[2026/06/23] π Our paper is now available on arXiv.
[2026/06/23] π€ Thanks to the ModelScope community for their support. Our model is now available at ModelScope.
[2026/06/22] π We present Unlimited-OCR, aiming to push Deepseek-OCR one step further.
Inference
Transformers
Inference using Huggingface transformers on NVIDIA GPUs. Requirements tested on python 3.12.3 + CUDA12.9οΌ
torch==2.10.0 torchvision==0.25.0 transformers==4.57.1 Pillow==12.1.1 matplotlib==3.10.8 einops==0.8.2 addict==2.4.0 easydict==1.13 pymupdf==1.27.2.2 psutil==7.2.2
... continue reading