📥 Model Download | 📄 Paper Link | 📄 Arxiv Paper Link |
Explore the boundaries of visual-text compression.
Release
[2025/x/x]🚀🚀🚀 We release DeepSeek-OCR, a model to investigate the role of vision encoders from an LLM-centric viewpoint.
Contents
Install
Our environment is cuda11.8+torch2.6.0.
Clone this repository and navigate to the DeepSeek-OCR folder
git clone https://github.com/deepseek-ai/DeepSeek-OCR.git
Conda
... continue reading