VLMs OCR Playground
Compare and use multiple state-of-the-art Vision-Language OCR models:
- DeepSeek-OCR-2: Document to markdown with layout detection
- GLM-OCR: Specialized recognition for text, formulas, and tables
- PaddleOCR-VL-1.5: Full-page document parsing with layout detection
Select a model below to get started!
⚠️ Current Deployment Status:
- DeepSeek-OCR-2: Requires GPU - temporarily unavailable on this CPU-only deployment
- GLM-OCR: Available ✅
- PaddleOCR-VL-1.5: Available ✅ (requires API key)
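The status list above amounts to a per-model requirements check: one model needs a GPU, one needs an API key, and one has no extra requirements. A minimal sketch of how such a check could be expressed follows. It is not the playground's actual code; `ModelSpec`, `MODELS`, `availability`, and `selectable_models` are hypothetical names, and only the model names, descriptions, and requirements are taken from this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical record describing one OCR model and what it needs to run."""
    name: str
    description: str
    needs_gpu: bool = False
    needs_api_key: bool = False


# Requirements mirror the deployment status list; the structure is an assumption.
MODELS = [
    ModelSpec("DeepSeek-OCR-2", "Document to markdown with layout detection",
              needs_gpu=True),
    ModelSpec("GLM-OCR", "Specialized recognition for text, formulas, and tables"),
    ModelSpec("PaddleOCR-VL-1.5", "Full-page document parsing with layout detection",
              needs_api_key=True),
]


def availability(spec, has_gpu, api_key=None):
    """Return (available, reason) for one model on the current deployment."""
    if spec.needs_gpu and not has_gpu:
        return False, "requires a GPU"
    if spec.needs_api_key and not api_key:
        return False, "requires an API key"
    return True, "available"


def selectable_models(has_gpu, api_key=None):
    """Names of the models that could be offered in the model selector."""
    return [m.name for m in MODELS if availability(m, has_gpu, api_key)[0]]
```

On a CPU-only deployment with a key supplied, `selectable_models(has_gpu=False, api_key="...")` returns `["GLM-OCR", "PaddleOCR-VL-1.5"]`, which matches the status shown above. Passing `has_gpu` in as a plain flag keeps the sketch free of any GPU-detection dependency.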
Select OCR Model
Full-page document parsing with layout detection and element-level recognition.