Multimodal OCR
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
All in One Banana for you!
Detect AI-generated content in images, videos, and audio
Detect and annotate objects in images
Real-time video captioning powered by FastVLM
Easy converting PDF and Office docs into Markdown and JSON
A multilingual PDF translator that preserves document layout
Generate a podcast to discuss the topic of your choice!
Convert PDFs to text using OCR
Generate a preview image from a PDF file
Split & merge PDFs in-memory (fast and private!)
Extract text and metadata from PDF files
Generate spokenβstyle scripts from documents
Convert PDF to text using OCR
Traduit n'import quel doc dans n'importe quelle langue
Convert text/image/audio/video from src language to English
Clarity AI Upscaler Reproduction