A large-scale synthetic Arabic OCR dataset comprising 843,622 book-style document images across 10 fonts, designed to advance vision-language models (VLMs) for Arabic text
Robotics and Internet-of-Things
riotu-lab
AI & ML interests
None yet
Recent Activity
updated a dataset about 6 hours ago
riotu-lab/SARD
upvoted a paper 2 months ago
MURAD: A Large-Scale Multi-Domain Unified Reverse Arabic Dictionary Dataset
upvoted a paper 2 months ago
SARD: A Large-Scale Synthetic Arabic OCR Dataset for Book-Style Text Recognition
Organizations
None yet