Dataset ID:
IMG_OCR_ARU002_CN
Dataset Name:
Arabic (UAE) printed text annotated OCR
Common Use Cases:
Document Processing, Document Search, Text Detection
Language:
Arabic
Country:
United Arab Emirates
Language Code:
ara
Country Code:
ARE
Product Type:
Image
Detailed Product Type:
Document OCR
Unit:
20,000 images
Recording Device:
Mobile phone
Recording Condition:
Mixed lighting conditions
Contributors:
N/A
Utterances:
N/A
Unique Words:
N/A
Sample Rate (kHz):
N/A
Channels:
N/A
Data Format:
jpg + json
Source:
Appen China
Additional Info:
- Images containing text, such as slogans, advertisements, maps, store names, menus, product packaging, and signboards. Includes bounding box annotations (50 boxes per image), with all text annotated: Arabic characters, non-Arabic characters, special characters, and numbers.
Year of Collection:
2023
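Example (illustrative only):
The listing states that each image ships with a JSON annotation holding bounding boxes and their text, but it does not document the JSON schema. The sketch below shows one way such a file might be read. The field names `image`, `boxes`, `text`, and `bbox`, the `[x, y, width, height]` box layout, and the sample values are all assumptions made for illustration, not the dataset's actual format.

```python
import json
from dataclasses import dataclass
from typing import List


@dataclass
class TextBox:
    """One annotated text region (hypothetical layout)."""
    text: str
    x: float
    y: float
    width: float
    height: float


def parse_annotation(raw: str) -> List[TextBox]:
    """Parse the contents of one annotation file.

    Assumes a hypothetical schema:
    {"image": "<file>.jpg", "boxes": [{"text": ..., "bbox": [x, y, w, h]}]}
    """
    doc = json.loads(raw)
    return [TextBox(b["text"], *b["bbox"]) for b in doc.get("boxes", [])]


# Made-up sample standing in for a real annotation file.
sample = json.dumps(
    {
        "image": "example_0001.jpg",
        "boxes": [
            {"text": "مطعم", "bbox": [12, 40, 180, 64]},
            {"text": "Open 24/7", "bbox": [12, 110, 150, 32]},
        ],
    },
    ensure_ascii=False,
)

boxes = parse_annotation(sample)
```

Once the real schema is known, only `parse_annotation` should need adjusting; the `TextBox` records can then feed a detection or recognition training pipeline.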