Off-the-shelf (OTS) Datasets

English (United States) receipts **in development**

Dataset ID:
IMG_OCR_USE_RECEIPTS
Dataset Name:
English (United States) receipts **in development**
Common Use Cases:
Image recognition, Object recognition, OCR, Text detection
Language:
English
Country:
United States
Language Code:
eng
Country Code:
USA
Product Type
Image
Detailed Product Type
OCR
Unit
4500 images
Recording Device
Camera
Recording Condition
Mixed lighting conditions
Contributors
Available upon request
Utterances
N/A
Unique Words
N/A
Sample Rate (kHz):
N/A
Channels
N/A
Data Format
jpg
Source
Appen Global
Additional Info:
  • Photos of receipts, bills or invoices, annotated with bounding boxes and transcribed text. PII redacted.
  • Data collected, QA is underway, expected to be ready end of Q1 2025. Can be prioritized upon request.
Year of Collection
2024

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert