Off-the-shelf (OTS) Datasets

Business-to-business printed text document OCR

Dataset ID:
IMG_OCR_B2B
Dataset Name:
Business-to-business printed text document OCR
Common Use Cases:
Document Processing, Document Search, Text Detection
Language:
N/A
Country:
N/A
Language Code:
N/A
Country Code:
N/A
Product Type:
Image
Detailed Product Type:
Document OCR
Unit:
5,838 documents
Recording Device:
Camera, scanner
Recording Condition:
Mixed lighting conditions
Contributors:
N/A
Utterances:
N/A
Unique Words:
N/A
Sample Rate (kHz):
N/A
Channels:
N/A
Data Format:
png
Source:
Appen Global
Additional Info:
  • Scans and photographs of business-to-business documents containing printed text. 38% are Premium Quality images covering 10 languages and 25 countries; document types include Purchase Order, Payment Advice or Remittance Advice, Order Confirmation, and Delivery Note. 64% are Standard Quality images captured in various challenging conditions, covering 11 languages and 34 countries across a wider range of categories: Complaints or Return, Delivery Advice, Delivery Note, Dunning, Goods Receipt, Invoice, Offer, Order Confirmation, Pay Slip, Payment Advice or Remittance Advice, Purchase Order, Receipt, and Supplier Load
Year of Collection:
2021

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.
