Action videos

Common Use Cases: Movement detection, Human Body Movement Recognition, Action Classification
Dataset ID: VID_ACTION_US
Type: Video
Unit: 281 videos
Language: N/A
Country: United States

Arabic (UAE) printed text annotated OCR

Common Use Cases: Document Processing, Document Search, Text detection
Dataset ID: IMG_OCR_ARU002_CN
Type: Image
Unit: 20,000 images
Language: Arabic
Country: United Arab Emirates

Business-to-business printed text document OCR

Common Use Cases: Document Processing, Document Search, Text detection
Dataset ID: IMG_OCR_B2B
Type: Image
Unit: 5,838 documents
Language: N/A
Country: N/A

Dutch (Netherlands & Belgium) scripted in-car

Common Use Cases: ASR, Virtual Assistant, In Car HMI & Entertainment
Dataset ID: Dutch and Flemish SpeechDat-Car
Type: Audio
Unit: 27 hours
Language: Dutch
Country: Netherlands, Belgium

English (United States) Ultra High-Volume labeled speech

Common Use Cases: ASR, Conversational AI, Speech Analytics, Automatic Captioning, In Car HMI & Entertainment, Virtual Assistant
Dataset ID: USE_UHV001
Type: Audio
Unit: 1,196 hours
Language: English
Country: United States

Finnish (Finland) printed text OCR

Common Use Cases: Document Processing, Document Search, Text detection
Dataset ID: IMG_OCR_FIN_CN
Type: Image
Unit: 7,293 images
Language: Finnish
Country: Finland

Get Started with Off-the-Shelf AI Training Datasets

Appen’s catalog of off-the-shelf (OTS) datasets spans multiple data types, including video, image, and audio, and covers AI applications such as speech recognition, OCR, and action classification. The datasets are built to high standards of quality and accuracy to provide reliable training data for AI models.
