Filters
Search
Product type
Language
Country
Year of Collection

Arabic NER news text

More info
Common Use CasesNER, Content Classification, Search Engines
Dataset IDARB_NER001
TypeText
Unit20,774 sentences
LanguageArabic (Standard)
CountryN/A

Cantonese (China) business dialogues

More info
Common Use CasesASR, Conversational AI, Speech Analytics, Business Intelligence
Dataset IDYYDH_ASR001_CN
TypeAudio
Unit98.35 hours
LanguageCantonese
CountryChina

English (United States) product labels **in development**

More info
Common Use CasesImage recognition, Object recognition, Retail
Dataset IDIMG_OCR_USE_ProductLabels
TypeImage
Unit60000 images
LanguageEnglish
CountryUnited States

English NER news text

More info
Common Use CasesNER, Content Classification, Search Engines
Dataset IDENG_NER001
TypeText
Unit22,768 sentences
LanguageEnglish
CountryN/A

Farsi/Persian NER news text

More info
Common Use CasesNER, Content Classification, Search Engines
Dataset IDFAR_NER001
TypeText
Unit19,584 sentences
LanguageIranian Persian
CountryIran

Garments image and video collection **in development**

More info
Common Use CasesImage recognition, Object recognition, Retail, e-commerce
Dataset IDIMG_VID_GARMENTS_US
TypeVideo
Unit300 sessions
LanguageN/A
CountryUnited States

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert