Filters
Search
Product type
Language
Country
Year of Collection

English (United States) street signs **in development**

More info
Common Use CasesImage recognition, Object recognition, OCR, Text detection
Dataset IDIMG_OCR_USE_STREET002
TypeImage
Unit3500 images
LanguageEnglish
CountryUnited States

English (United States) symbols **in development**

More info
Common Use CasesImage recognition, Object recognition, OCR
Dataset IDIMG_SYMBOLS_US
TypeImage
Unit1500 images
LanguageEnglish
CountryUnited States

English (United States) Ultra High-Volume labeled speech

More info
Common Use CasesASR, Conversational AI, Speech Analytics, Automatic Captioning, In Car HMI & Entertainment, Virtual Assistant
Dataset IDUSE_UHV001
TypeAudio
Unit1196 hours
LanguageEnglish
CountryUnited States

Finnish (Finland) Part of Speech Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDfin_FIN_POS
TypeText
Unit10,000 words
LanguageFinnish
CountryFinland

Finnish (Finland) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDfin_FIN_PHON
TypeText
Unit86,000 words
LanguageFinnish
CountryFinland

French (Algeria) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDfra_DZA_PHON
TypeText
Unit4,000 words
LanguageFrench
CountryAlgeria

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert