Filters
Search
Product type
See more See less
Language
Country
Year of Collection

Somali (Somalia) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDsom_SOM_PHON
TypeText
Unit76,000 words
LanguageSomali
CountrySomalia

Sorani (Iraq) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDkur_IRQ_PHON
TypeText
Unit26,000 words
LanguageSorani
CountryIraq

Sorani (Kurdish) conversational telephony

More info
Common Use CasesASR, Conversational AI, Speech Analytics
Dataset IDSOR_ASR001
TypeAudio
Unit5 hours
LanguageCentral Kurdish (Iran)
CountryIran

Spanish (Argentina) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDspa_ARG_PHON
TypeText
Unit15,000 words
LanguageSpanish
CountryArgentina

Spanish (Chile) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDspa_CHL_PHON
TypeText
Unit15,000 words
LanguageSpanish
CountryChile

Spanish (Colombia) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDspa_COL_PHON
TypeText
Unit15,000 words
LanguageSpanish
CountryColombia

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert