Off-the-Shelf AI Training Datasets

Somali (Somalia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDsom_SOM_PHON

TypeText

Unit76,000 words

LanguageSomali

CountrySomalia

Sorani (Iraq) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDkur_IRQ_PHON

TypeText

Unit26,000 words

LanguageSorani

CountryIraq

Sorani (Kurdish) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDSOR_ASR001

TypeAudio

Unit5 hours

LanguageCentral Kurdish (Iran)

CountryIran

Spanish (Argentina) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDspa_ARG_PHON

TypeText

Unit15,000 words

LanguageSpanish

CountryArgentina

Spanish (Chile) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDspa_CHL_PHON

TypeText

Unit15,000 words

LanguageSpanish

CountryChile

Spanish (Colombia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDspa_COL_PHON

TypeText

Unit15,000 words

LanguageSpanish

CountryColombia

Somali (Somalia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Sorani (Iraq) Pronunciation Dictionary

Dataset successfully added to the Quote List

Sorani (Kurdish) conversational telephony

Dataset successfully added to the Quote List

Spanish (Argentina) Pronunciation Dictionary

Dataset successfully added to the Quote List

Spanish (Chile) Pronunciation Dictionary

Dataset successfully added to the Quote List

Spanish (Colombia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets