Off-the-Shelf AI Training Datasets

Dari (Afghanistan) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDprs_AFG_PHON

TypeText

Unit31,000 words

LanguageDari

CountryAfghanistan

Dholuo (Kenya) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDluo_KEN_PHON

TypeText

Unit23,000 words

LanguageDholuo

CountryKenya

Dongbei dialect (China) Conversational Speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDDONGBEI_ASR001_CN

TypeAudio

Unit84.6 hours

LanguageDongbei dialect

CountryChina

Dongbei dialect (China) Conversational Speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDDONGBEI_ASR002_CN

TypeAudio

Unit75.2 hours

LanguageDongbei dialect

CountryChina

Dutch (Belgium) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDSpeecon Dutch from Belgium

TypeAudio

Unit47 hours

LanguageDutch

CountryBelgium

Dutch (Belgium) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDFlemish SpeechDat(II) FDB-1000 (FIXED1FL)

TypeAudio

Unit80 hours

LanguageDutch

CountryBelgium

Dari (Afghanistan) Pronunciation Dictionary

Dataset successfully added to the Quote List

Dholuo (Kenya) Pronunciation Dictionary

Dataset successfully added to the Quote List

Dongbei dialect (China) Conversational Speech

Dataset successfully added to the Quote List

Dongbei dialect (China) Conversational Speech

Dataset successfully added to the Quote List

Dutch (Belgium) scripted microphone

Dataset successfully added to the Quote List

Dutch (Belgium) scripted telephony

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets