Off-the-Shelf AI Training Datasets

Dari (Afghanistan) broadcast

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Automatic Captioning, Keyword Spotting

Dataset IDDAR_BRC001

TypeAudio

Unit49 hours

LanguageDari

CountryAfghanistan

Dari (Afghanistan) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDprs_AFG_PHON

TypeText

Unit31,000 words

LanguageDari

CountryAfghanistan

Dholuo (Kenya) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDluo_KEN_PHON

TypeText

Unit23,000 words

LanguageDholuo

CountryKenya

Dutch (Belgium) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDSpeecon Dutch from Belgium

TypeAudio

Unit47 hours

LanguageDutch

CountryBelgium

Dutch (Netherlands) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDnld_NLD_PHON

TypeText

Unit45,000 words

LanguageDutch

CountryNetherlands

Dutch (Netherlands) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDSpeecon Dutch from the Netherlands

TypeAudio

Unit68 hours

LanguageDutch

CountryNetherlands

Dari (Afghanistan) broadcast

Dataset successfully added to the Quote List

Dari (Afghanistan) Pronunciation Dictionary

Dataset successfully added to the Quote List

Dholuo (Kenya) Pronunciation Dictionary

Dataset successfully added to the Quote List

Dutch (Belgium) scripted microphone

Dataset successfully added to the Quote List

Dutch (Netherlands) Pronunciation Dictionary

Dataset successfully added to the Quote List

Dutch (Netherlands) scripted microphone

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets