Off-the-Shelf AI Training Datasets

Czech (Czech Republic) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDCzech SpeechDat(E) Dataset

TypeAudio

Unit93 hours

LanguageCzech

CountryCzech Republic

Danish (Denmark) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDdan_DNK_POS

TypeText

Unit100,000 words

LanguageDanish

CountryDenmark

Danish (Denmark) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDdan_DNK_PHON

TypeText

Unit107,000 words

LanguageDanish

CountryDenmark

Danish (Denmark) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDSpeecon Danish

TypeAudio

Unit53 hours

LanguageDanish

CountryDenmark

Dari (Afghanistan) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDprs_AFG_PHON

TypeText

Unit31,000 words

LanguageDari

CountryAfghanistan

Dholuo (Kenya) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDluo_KEN_PHON

TypeText

Unit23,000 words

LanguageDholuo

CountryKenya

Czech (Czech Republic) scripted telephony

Dataset successfully added to the Quote List

Danish (Denmark) Part of Speech Dictionary

Dataset successfully added to the Quote List

Danish (Denmark) Pronunciation Dictionary

Dataset successfully added to the Quote List

Danish (Denmark) scripted microphone

Dataset successfully added to the Quote List

Dari (Afghanistan) Pronunciation Dictionary

Dataset successfully added to the Quote List

Dholuo (Kenya) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets