Off-the-Shelf AI Training Datasets

Czech (Czech Republic) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDces_CZE_PHON

TypeText

Unit50,000 words

LanguageCzech

CountryCzech Republic

Danish (Denmark) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDdan_DNK_POS

TypeText

Unit100,000 words

LanguageDanish

CountryDenmark

Danish (Denmark) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDdan_DNK_PHON

TypeText

Unit107,000 words

LanguageDanish

CountryDenmark

Dari (Afghanistan) broadcast

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Automatic Captioning, Keyword Spotting

Dataset IDDAR_BRC001

TypeAudio

Unit49 hours

LanguageDari

CountryAfghanistan

Dari (Afghanistan) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDDAR_ASR001

TypeAudio

Unit40 hours

LanguageDari

CountryAfghanistan

Dari (Afghanistan) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDprs_AFG_PHON

TypeText

Unit31,000 words

LanguageDari

CountryAfghanistan

Czech (Czech Republic) Pronunciation Dictionary

Dataset successfully added to the Quote List

Danish (Denmark) Part of Speech Dictionary

Dataset successfully added to the Quote List

Danish (Denmark) Pronunciation Dictionary

Dataset successfully added to the Quote List

Dari (Afghanistan) broadcast

Dataset successfully added to the Quote List

Dari (Afghanistan) conversational telephony

Dataset successfully added to the Quote List

Dari (Afghanistan) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets