Off-the-Shelf AI Training Datasets

Basque (Spain) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDeus_ESP_PHON

TypeText

Unit10,000 words

LanguageBasque

CountrySpain

Bengali (Bangladesh) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDBEN_ASR001

TypeAudio

Unit47 hours

LanguageBengali

CountryBangladesh

Bengali (India) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDben_IND_PHON

TypeText

Unit29,000 words

LanguageBengali

CountryIndia

Bulgarian (Bulgaria) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDBUL_ASR001

TypeAudio

Unit38 hours

LanguageBulgarian

CountryBulgaria

Bulgarian (Bulgaria) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDbul_BGR_PHON

TypeText

Unit55,000 words

LanguageBulgarian

CountryBulgaria

Cantonese (China) business dialogues

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics, Business Intelligence

Dataset IDYYDH_ASR001_CN

TypeAudio

Unit98.35 hours

LanguageCantonese

CountryChina

Basque (Spain) Pronunciation Dictionary

Dataset successfully added to the Quote List

Bengali (Bangladesh) conversational telephony

Dataset successfully added to the Quote List

Bengali (India) Pronunciation Dictionary

Dataset successfully added to the Quote List

Bulgarian (Bulgaria) conversational telephony

Dataset successfully added to the Quote List

Bulgarian (Bulgaria) Pronunciation Dictionary

Dataset successfully added to the Quote List

Cantonese (China) business dialogues

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets