Off-the-Shelf AI Training Datasets

Arabic (United Arab Emirates (UAE)/ Saudi Arabia) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDCGA_ASR001

TypeAudio

Unit86 hours

LanguageArabic

CountryUnited Arab Emirates (UAE) - Saudi Arabia

Assamese (India) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDasm_IND_PHON

TypeText

Unit40,000 words

LanguageAssamese

CountryIndia

Baby crying audio

More info

Dataset successfully added to the Quote List

Common Use CasesBaby Monitor, Security & Other Consumer Applications

Dataset IDCRY_ASR001_CN

TypeAudio

Unit70 hours

LanguageN/A

CountryChina

Basque (Spain) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDeus_ESP_PHON

TypeText

Unit10,000 words

LanguageBasque

CountrySpain

Bengali (India) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDben_IND_PHON

TypeText

Unit29,000 words

LanguageBengali

CountryIndia

Bulgarian (Bulgaria) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDbul_BGR_PHON

TypeText

Unit55,000 words

LanguageBulgarian

CountryBulgaria

Arabic (United Arab Emirates (UAE)/ Saudi Arabia) scripted microphone

Dataset successfully added to the Quote List

Assamese (India) Pronunciation Dictionary

Dataset successfully added to the Quote List

Baby crying audio

Dataset successfully added to the Quote List

Basque (Spain) Pronunciation Dictionary

Dataset successfully added to the Quote List

Bengali (India) Pronunciation Dictionary

Dataset successfully added to the Quote List

Bulgarian (Bulgaria) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets