Off-the-Shelf AI Training Datasets

Arabic (United Arab Emirates (UAE)) scripted telephony

Common Use Cases: ASR, Virtual Assistant

Dataset ID: OrienTel United Arab Emirates MSA (Modern Standard Arabic)

Type: Audio

Unit: 31 hours

Language: Arabic

Country: United Arab Emirates (UAE)

Arabic (United Arab Emirates (UAE) / Saudi Arabia) scripted microphone

Common Use Cases: ASR, Virtual Assistant, Chatbot

Dataset ID: CGA_ASR001

Type: Audio

Unit: 86 hours

Language: Arabic

Country: United Arab Emirates (UAE), Saudi Arabia

Bulgarian (Bulgaria) scripted microphone

Common Use Cases: ASR, Virtual Assistant, Chatbot

Dataset ID: BUL_ASR002

Type: Audio

Unit: 22 hours

Language: Bulgarian

Country: Bulgaria

Chinese and English related texts

Common Use Cases: LLM training

Dataset ID: GLWB_CN

Type: Text

Unit: 400,000

Language: English/Chinese

Country: N/A

Chinese command and control prompt response corpus

Common Use Cases: LLM training, Command and Control, TV Player, Device Control

Dataset ID: DSDH_corpus_CN

Type: Text

Unit: 20,000 sentences

Language: Chinese

Country: China

Chinese instruction set sentence corpus

Common Use Cases: LLM training

Dataset ID: ZLJ_corpus_CN

Type: Text

Unit: 200,000 sentences

Language: Chinese

Country: China

Get Started with Off-the-Shelf AI Training Datasets