Off-the-Shelf AI Training Datasets

Shanghai dialect (China) Conversational Speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDSHANGHAI_ASR001_CN

TypeAudio

Unit21 hours

LanguageShanghai dialect

CountryChina

Shanghai dialect (China) Conversational Speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDSHANGHAI_ASR002_CN

TypeAudio

Unit4.5 hours

LanguageShanghai dialect

CountryChina

Somali (Somalia) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDSOM_ASR001

TypeAudio

Unit50 hours

LanguageSomali

CountrySomalia

Sorani (Kurdish) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDSOR_ASR001

TypeAudio

Unit5 hours

LanguageCentral Kurdish (Iran)

CountryIran

Spanish (Latin America – Chile and Colombia) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Call Centre, Conversational AI, Speech Analytics

Dataset IDESL_ASR002

TypeAudio

Unit22 hours

LanguageSpanish

CountryChile-Columbia

Spanish (Latin America) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDESL_ASR001

TypeAudio

Unit17 hours

LanguageSpanish

CountryCosta Rica

Shanghai dialect (China) Conversational Speech

Dataset successfully added to the Quote List

Shanghai dialect (China) Conversational Speech

Dataset successfully added to the Quote List

Somali (Somalia) conversational telephony

Dataset successfully added to the Quote List

Sorani (Kurdish) conversational telephony

Dataset successfully added to the Quote List

Spanish (Latin America – Chile and Colombia) conversational telephony

Dataset successfully added to the Quote List

Spanish (Latin America) scripted microphone

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets