Off-the-Shelf AI Training Datasets

Korean (South Korea) scripted microphone

Common Use Cases: ASR, Virtual Assistant, Chatbot
Dataset ID: KOR_ASR001
Type: Audio
Unit: 20 hours
Language: Korean
Country: South Korea

Latin American Spanish Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning
Dataset ID: SPA_ITN001
Type: Text
Unit: 3795 test cases
Language: Spanish
Country: N/A

Mandarin Chinese (China) scripted microphone

Common Use Cases: ASR, Virtual Assistant, Chatbot
Dataset ID: MAC_ASR002
Type: Audio
Unit: 26 hours
Language: Mandarin Chinese
Country: China

Mandarin Chinese Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning
Dataset ID: CMN_ITN001
Type: Text
Unit: 4230 test cases
Language: Mandarin Chinese
Country: N/A

Polish (Poland) scripted microphone

Common Use Cases: ASR, Virtual Assistant, Chatbot
Dataset ID: POL_ASR001
Type: Audio
Unit: 25 hours
Language: Polish
Country: Poland

Polish (Poland) scripted smartphone

Common Use Cases: ASR, Virtual Assistant, Chatbot
Dataset ID: POL_ASR002_CN
Type: Audio
Unit: 293 hours
Language: Polish
Country: Poland
