Off-the-Shelf AI Training Datasets

Japanese (Japan) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDJPN_ASR001

TypeAudio

Unit33 hours

LanguageJapanese

CountryJapan

Japanese Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDJPN_ITN001

TypeText

Unit5363 test cases

LanguageJapanese

CountryN/A

Korean (South Korea) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDKOR_ASR001

TypeAudio

Unit20 hours

LanguageKorean

CountrySouth Korea

Latin American Spanish Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDSPA_ITN001

TypeText

Unit3795 test cases

LanguageSpanish

CountryN/A

Mandarin Chinese (China) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDMAC_ASR002

TypeAudio

Unit26 hours

LanguageMandarin Chinese

CountryChina

Mandarin Chinese (China) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDMAC_ASR001

TypeAudio

Unit323 hours

LanguageMandarin Chinese

CountryChina

Japanese (Japan) scripted microphone

Dataset successfully added to the Quote List

Japanese Inverse text normalisation

Dataset successfully added to the Quote List

Korean (South Korea) scripted microphone

Dataset successfully added to the Quote List

Latin American Spanish Inverse text normalisation

Dataset successfully added to the Quote List

Mandarin Chinese (China) scripted microphone

Dataset successfully added to the Quote List

Mandarin Chinese (China) scripted telephony

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets