Off-the-Shelf AI Training Datasets

German Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: DEU_ITN001

Type: Text

Unit: 8001 test cases

Language: German

Country: N/A

GlobalPhone Multilingual Text & Speech Database

Common Use Cases: ASR, Language Identification, Multilingual Speech Synthesis, Virtual Assistant, Chatbot

Dataset ID: GLOBALPHONE

Type: Audio

Unit: 450 hours

Language: N/A

Country: Global coverage

Greek (Greece) scripted smartphone

Common Use Cases: ASR, Virtual Assistant, Chatbot

Dataset ID: GRE_ASR001_CN

Type: Audio

Unit: 191 hours

Language: Greek

Country: Greece

Hausa scripted microphone

Common Use Cases: ASR, Virtual Assistant, Chatbot

Dataset ID: HAU_ASR001

Type: Audio

Unit: 20 hours

Language: Hausa

Country: Cameroon

Hindi Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: HIN_ITN001

Type: Text

Unit: 6924 test cases

Language: Hindi

Country: N/A

Hungarian (Hungary) scripted smartphone

Common Use Cases: ASR, Virtual Assistant, Chatbot

Dataset ID: HUN_ASR001_CN

Type: Audio

Unit: 286 hours

Language: Hungarian

Country: Hungary
