Off-the-Shelf AI Training Datasets

Dari (Afghanistan) broadcast

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Automatic Captioning, Keyword Spotting

Dataset IDDAR_BRC001

TypeAudio

Unit49 hours

LanguageDari

CountryAfghanistan

Dutch (Netherlands & Belgium) scripted in-car

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, In Car HMI & Entertainment

Dataset IDDutch and Flemish SpeechDat-Car

TypeAudio

Unit27 hours

LanguageDutch

CountryNetherland - Belgium

English (United States) Ultra High-Volume labeled speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics, Automatic Captioning, In Car HMI & Entertainment, Virtual Assistant

Dataset IDUSE_UHV001

TypeAudio

Unit1196 hours

LanguageEnglish

CountryUnited States

French (France) In-Car

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, In Car HMI & Entertainment

Dataset IDFrench SpeechDat-Car

TypeAudio

Unit113 hours

LanguageFrench

CountryFrance

GlobalPhone Multilingual Text & Speech Database

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Identification, Multilingual Speech Synthesis, Virtual Assistant, Chatbot

Dataset IDGLOBALPHONE

TypeAudio

Unit450 hours

LanguageN/A

CountryGlobal coverage

Italian (Italy) scripted microphone in-car

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, In Car HMI & Entertainment

Dataset IDITA_ASR002

TypeAudio

Unit47 hours

LanguageItalian

CountryItaly

Dari (Afghanistan) broadcast

Dataset successfully added to the Quote List

Dutch (Netherlands & Belgium) scripted in-car

Dataset successfully added to the Quote List

English (United States) Ultra High-Volume labeled speech

Dataset successfully added to the Quote List

French (France) In-Car

Dataset successfully added to the Quote List

GlobalPhone Multilingual Text & Speech Database

Dataset successfully added to the Quote List

Italian (Italy) scripted microphone in-car

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets