Off-the-Shelf AI Training Datasets

English (United States) scripted sentences in development

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDUSE_ASR005

TypeAudio

Unit500 hours

LanguageEnglish

CountryUnited States

English (United States) Text message conversations

More info

Dataset successfully added to the Quote List

Common Use CasesChatbot, Virtual Assistant, Conversational AI

Dataset IDeng_USA_SMS003

TypeText

Unit100 conversations

LanguageEnglish

CountryUnited States

English (United States) Ultra High-Volume labeled speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics, Automatic Captioning, In Car HMI & Entertainment, Virtual Assistant

Dataset IDUSE_UHV001

TypeAudio

Unit1196 hours

LanguageEnglish

CountryUnited States

Finnish (Finland) printed text OCR

More info

Dataset successfully added to the Quote List

Common Use CasesDocument Processing, Document Search, Text detection

Dataset IDIMG_OCR_FIN_CN

TypeImage

Unit7293 images

LanguageFinnish

CountryFinland

French (Belgium) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDBelgian French SpeechDat(II) FDB-1000 (FIXED1BF)

TypeAudio

Unit76 hours

LanguageFrench

CountryBelgium

French (Canada) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDFRC_ASR002

TypeAudio

Unit46 hours

LanguageFrench

CountryCanada

English (United States) scripted sentences **in development**

Dataset successfully added to the Quote List

English (United States) Text message conversations

Dataset successfully added to the Quote List

English (United States) Ultra High-Volume labeled speech

Dataset successfully added to the Quote List

Finnish (Finland) printed text OCR

Dataset successfully added to the Quote List

French (Belgium) scripted telephony

Dataset successfully added to the Quote List

French (Canada) scripted microphone

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

English (United States) scripted sentences in development