Off-the-Shelf AI Training Datasets

English (United States) Ultra High-Volume labeled speech

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics, Automatic Captioning, In Car HMI & Entertainment, Virtual Assistant

Dataset IDUSE_UHV001

TypeAudio

Unit1196 hours

LanguageEnglish

CountryUnited States

English Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDENG_ITN001

TypeText

Unit4454 test cases

LanguageEnglish

CountryN/A

European License Plate Detection Annotations

More info

Dataset successfully added to the Quote List

Common Use CasesLicense plate detection for vehicles on the road

Dataset IDLICENSE_ANNO

TypeImage Annotation

Unit100,000 bounding boxes

LanguageN/A

CountryGermany, France, Switzerland

Finnish (Finland) printed text OCR

More info

Dataset successfully added to the Quote List

Common Use CasesDocument Processing, Document Search, Text detection

Dataset IDIMG_OCR_FIN_CN

TypeImage

Unit7293 images

LanguageFinnish

CountryFinland

French (Canada) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDFRC_ASR003

TypeAudio

Unit9 hours

LanguageFrench

CountryCanada

French (France) conversational smartphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDFRF_ASR004

TypeAudio

Unit159 hours

LanguageFrench

CountryFrance

English (United States) Ultra High-Volume labeled speech

Dataset successfully added to the Quote List

English Inverse text normalisation

Dataset successfully added to the Quote List

European License Plate Detection Annotations

Dataset successfully added to the Quote List

Finnish (Finland) printed text OCR

Dataset successfully added to the Quote List

French (Canada) conversational telephony

Dataset successfully added to the Quote List

French (France) conversational smartphone

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets