Off-the-Shelf AI Training Datasets

Home environment pictures

More info

Dataset successfully added to the Quote List

Common Use CasesImage recognition

Dataset IDIMG_HOME_CN

TypeImage

Unit10000 images

LanguageN/A

CountryN/A

Japanese Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDJPN_ITN001

TypeText

Unit5363 test cases

LanguageJapanese

CountryN/A

Latin American Spanish Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDSPA_ITN001

TypeText

Unit3795 test cases

LanguageSpanish

CountryN/A

Mandarin Chinese Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDCMN_ITN001

TypeText

Unit4230 test cases

LanguageMandarin Chinese

CountryN/A

Pashto (Afghanistan) broadcast

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Automatic Captioning, Keyword Spotting

Dataset IDPAS_BRC001

TypeAudio

Unit51 hours

LanguageNorthern Pashto - Southern Pashto

CountryAfghanistan

Spanish (Latin America – Chile and Colombia) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Call Centre, Conversational AI, Speech Analytics

Dataset IDESL_ASR002

TypeAudio

Unit22 hours

LanguageSpanish

CountryChile-Columbia

Home environment pictures

Dataset successfully added to the Quote List

Japanese Inverse text normalisation

Dataset successfully added to the Quote List

Latin American Spanish Inverse text normalisation

Dataset successfully added to the Quote List

Mandarin Chinese Inverse text normalisation

Dataset successfully added to the Quote List

Pashto (Afghanistan) broadcast

Dataset successfully added to the Quote List

Spanish (Latin America – Chile and Colombia) conversational telephony

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets