Off-the-Shelf AI Training Datasets

Lao (Laos) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: lao_LAO_PHON

Type: Text

Unit: 9,000 words

Language: Lao

Country: Laos

Latin American Spanish Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: SPA_ITN001

Type: Text

Unit: 3,795 test cases

Language: Spanish

Country: N/A

Mandarin (Traditional) (Taiwan) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: zho_TWN_PHON

Type: Text

Unit: 50,000 words

Language: Mandarin (Traditional)

Country: Taiwan

Mandarin Chinese Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: CMN_ITN001

Type: Text

Unit: 4,230 test cases

Language: Mandarin Chinese

Country: N/A

Object Image Collection (text descriptions in development)

Common Use Cases: Image label recognition training, Accessibility, LLM image generation

Dataset ID: IMG_TAG_CN

Type: Image

Unit: 2,000 images

Language: N/A

Country: N/A

Roomba view images

Common Use Cases: Image recognition, Object recognition, Retail, E-commerce

Dataset ID: IMG_SDJ_CN

Type: Image

Unit: 82,000 images

Language: N/A

Country: N/A

Get Started with Off-the-Shelf AI Training Datasets

Get in touch
