Off-the-Shelf AI Training Datasets

Location entrance human body movement videos

More info

Dataset successfully added to the Quote List

Common Use CasesSecurity, Movement detection, Human Body Movement Recognition

Dataset IDHUMAN_BODY_VID002

TypeVideo

Unit130 videos

LanguageN/A

CountryUnited Kingdom, Philippines

Mandarin Chinese Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDCMN_ITN001

TypeText

Unit4230 test cases

LanguageMandarin Chinese

CountryN/A

Mandarin NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDMAC_NER001

TypeText

Unit17,313 sentences

LanguageMandarin Chinese

CountryChina

Object Image Collection text descriptions in development

More info

Dataset successfully added to the Quote List

Common Use CasesImage label recognition training, Accessibility, LLM image generation

Dataset IDIMG_TAG_CN

TypeImage

Unit2000 images

LanguageN/A

CountryN/A

Russian NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDRUS_NER001

TypeText

Unit29,888 sentences

LanguageRussian

CountryRussia

Urdu NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDURD_NER001

TypeText

Unit20,634 sentences

LanguageUrdu

CountryPakistan

Location entrance human body movement videos

Dataset successfully added to the Quote List

Mandarin Chinese Inverse text normalisation

Dataset successfully added to the Quote List

Mandarin NER news text

Dataset successfully added to the Quote List

Object Image Collection **text descriptions in development**

Dataset successfully added to the Quote List

Russian NER news text

Dataset successfully added to the Quote List

Urdu NER news text

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Object Image Collection text descriptions in development