Off-the-Shelf AI Training Datasets

Handwritten text document OCR

Common Use Cases: Document Processing, Document Search, Text Detection

Dataset ID: IMG_OCR_Handwritten

Type: Image

Unit: 663 images

Language: N/A

Country: N/A

Japanese NER news text

Common Use Cases: NER, Content Classification, Search Engines

Dataset ID: JPY_NER001

Type: Text

Unit: 20,629 sentences

Language: Japanese

Country: Japan

Korean NER news text

Common Use Cases: NER, Content Classification, Search Engines

Dataset ID: KOR_NER001

Type: Text

Unit: 25,830 sentences

Language: Korean

Country: South Korea

Mandarin NER news text

Common Use Cases: NER, Content Classification, Search Engines

Dataset ID: MAC_NER001

Type: Text

Unit: 17,313 sentences

Language: Mandarin Chinese

Country: China

Object Image Collection (text descriptions in development)

Common Use Cases: Image label recognition training, Accessibility, LLM image generation

Dataset ID: IMG_TAG_CN

Type: Image

Unit: 2,000 images

Language: N/A

Country: N/A

Pashto (Afghanistan) broadcast

Common Use Cases: ASR, Automatic Captioning, Keyword Spotting

Dataset ID: PAS_BRC001

Type: Audio

Unit: 51 hours

Language: Northern Pashto - Southern Pashto

Country: Afghanistan

Get Started with Off-the-Shelf AI Training Datasets