Off-the-Shelf AI Training Datasets

Arabic NER news text

Common Use Cases: NER, Content Classification, Search Engines

Dataset ID: ARB_NER001

Type: Text

Unit: 20,774 sentences

Language: Arabic (Standard)

Country: N/A

Dari (Afghanistan) broadcast

Common Use Cases: ASR, Automatic Captioning, Keyword Spotting

Dataset ID: DAR_BRC001

Type: Audio

Unit: 49 hours

Language: Dari

Country: Afghanistan

East African facial images

Common Use Cases: Facial Recognition

Dataset ID: IMG_FACE_KEN_CN

Type: Image

Unit: 13,500 images

Language: N/A

Country: Kenya

English inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: ENG_ITN001

Type: Text

Unit: 4,454 test cases

Language: English

Country: N/A

English NER news text

Common Use Cases: NER, Content Classification, Search Engines

Dataset ID: ENG_NER001

Type: Text

Unit: 22,768 sentences

Language: English

Country: N/A

European License Plate Detection Annotations

Common Use Cases: License plate detection for vehicles on the road

Dataset ID: LICENSE_ANNO

Type: Image Annotation

Unit: 100,000 bounding boxes

Language: N/A

Country: Germany, France, Switzerland
