Off-the-Shelf AI Training Datasets

Arabic NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDARB_NER001

TypeText

Unit20,774 sentences

LanguageArabic (Standard)

CountryN/A

Chinese command and control prompt response corpus

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training, Command and Control, TV Player, Device Control

Dataset IDDSDH_corpus_CN

TypeText

Unit20000 sentences

LanguageChinese

CountryChina

English Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDENG_ITN001

TypeText

Unit4454 test cases

LanguageEnglish

CountryN/A

English NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDENG_NER001

TypeText

Unit22,768 sentences

LanguageEnglish

CountryN/A

Farsi/Persian NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDFAR_NER001

TypeText

Unit19,584 sentences

LanguageIranian Persian

CountryIran

French Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDFRA_ITN001

TypeText

Unit3274 test cases

LanguageFrench

CountryN/A

Arabic NER news text

Dataset successfully added to the Quote List

Chinese command and control prompt response corpus

Dataset successfully added to the Quote List

English Inverse text normalisation

Dataset successfully added to the Quote List

English NER news text

Dataset successfully added to the Quote List

Farsi/Persian NER news text

Dataset successfully added to the Quote List

French Inverse text normalisation

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets