Off-the-Shelf AI Training Datasets

Arabic NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDARB_NER001

TypeText

Unit20,774 sentences

LanguageArabic (Standard)

CountryN/A

Dari (Afghanistan) broadcast

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Automatic Captioning, Keyword Spotting

Dataset IDDAR_BRC001

TypeAudio

Unit49 hours

LanguageDari

CountryAfghanistan

English (United States) Ultra High-Volume labeled speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics, Automatic Captioning, In Car HMI & Entertainment, Virtual Assistant

Dataset IDUSE_UHV001

TypeAudio

Unit1196 hours

LanguageEnglish

CountryUnited States

English NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDENG_NER001

TypeText

Unit22,768 sentences

LanguageEnglish

CountryN/A

Farsi/Persian NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDFAR_NER001

TypeText

Unit19,584 sentences

LanguageIranian Persian

CountryIran

Japanese NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDJPY_NER001

TypeText

Unit20,629 sentences

LanguageJapanese

CountryJapan

Arabic NER news text

Dataset successfully added to the Quote List

Dari (Afghanistan) broadcast

Dataset successfully added to the Quote List

English (United States) Ultra High-Volume labeled speech

Dataset successfully added to the Quote List

English NER news text

Dataset successfully added to the Quote List

Farsi/Persian NER news text

Dataset successfully added to the Quote List

Japanese NER news text

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets