Off-the-Shelf AI Training Datasets

Farsi/Persian NER news text

Common Use Cases: NER, Content Classification, Search Engines
Dataset ID: FAR_NER001
Type: Text
Unit: 19,584 sentences
Language: Iranian Persian
Country: Iran

Finnish (Finland) printed text OCR

Common Use Cases: Document Processing, Document Search, Text Detection
Dataset ID: IMG_OCR_FIN_CN
Type: Image
Unit: 7,293 images
Language: Finnish
Country: Finland

French (France) In-Car

Common Use Cases: ASR, Virtual Assistant, In-Car HMI & Entertainment
Dataset ID: French SpeechDat-Car
Type: Audio
Unit: 113 hours
Language: French
Country: France

Handwritten text document OCR

Common Use Cases: Document Processing, Document Search, Text Detection
Dataset ID: IMG_OCR_Handwritten
Type: Image
Unit: 663 images
Language: N/A
Country: N/A

Italian (Italy) scripted microphone in-car

Common Use Cases: ASR, Virtual Assistant, In-Car HMI & Entertainment
Dataset ID: ITA_ASR002
Type: Audio
Unit: 47 hours
Language: Italian
Country: Italy

Japanese NER news text

Common Use Cases: NER, Content Classification, Search Engines
Dataset ID: JPY_NER001
Type: Text
Unit: 20,629 sentences
Language: Japanese
Country: Japan

Get Started with Off-the-Shelf AI Training Datasets