Off-the-Shelf AI Training Datasets

Farsi/Persian NER news text

Common Use Cases: NER, Content Classification, Search Engines

Dataset ID: FAR_NER001

Type: Text

Unit: 19,584 sentences

Language: Iranian Persian

Country: Iran

French (France) In-Car

Common Use Cases: ASR, Virtual Assistant, In Car HMI & Entertainment

Dataset ID: French SpeechDat-Car

Type: Audio

Unit: 113 hours

Language: French

Country: France

French Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: FRA_ITN001

Type: Text

Unit: 3,274 test cases

Language: French

Country: N/A

German Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: DEU_ITN001

Type: Text

Unit: 8,001 test cases

Language: German

Country: N/A

Hindi Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: HIN_ITN001

Type: Text

Unit: 6,924 test cases

Language: Hindi

Country: N/A

Italian (Italy) scripted microphone in-car

Common Use Cases: ASR, Virtual Assistant, In Car HMI & Entertainment

Dataset ID: ITA_ASR002

Type: Audio

Unit: 47 hours

Language: Italian

Country: Italy

Get Started with Off-the-Shelf AI Training Datasets