Off-the-Shelf AI Training Datasets

Arabic (MSA) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDarb_MSA_PHON

TypeText

Unit40,000 words

LanguageArabic (Standard)

CountryN/A

Arabic NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDARB_NER001

TypeText

Unit20,774 sentences

LanguageArabic (Standard)

CountryN/A

Finnish (Finland) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDfin_FIN_POS

TypeText

Unit10,000 words

LanguageFinnish

CountryFinland

Finnish (Finland) printed text OCR

More info

Dataset successfully added to the Quote List

Common Use CasesDocument Processing, Document Search, Text detection

Dataset IDIMG_OCR_FIN_CN

TypeImage

Unit7293 images

LanguageFinnish

CountryFinland

Finnish (Finland) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDfin_FIN_PHON

TypeText

Unit86,000 words

LanguageFinnish

CountryFinland

Hausa (Nigeria) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDHAU_ASR002

TypeAudio

Unit33 hours

LanguageHausa

CountryNigeria

Off-the-shelf (OTS) Datasets

Arabic (MSA) Pronunciation Dictionary

Dataset successfully added to the Quote List

Arabic NER news text

Dataset successfully added to the Quote List

Finnish (Finland) Part of Speech Dictionary

Dataset successfully added to the Quote List

Finnish (Finland) printed text OCR

Dataset successfully added to the Quote List

Finnish (Finland) Pronunciation Dictionary

Dataset successfully added to the Quote List

Hausa (Nigeria) conversational telephony

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Off-the-shelf (OTS) Datasets

Arabic (MSA) Pronunciation Dictionary

Dataset successfully added to the Quote List

Arabic NER news text

Dataset successfully added to the Quote List

Finnish (Finland) Part of Speech Dictionary

Dataset successfully added to the Quote List

Finnish (Finland) printed text OCR

Dataset successfully added to the Quote List

Finnish (Finland) Pronunciation Dictionary

Dataset successfully added to the Quote List

Hausa (Nigeria) conversational telephony

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Get in touch