Off-the-Shelf AI Training Datasets

Finnish (Finland) Part of Speech Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: fin_FIN_POS
Type: Text
Unit: 10,000 words
Language: Finnish
Country: Finland

Finnish (Finland) printed text OCR

Common Use Cases: Document Processing, Document Search, Text Detection
Dataset ID: IMG_OCR_FIN_CN
Type: Image
Unit: 7,293 images
Language: Finnish
Country: Finland

Finnish (Finland) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: fin_FIN_PHON
Type: Text
Unit: 86,000 words
Language: Finnish
Country: Finland

Lithuanian (Lithuania) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: lit_LTU_PHON
Type: Text
Unit: 71,000 words
Language: Lithuanian
Country: Lithuania

Malayalam (India) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: mal_IND_PHON
Type: Text
Unit: 19,000 words
Language: Malayalam
Country: India

Tagalog (Philippines) Offensive Wordlist

Common Use Cases: NER, Content Classification, Search Engines
Dataset ID: tgl_PHL_NER001
Type: Text
Unit: 4,526 words
Language: Tagalog
Country: Philippines

Get Started with Off-the-Shelf AI Training Datasets

Get in touch