Off-the-shelf (OTS) Datasets

Accelerate your AI projects with licensable datasets

Browse our extensive catalog of over 270 audio, image, video and text datasets in over 80 languages. Our pre-labeled datasets are available immediately so you can get started right away.

Browse catalog

Greek (Greece) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDell_GRC_PHON

TypeText

Unit5,000 words

LanguageGreek

CountryGreece

Italian (Italy) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDita_ITA_POS

TypeText

Unit171,000 words

LanguageItalian

CountryItaly

Italian (Italy) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDita_ITA_PHON

TypeText

Unit197,000 words

LanguageItalian

CountryItaly

Malayalam (India) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDmal_IND_PHON

TypeText

Unit19,000 words

LanguageMalayalam

CountryIndia

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert

Off-the-shelf (OTS) Datasets

Greek (Greece) Pronunciation Dictionary

Dataset successfully added to the Quote List

Italian (Italy) Part of Speech Dictionary

Dataset successfully added to the Quote List

Italian (Italy) Pronunciation Dictionary

Dataset successfully added to the Quote List

Malayalam (India) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Get in touch