Filters
Search
Product type
Language
Country
Year of Collection

English (United States) Medical Terms Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDeng_USA_Med_PHON
TypeText
Unit8,000 words
LanguageEnglish
CountryUnited States

English (United States) Part of Speech Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDeng_USA_POS
TypeText
Unit263,000 words
LanguageEnglish
CountryUnited States

English (United States) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDeng_USA_PHON
TypeText
Unit358,000 words
LanguageEnglish
CountryUnited States

English NER news text

More info
Common Use CasesNER, Content Classification, Search Engines
Dataset IDENG_NER001
TypeText
Unit22,768 sentences
LanguageEnglish
CountryN/A

Farsi/Persian NER news text

More info
Common Use CasesNER, Content Classification, Search Engines
Dataset IDFAR_NER001
TypeText
Unit19,584 sentences
LanguageIranian Persian
CountryIran

Finnish (Finland) Part of Speech Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDfin_FIN_POS
TypeText
Unit10,000 words
LanguageFinnish
CountryFinland

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert