English (United States) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: eng_USA_PHON
Type: Text
Unit: 358,000 words
Language: English
Country: United States

English Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning
Dataset ID: ENG_ITN001
Type: Text
Unit: 4,454 test cases
Language: English
Country: N/A

Finnish (Finland) Part of Speech Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: fin_FIN_POS
Type: Text
Unit: 10,000 words
Language: Finnish
Country: Finland

Finnish (Finland) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: fin_FIN_PHON
Type: Text
Unit: 86,000 words
Language: Finnish
Country: Finland

French (Algeria) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: fra_DZA_PHON
Type: Text
Unit: 4,000 words
Language: French
Country: Algeria

French (Canada) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: fra_CAN_PHON
Type: Text
Unit: 67,000 words
Language: French
Country: Canada

Get Started with Off-the-Shelf AI Training Datasets

Appen’s catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, covering a wide range of AI applications. The datasets are built to high standards of quality and accuracy to provide reliable training data for AI models.
