Filters
Search
Product type
Language
Country
Year of Collection

Polish (Poland) scripted smartphone

More info
Common Use CasesASR, Virtual Assistant, Chatbot
Dataset IDPOL_ASR002_CN
TypeAudio
Unit293 hours
LanguagePolish
CountryPoland

Portuguese (Brazil) microphone

More info
Common Use CasesASR, Virtual Assistant, Chatbot
Dataset IDPTB_ASR001
TypeAudio
Unit26 hours
LanguagePortuguese
CountryBrazil

Portuguese (Brazil) Part of Speech Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDpor_BRA_POS
TypeText
Unit98,000 words
LanguagePortuguese
CountryBrazil

Portuguese (Brazil) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDpor_BRA_PHON
TypeText
Unit102,000 words
LanguagePortuguese
CountryBrazil

Portuguese (Portugal) Part of Speech Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDpor_PRT_POS
TypeText
Unit60,000 words
LanguagePortuguese
CountryPortugal

Portuguese (Portugal) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDpor_PRT_PHON
TypeText
Unit112,000 words
LanguagePortuguese
CountryPortugal

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert