Romanian (Romania) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: ron_ROU_PHON
Type: Text
Unit: 16,000 words
Language: Romanian
Country: Romania

Russian (Russia) Part of Speech Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: rus_RUS_POS
Type: Text
Unit: 100,000 words
Language: Russian
Country: Russia

Russian (Russia) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: rus_RUS_PHON
Type: Text
Unit: 120,000 words
Language: Russian
Country: Russia

Russian + German Female TTS

Common Use Cases: TTS
Dataset ID: ED_TTS001_CN
Type: Audio
Unit: 2.32 hours
Language: Russian/German
Country: Russia/Germany

Selfie image and video collection

Common Use Cases: Facial Recognition, Human Body Movement Recognition
Dataset ID: IMG_VID_SELFIE_US
Type: Image, Video
Unit: 2,938 files (1,403 images, 1,535 videos)
Language: N/A
Country: United States

Serbian (Serbia) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: srp_SRB_PHON
Type: Text
Unit: 29,000 words
Language: Serbian
Country: Serbia

Get Started with Off-the-Shelf AI Training Datasets

Appen’s catalog of off-the-shelf (OTS) datasets spans multiple data types and industries and covers a range of AI applications. The datasets are built to high standards of quality and accuracy to provide reliable training data for AI models.