Dari (Afghanistan) broadcast

Common Use Cases: ASR, Automatic Captioning, Keyword Spotting
Dataset ID: DAR_BRC001
Type: Audio
Unit: 49 hours
Language: Dari
Country: Afghanistan

English (United Kingdom) TTS female scripted microphone

Common Use Cases: TTS
Dataset ID: TC-STAR female baseline voice Laura
Type: Audio
Unit: 11 hours
Language: English
Country: United Kingdom

English (United Kingdom) TTS male scripted microphone

Common Use Cases: TTS
Dataset ID: TC-STAR male baseline voice Ian
Type: Audio
Unit: 7 hours
Language: English
Country: United Kingdom

English (United States) Ultra High-Volume labeled speech

Common Use Cases: ASR, Conversational AI, Speech Analytics, Automatic Captioning, In Car HMI & Entertainment, Virtual Assistant
Dataset ID: USE_UHV001
Type: Audio
Unit: 1196 hours
Language: English
Country: United States

GlobalPhone Multilingual Text & Speech Database

Common Use Cases: ASR, Language Identification, Multilingual Speech Synthesis, Virtual Assistant, Chatbot
Dataset ID: GLOBALPHONE
Type: Audio
Unit: 450 hours
Language: N/A
Country: Global coverage

Hindi (India) conversational telephony

Common Use Cases: ASR, Conversational AI, Speech Analytics, TTS
Dataset ID: HIN_ASR002
Type: Audio
Unit: 32 hours
Language: Hindi
Country: India

Get Started with Off-the-Shelf AI Training Datasets

Appen’s catalog of off-the-shelf (OTS) datasets spans multiple data types and industries and covers a range of AI applications. The datasets are built to high standards of quality and accuracy to provide reliable training data for AI models.
