English (United States) Harmful and harmless prompts and responses **in development**

Common Use Cases: LLM training, LLM Red teaming, Chatbot
Dataset ID: eng_USA_LLM001
Type: Text
Unit: 300 prompts
Language: English
Country: United States

English (United States) Ultra High-Volume labeled speech

Common Use Cases: ASR, Conversational AI, Speech Analytics, Automatic Captioning, In Car HMI & Entertainment, Virtual Assistant
Dataset ID: USE_UHV001
Type: Audio
Unit: 1196 hours
Language: English
Country: United States

French (France) In-Car

Common Use Cases: ASR, Virtual Assistant, In Car HMI & Entertainment
Dataset ID: French SpeechDat-Car
Type: Audio
Unit: 113 hours
Language: French
Country: France

Hand gesture videos **in development**

Common Use Cases: Movement detection, Human Body Movement Recognition, Action Classification
Dataset ID: HUMAN_BODY_VID003
Type: Video
Unit: 5000 videos
Language: N/A
Country: United States

Italian (Italy) scripted microphone in-car

Common Use Cases: ASR, Virtual Assistant, In Car HMI & Entertainment
Dataset ID: ITA_ASR002
Type: Audio
Unit: 47 hours
Language: Italian
Country: Italy

Location entrance human body movement videos

Common Use Cases: Security, Movement detection, Human Body Movement Recognition
Dataset ID: HUMAN_BODY_VID002
Type: Video
Unit: 130 videos
Language: N/A
Country: United Kingdom, Philippines

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.
