Off-the-Shelf AI Training Datasets

Baby crying audio

Common Use Cases: Baby Monitor, Security & Other Consumer Applications
Dataset ID: CRY_ASR001_CN
Type: Audio
Unit: 70 hours
Language: N/A
Country: China

Cantonese (China) business dialogues

Common Use Cases: ASR, Conversational AI, Speech Analytics, Business Intelligence
Dataset ID: YYDH_ASR001_CN
Type: Audio
Unit: 98.35 hours
Language: Cantonese
Country: China

Chinese command-and-control prompt-response corpus

Common Use Cases: LLM training, Command and Control, TV Player, Device Control
Dataset ID: DSDH_corpus_CN
Type: Text
Unit: 20,000 sentences
Language: Chinese
Country: China

English (United States) Adversarial prompts for LLM red teaming (in development)

Common Use Cases: LLM training, LLM Red teaming
Dataset ID: eng_USA_LLM002
Type: Text
Unit: 500 prompts
Language: English
Country: United States

English (United States) Harmful and harmless prompts and responses (in development)

Common Use Cases: LLM training, LLM Red teaming, Chatbot
Dataset ID: eng_USA_LLM001
Type: Text
Unit: 300 prompts
Language: English
Country: United States

European License Plate Detection Annotations

Common Use Cases: License plate detection for vehicles on the road
Dataset ID: LICENSE_ANNO
Type: Image Annotation
Unit: 100,000 bounding boxes
Language: N/A
Country: Germany, France, Switzerland
