Off-the-Shelf AI Training Datasets

Cantonese (China) business dialogues

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics, Business Intelligence

Dataset IDYYDH_ASR001_CN

TypeAudio

Unit98.35 hours

LanguageCantonese

CountryChina

East African facial images

More info

Dataset successfully added to the Quote List

Common Use CasesFacial Recognition

Dataset IDIMG_FACE_KEN_CN

TypeImage

Unit13500 images

LanguageN/A

CountryKenya

English (United States) Adversarial prompts for LLM red teaming in development

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training, LLM Red teaming

Dataset IDeng_USA_LLM002

TypeText

Unit500 prompts

LanguageEnglish

CountryUnited States

English (United States) Harmful and harmless prompts and responses in development

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training, LLM Red teaming, Chatbot

Dataset IDeng_USA_LLM001

TypeText

Unit300 prompts

LanguageEnglish

CountryUnited States

English Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDENG_ITN001

TypeText

Unit4454 test cases

LanguageEnglish

CountryN/A

French Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDFRA_ITN001

TypeText

Unit3274 test cases

LanguageFrench

CountryN/A

Cantonese (China) business dialogues

Dataset successfully added to the Quote List

East African facial images

Dataset successfully added to the Quote List

English (United States) Adversarial prompts for LLM red teaming **in development**

Dataset successfully added to the Quote List

English (United States) Harmful and harmless prompts and responses **in development**

Dataset successfully added to the Quote List

English Inverse text normalisation

Dataset successfully added to the Quote List

French Inverse text normalisation

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

English (United States) Adversarial prompts for LLM red teaming in development

English (United States) Harmful and harmless prompts and responses in development