Off-the-Shelf AI Training Datasets

Chinese multidisciplinary test questions corpus

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training

Dataset IDMTQ_CN

TypeText

Unit319970 sentences

LanguageChinese

CountryChina

Chinese news text summaries corpus

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training

Dataset IDDMXWB_corpus_CN

TypeText

Unit20000 summaries

LanguageChinese

CountryChina

Code Q&A Dataset

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training

Dataset IDDM_CNRD

TypeText

Unit12 million pairs

LanguageEnglish

CountryN/A

English (United States) Adversarial prompts for LLM red teaming in development

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training, LLM Red teaming

Dataset IDeng_USA_LLM002

TypeText

Unit500 prompts

LanguageEnglish

CountryUnited States

English (United States) Chatbot conversations in development

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training, Chatbot, Virtual Assistant

Dataset IDeng_USA_LLM003

TypeText

Unit1800 prompts

LanguageEnglish

CountryUnited States

English (United States) Harmful and harmless prompts and responses in development

More info

Dataset successfully added to the Quote List

Common Use CasesLLM training, LLM Red teaming, Chatbot

Dataset IDeng_USA_LLM001

TypeText

Unit300 prompts

LanguageEnglish

CountryUnited States

Chinese multidisciplinary test questions corpus

Dataset successfully added to the Quote List

Chinese news text summaries corpus

Dataset successfully added to the Quote List

Code Q&A Dataset

Dataset successfully added to the Quote List

English (United States) Adversarial prompts for LLM red teaming **in development**

Dataset successfully added to the Quote List

English (United States) Chatbot conversations **in development**

Dataset successfully added to the Quote List

English (United States) Harmful and harmless prompts and responses **in development**

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

English (United States) Adversarial prompts for LLM red teaming in development

English (United States) Chatbot conversations in development

English (United States) Harmful and harmless prompts and responses in development