Off-the-Shelf AI Training Datasets

Chinese news text summaries corpus

Common Use Cases: LLM training
Dataset ID: DMXWB_corpus_CN
Type: Text
Unit: 20,000 summaries
Language: Chinese
Country: China

Dongbei dialect (China) Conversational Speech

Common Use Cases: ASR, Conversational AI, Speech Analytics
Dataset ID: DONGBEI_ASR001_CN
Type: Audio
Unit: 84.6 hours
Language: Dongbei dialect
Country: China

Dongbei dialect (China) Conversational Speech

Common Use Cases: ASR, Conversational AI, Speech Analytics
Dataset ID: DONGBEI_ASR002_CN
Type: Audio
Unit: 75.2 hours
Language: Dongbei dialect
Country: China

Electric vehicles in elevators

Common Use Cases: Image recognition
Dataset ID: IMG_DDC_CN
Type: Image
Unit: 17,132 images
Language: N/A
Country: China

German (Turkey) telephony

Common Use Cases: ASR, Virtual Assistant
Dataset ID: OrienTel German Spoken by Turkish
Type: Audio
Unit: 31 hours
Language: German
Country: Turkey

Kurmanji (Turkey) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling
Dataset ID: kur_TUR_PHON
Type: Text
Unit: 60,000 words
Language: Kurmanji
Country: Turkey
