Off-the-Shelf AI Training Datasets

Mandarin Chinese (China) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDMAC_ASR001

TypeAudio

Unit323 hours

LanguageMandarin Chinese

CountryChina

Mandarin NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDMAC_NER001

TypeText

Unit17,313 sentences

LanguageMandarin Chinese

CountryChina

Shanghai dialect (China) Conversational Speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDSHANGHAI_ASR001_CN

TypeAudio

Unit21 hours

LanguageShanghai dialect

CountryChina

Shanghai dialect (China) Conversational Speech

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDSHANGHAI_ASR002_CN

TypeAudio

Unit4.5 hours

LanguageShanghai dialect

CountryChina

Slovenian (Slovenian) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDslv_SVN_PHON

TypeText

Unit28,000 words

LanguageSlovenian

CountrySlovenia

Slovenian (Slovenian) telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDSlovenian SpeechDat(II) FDB-1000

TypeAudio

Unit76 hours

LanguageSlovenian

CountrySlovenia

Off-the-shelf (OTS) Datasets

Mandarin Chinese (China) scripted telephony

Dataset successfully added to the Quote List

Mandarin NER news text

Dataset successfully added to the Quote List

Shanghai dialect (China) Conversational Speech

Dataset successfully added to the Quote List

Shanghai dialect (China) Conversational Speech

Dataset successfully added to the Quote List

Slovenian (Slovenian) Pronunciation Dictionary

Dataset successfully added to the Quote List

Slovenian (Slovenian) telephony

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Off-the-shelf (OTS) Datasets

Mandarin Chinese (China) scripted telephony

Dataset successfully added to the Quote List

Mandarin NER news text

Dataset successfully added to the Quote List

Shanghai dialect (China) Conversational Speech

Dataset successfully added to the Quote List

Shanghai dialect (China) Conversational Speech

Dataset successfully added to the Quote List

Slovenian (Slovenian) Pronunciation Dictionary

Dataset successfully added to the Quote List

Slovenian (Slovenian) telephony

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Get in touch