Off-the-Shelf AI Training Datasets

Slovak (Slovakia) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDSlovak SpeechDat(E) Database

TypeAudio

Unit65 hours

LanguageSlovak

CountrySlovakia

Slovenian (Slovenian) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDslv_SVN_PHON

TypeText

Unit28,000 words

LanguageSlovenian

CountrySlovenia

Slovenian (Slovenian) telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDSlovenian SpeechDat(II) FDB-1000

TypeAudio

Unit76 hours

LanguageSlovenian

CountrySlovenia

Somali (Somalia) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDSOM_ASR001

TypeAudio

Unit50 hours

LanguageSomali

CountrySomalia

Somali (Somalia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDsom_SOM_PHON

TypeText

Unit76,000 words

LanguageSomali

CountrySomalia

Sorani (Iraq) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDkur_IRQ_PHON

TypeText

Unit26,000 words

LanguageSorani

CountryIraq

Slovak (Slovakia) scripted telephony

Dataset successfully added to the Quote List

Slovenian (Slovenian) Pronunciation Dictionary

Dataset successfully added to the Quote List

Slovenian (Slovenian) telephony

Dataset successfully added to the Quote List

Somali (Somalia) conversational telephony

Dataset successfully added to the Quote List

Somali (Somalia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Sorani (Iraq) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets