Off-the-Shelf AI Training Datasets

Russian (Russia) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDSpeecon Russian Database

TypeAudio

Unit46 hours

LanguageRussian

CountryRussia

Russian (Russia) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDRussian SpeechDat(E) Database

TypeAudio

Unit180 hours

LanguageRussian

CountryRussia

Russian NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDRUS_NER001

TypeText

Unit29,888 sentences

LanguageRussian

CountryRussia

Slovak (Slovakia) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDSlovak SpeechDat(E) Database

TypeAudio

Unit65 hours

LanguageSlovak

CountrySlovakia

Slovenian (Slovenian) telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant

Dataset IDSlovenian SpeechDat(II) FDB-1000

TypeAudio

Unit76 hours

LanguageSlovenian

CountrySlovenia

Spanish (Latin America) scripted microphone

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, Chatbot

Dataset IDESL_ASR001

TypeAudio

Unit17 hours

LanguageSpanish

CountryCosta Rica

Russian (Russia) scripted microphone

Dataset successfully added to the Quote List

Russian (Russia) scripted telephony

Dataset successfully added to the Quote List

Russian NER news text

Dataset successfully added to the Quote List

Slovak (Slovakia) scripted telephony

Dataset successfully added to the Quote List

Slovenian (Slovenian) telephony

Dataset successfully added to the Quote List

Spanish (Latin America) scripted microphone

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets