Off-the-Shelf AI Training Datasets

Amharic (Ethiopia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDamh_ETH_PHON

TypeText

Unit49,000 words

LanguageAmharic

CountryEthiopia

Hindi (India) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics, TTS

Dataset IDHIN_ASR002

TypeAudio

Unit32 hours

LanguageHindi

CountryIndia

Hindi (India) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDhin_IND_PHON

TypeText

Unit35,000 words

LanguageHindi

CountryIndia

Hindi (India) scripted telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Virtual Assistant, TTS

Dataset IDHIN_ASR001

TypeAudio

Unit224 hours

LanguageHindi

CountryIndia

Korean (South Korea) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDkor_KOR_POS

TypeText

Unit100,000 words

LanguageKorean

CountrySouth Korea

Korean (South Korea) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDkor_KOR_PHON

TypeText

Unit105,000 words

LanguageKorean

CountrySouth Korea

Off-the-shelf (OTS) Datasets

Amharic (Ethiopia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Hindi (India) conversational telephony

Dataset successfully added to the Quote List

Hindi (India) Pronunciation Dictionary

Dataset successfully added to the Quote List

Hindi (India) scripted telephony

Dataset successfully added to the Quote List

Korean (South Korea) Part of Speech Dictionary

Dataset successfully added to the Quote List

Korean (South Korea) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Off-the-shelf (OTS) Datasets

Amharic (Ethiopia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Hindi (India) conversational telephony

Dataset successfully added to the Quote List

Hindi (India) Pronunciation Dictionary

Dataset successfully added to the Quote List

Hindi (India) scripted telephony

Dataset successfully added to the Quote List

Korean (South Korea) Part of Speech Dictionary

Dataset successfully added to the Quote List

Korean (South Korea) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Get in touch