Off-the-Shelf AI Training Datasets

Amharic (Ethiopia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDamh_ETH_PHON

TypeText

Unit49,000 words

LanguageAmharic

CountryEthiopia

Igbo (Nigeria) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDibo_NGA_PHON

TypeText

Unit32,000 words

LanguageIgbo

CountryNigeria

Javanese (Indonesia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDjav_IDN_PHON

TypeText

Unit22,000 words

LanguageJavanese

CountryIndonesia

Lao (Laos) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDlao_LAO_PHON

TypeText

Unit9,000 words

LanguageLao

CountryLaos

Urdu (India/ Pakistan) conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDURD_ASR001

TypeAudio

Unit47 hours

LanguageUrdu

CountryIndia - Pakistan

Urdu (Pakistan) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDurd_PAK_POS

TypeText

Unit12,000 words

LanguageUrdu

CountryPakistan

Off-the-shelf (OTS) Datasets

Amharic (Ethiopia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Igbo (Nigeria) Pronunciation Dictionary

Dataset successfully added to the Quote List

Javanese (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Lao (Laos) Pronunciation Dictionary

Dataset successfully added to the Quote List

Urdu (India/ Pakistan) conversational telephony

Dataset successfully added to the Quote List

Urdu (Pakistan) Part of Speech Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Off-the-shelf (OTS) Datasets

Amharic (Ethiopia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Igbo (Nigeria) Pronunciation Dictionary

Dataset successfully added to the Quote List

Javanese (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Lao (Laos) Pronunciation Dictionary

Dataset successfully added to the Quote List

Urdu (India/ Pakistan) conversational telephony

Dataset successfully added to the Quote List

Urdu (Pakistan) Part of Speech Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Get in touch