Off-the-Shelf AI Training Datasets

Malaysian (Malaysia) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: msa_MYS_PHON

Type: Text

Unit: 26,000 words

Language: Malaysian

Country: Malaysia

Oriya (India) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: ori_IND_PHON

Type: Text

Unit: 19,000 words

Language: Oriya

Country: India

Tagalog (Philippines) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: tgl_PHL_PHON

Type: Text

Unit: 34,000 words

Language: Tagalog

Country: Philippines

Telugu (India) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: tel_IND_PHON

Type: Text

Unit: 51,000 words

Language: Telugu

Country: India

Urdu (India/Pakistan) Conversational Telephony

Common Use Cases: ASR, Conversational AI, Speech Analytics

Dataset ID: URD_ASR001

Type: Audio

Unit: 47 hours

Language: Urdu

Country: India, Pakistan

Urdu (Pakistan) Part of Speech Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: urd_PAK_POS

Type: Text

Unit: 12,000 words

Language: Urdu

Country: Pakistan

Get Started with Off-the-Shelf AI Training Datasets

Get in touch