Off-the-Shelf AI Training Datasets

Amharic (Ethiopia) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: amh_ETH_PHON

Type: Text

Unit: 49,000 words

Language: Amharic

Country: Ethiopia

Arabic (MSA) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: arb_MSA_PHON

Type: Text

Unit: 40,000 words

Language: Arabic (Standard)

Country: N/A

Bahasa Indonesia Conversational Telephony

Common Use Cases: ASR, Conversational AI, Speech Analytics

Dataset ID: BAH_ASR001

Type: Audio

Unit: 31 hours

Language: Indonesian

Country: Indonesia

English Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: ENG_ITN001

Type: Text

Unit: 4,454 test cases

Language: English

Country: N/A

French Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: FRA_ITN001

Type: Text

Unit: 3,274 test cases

Language: French

Country: N/A

German Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: DEU_ITN001

Type: Text

Unit: 8,001 test cases

Language: German

Country: N/A

Get Started with Off-the-Shelf AI Training Datasets

Get in touch