Off-the-Shelf AI Training Datasets

Cantonese (China) Traditional Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDyue_HKG_PHON

TypeText

Unit40,000 words

LanguageCantonese

CountryChina

Catalan (Spain) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDcat_ESP_PHON

TypeText

Unit10,000 words

LanguageCatalan

CountrySpain

Cebuano (Philippines) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDceb_PHL_PHON

TypeText

Unit21,000 words

LanguageCebuano

CountryPhilippines

Croatian (Croatia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDhrv_HRV_PHON

TypeText

Unit19,000 words

LanguageCroatian

CountryCroatia

Czech (Czech Republic) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDces_CZE_PHON

TypeText

Unit50,000 words

LanguageCzech

CountryCzech Republic

Danish (Denmark) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDdan_DNK_POS

TypeText

Unit100,000 words

LanguageDanish

CountryDenmark

Cantonese (China) Traditional Pronunciation Dictionary

Dataset successfully added to the Quote List

Catalan (Spain) Pronunciation Dictionary

Dataset successfully added to the Quote List

Cebuano (Philippines) Pronunciation Dictionary

Dataset successfully added to the Quote List

Croatian (Croatia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Czech (Czech Republic) Pronunciation Dictionary

Dataset successfully added to the Quote List

Danish (Denmark) Part of Speech Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets