Off-the-Shelf AI Training Datasets

French (France) Part of Speech Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: fra_FRA_POS

Type: Text

Unit: 95,000 words

Language: French

Country: France

French (France) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: fra_FRA_PHON

Type: Text

Unit: 112,000 words

Language: French

Country: France

French Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: FRA_ITN001

Type: Text

Unit: 3,274 test cases

Language: French

Country: N/A

Georgian (Georgia) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: kat_GEO_PHON

Type: Text

Unit: 67,000 words

Language: Georgian

Country: Georgia

German (Germany) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: deu_DEU_PHON

Type: Text

Unit: 146,000 words

Language: German

Country: Germany

German (Switzerland) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: deu_CHE_PHON

Type: Text

Unit: 27,000 words

Language: German

Country: Switzerland

Get Started with Off-the-Shelf AI Training Datasets