Off-the-Shelf AI Training Datasets

Korean (South Korea) Part of Speech Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: kor_KOR_POS

Type: Text

Unit: 100,000 words

Language: Korean

Country: South Korea

Korean (South Korea) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: kor_KOR_PHON

Type: Text

Unit: 105,000 words

Language: Korean

Country: South Korea

Kurmanji (Turkey) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: kur_TUR_PHON

Type: Text

Unit: 60,000 words

Language: Kurmanji

Country: Turkey

Lao (Laos) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: lao_LAO_PHON

Type: Text

Unit: 9,000 words

Language: Lao

Country: Laos

Latin American Spanish Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: SPA_ITN001

Type: Text

Unit: 3,795 test cases

Language: Spanish

Country: N/A

Lithuanian (Lithuania) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: lit_LTU_PHON

Type: Text

Unit: 71,000 words

Language: Lithuanian

Country: Lithuania

Get Started with Off-the-Shelf AI Training Datasets