Off-the-Shelf AI Training Datasets

Indonesian (Indonesia) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDind_IDN_POS

TypeText

Unit10,000 words

LanguageIndonesian

CountryIndonesia

Indonesian (Indonesia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDind_IDN_PHON

TypeText

Unit95,000 words

LanguageIndonesian

CountryIndonesia

Japanese Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDJPN_ITN001

TypeText

Unit5363 test cases

LanguageJapanese

CountryN/A

Javanese (Indonesia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDjav_IDN_PHON

TypeText

Unit22,000 words

LanguageJavanese

CountryIndonesia

Latin American Spanish Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDSPA_ITN001

TypeText

Unit3795 test cases

LanguageSpanish

CountryN/A

Mandarin Chinese Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDCMN_ITN001

TypeText

Unit4230 test cases

LanguageMandarin Chinese

CountryN/A

Off-the-shelf (OTS) Datasets

Indonesian (Indonesia) Part of Speech Dictionary

Dataset successfully added to the Quote List

Indonesian (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Japanese Inverse text normalisation

Dataset successfully added to the Quote List

Javanese (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Latin American Spanish Inverse text normalisation

Dataset successfully added to the Quote List

Mandarin Chinese Inverse text normalisation

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Off-the-shelf (OTS) Datasets

Indonesian (Indonesia) Part of Speech Dictionary

Dataset successfully added to the Quote List

Indonesian (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Japanese Inverse text normalisation

Dataset successfully added to the Quote List

Javanese (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Latin American Spanish Inverse text normalisation

Dataset successfully added to the Quote List

Mandarin Chinese Inverse text normalisation

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Get in touch