Off-the-Shelf AI Training Datasets

Igbo (Nigeria) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDibo_NGA_PHON

TypeText

Unit32,000 words

LanguageIgbo

CountryNigeria

Javanese (Indonesia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDjav_IDN_PHON

TypeText

Unit22,000 words

LanguageJavanese

CountryIndonesia

Lao (Laos) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDlao_LAO_PHON

TypeText

Unit9,000 words

LanguageLao

CountryLaos

Latin American Spanish Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDSPA_ITN001

TypeText

Unit3795 test cases

LanguageSpanish

CountryN/A

Spanish (Argentina) Offensive Wordlist

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDspa_ARG_NER001

TypeText

Unit5,000 words

LanguageSpanish

CountryArgentina

Spanish (Argentina) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDspa_ARG_PHON

TypeText

Unit15,000 words

LanguageSpanish

CountryArgentina

Off-the-shelf (OTS) Datasets

Igbo (Nigeria) Pronunciation Dictionary

Dataset successfully added to the Quote List

Javanese (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Lao (Laos) Pronunciation Dictionary

Dataset successfully added to the Quote List

Latin American Spanish Inverse text normalisation

Dataset successfully added to the Quote List

Spanish (Argentina) Offensive Wordlist

Dataset successfully added to the Quote List

Spanish (Argentina) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Off-the-shelf (OTS) Datasets

Igbo (Nigeria) Pronunciation Dictionary

Dataset successfully added to the Quote List

Javanese (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Lao (Laos) Pronunciation Dictionary

Dataset successfully added to the Quote List

Latin American Spanish Inverse text normalisation

Dataset successfully added to the Quote List

Spanish (Argentina) Offensive Wordlist

Dataset successfully added to the Quote List

Spanish (Argentina) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Get in touch