Off-the-Shelf AI Training Datasets

Lao (Laos) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDlao_LAO_PHON

TypeText

Unit9,000 words

LanguageLao

CountryLaos

Latin American Spanish Inverse text normalisation

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Language Modelling, Closed Captioning

Dataset IDSPA_ITN001

TypeText

Unit3795 test cases

LanguageSpanish

CountryN/A

Pashto (Afghanistan) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDpus_AFG_PHON

TypeText

Unit64,000 words

LanguagePashto

CountryAfghanistan

Spanish (Argentina) Offensive Wordlist

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDspa_ARG_NER001

TypeText

Unit5,000 words

LanguageSpanish

CountryArgentina

Spanish (Argentina) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDspa_ARG_PHON

TypeText

Unit15,000 words

LanguageSpanish

CountryArgentina

Spanish (Chile) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDspa_CHL_PHON

TypeText

Unit15,000 words

LanguageSpanish

CountryChile

Off-the-shelf (OTS) Datasets

Lao (Laos) Pronunciation Dictionary

Dataset successfully added to the Quote List

Latin American Spanish Inverse text normalisation

Dataset successfully added to the Quote List

Pashto (Afghanistan) Pronunciation Dictionary

Dataset successfully added to the Quote List

Spanish (Argentina) Offensive Wordlist

Dataset successfully added to the Quote List

Spanish (Argentina) Pronunciation Dictionary

Dataset successfully added to the Quote List

Spanish (Chile) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Off-the-shelf (OTS) Datasets

Lao (Laos) Pronunciation Dictionary

Dataset successfully added to the Quote List

Latin American Spanish Inverse text normalisation

Dataset successfully added to the Quote List

Pashto (Afghanistan) Pronunciation Dictionary

Dataset successfully added to the Quote List

Spanish (Argentina) Offensive Wordlist

Dataset successfully added to the Quote List

Spanish (Argentina) Pronunciation Dictionary

Dataset successfully added to the Quote List

Spanish (Chile) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets

Get in touch