Off-the-Shelf AI Training Datasets

Cebuano (Philippines) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: ceb_PHL_PHON

Type: Text

Unit: 21,000 words

Language: Cebuano

Country: Philippines

Georgian (Georgia) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: kat_GEO_PHON

Type: Text

Unit: 67,000 words

Language: Georgian

Country: Georgia

Latin American Spanish Inverse Text Normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: SPA_ITN001

Type: Text

Unit: 3,795 test cases

Language: Spanish

Country: N/A

Somali (Somalia) Conversational Telephony

Common Use Cases: ASR, Conversational AI, Speech Analytics

Dataset ID: SOM_ASR001

Type: Audio

Unit: 50 hours

Language: Somali

Country: Somalia

Somali (Somalia) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: som_SOM_PHON

Type: Text

Unit: 76,000 words

Language: Somali

Country: Somalia

Spanish (Argentina) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: spa_ARG_PHON

Type: Text

Unit: 15,000 words

Language: Spanish

Country: Argentina
