Off-the-Shelf AI Training Datasets

Arabic (UAE) printed text annotated OCR

Common Use Cases: Document Processing, Document Search, Text Detection

Dataset ID: IMG_OCR_ARU002_CN

Type: Image

Unit: 20,000 images

Language: Arabic

Country: United Arab Emirates

Arabic (United Arab Emirates (UAE)) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: ara_ARE_PHON

Type: Text

Unit: 75,000 words

Language: Arabic

Country: United Arab Emirates (UAE)

Assamese (India) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: asm_IND_PHON

Type: Text

Unit: 40,000 words

Language: Assamese

Country: India

Bahasa Indonesia conversational telephony

Common Use Cases: ASR, Conversational AI, Speech Analytics

Dataset ID: BAH_ASR001

Type: Audio

Unit: 31 hours

Language: Indonesian

Country: Indonesia

Basque (Spain) Pronunciation Dictionary

Common Use Cases: ASR, TTS, Language Modelling

Dataset ID: eus_ESP_PHON

Type: Text

Unit: 10,000 words

Language: Basque

Country: Spain

Bengali (Bangladesh) conversational telephony

Common Use Cases: ASR, Conversational AI, Speech Analytics

Dataset ID: BEN_ASR001

Type: Audio

Unit: 47 hours

Language: Bengali

Country: Bangladesh

Get Started with Off-the-Shelf AI Training Datasets