Off-the-Shelf AI Training Datasets

Arabic NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDARB_NER001

TypeText

Unit20,774 sentences

LanguageArabic (Standard)

CountryN/A

Assamese (India) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDasm_IND_PHON

TypeText

Unit40,000 words

LanguageAssamese

CountryIndia

Baby crying audio

More info

Dataset successfully added to the Quote List

Common Use CasesBaby Monitor, Security & Other Consumer Applications

Dataset IDCRY_ASR001_CN

TypeAudio

Unit70 hours

LanguageN/A

CountryChina

Bahasa Indonesia conversational telephony

More info

Dataset successfully added to the Quote List

Common Use CasesASR, Conversational AI, Speech Analytics

Dataset IDBAH_ASR001

TypeAudio

Unit31 hours

LanguageIndonesian

CountryIndonesia

Baking Pictures

More info

Dataset successfully added to the Quote List

Common Use CasesImage recognition

Dataset IDIMG_BAKE_CN

TypeImage

Unit6000 images

LanguageN/A

CountryChina

Basque (Spain) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDeus_ESP_PHON

TypeText

Unit10,000 words

LanguageBasque

CountrySpain

Arabic NER news text

Dataset successfully added to the Quote List

Assamese (India) Pronunciation Dictionary

Dataset successfully added to the Quote List

Baby crying audio

Dataset successfully added to the Quote List

Bahasa Indonesia conversational telephony

Dataset successfully added to the Quote List

Baking Pictures

Dataset successfully added to the Quote List

Basque (Spain) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets