Off-the-Shelf AI Training Datasets

Japanese NER news text

More info

Dataset successfully added to the Quote List

Common Use CasesNER, Content Classification, Search Engines

Dataset IDJPY_NER001

TypeText

Unit20,629 sentences

LanguageJapanese

CountryJapan

Javanese (Indonesia) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDjav_IDN_PHON

TypeText

Unit22,000 words

LanguageJavanese

CountryIndonesia

Kannada (India) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDkan_IND_PHON

TypeText

Unit49,000 words

LanguageKannada

CountryIndia

Kazakh (Kazakhstan) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDkaz_KAZ_PHON

TypeText

Unit31,000 words

LanguageKazakh

CountryKazakhstan

Korean (South Korea) Part of Speech Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDkor_KOR_POS

TypeText

Unit100,000 words

LanguageKorean

CountrySouth Korea

Korean (South Korea) Pronunciation Dictionary

More info

Dataset successfully added to the Quote List

Common Use CasesASR, TTS, Language Modelling

Dataset IDkor_KOR_PHON

TypeText

Unit105,000 words

LanguageKorean

CountrySouth Korea

Japanese NER news text

Dataset successfully added to the Quote List

Javanese (Indonesia) Pronunciation Dictionary

Dataset successfully added to the Quote List

Kannada (India) Pronunciation Dictionary

Dataset successfully added to the Quote List

Kazakh (Kazakhstan) Pronunciation Dictionary

Dataset successfully added to the Quote List

Korean (South Korea) Part of Speech Dictionary

Dataset successfully added to the Quote List

Korean (South Korea) Pronunciation Dictionary

Dataset successfully added to the Quote List

Get Started with Off-the-Shelf AI Training Datasets