Filters
Search
Product type
Language
Country
Year of Collection

Japanese NER news text

More info
Common Use CasesNER, Content Classification, Search Engines
Dataset IDJPY_NER001
TypeText
Unit20,629 sentences
LanguageJapanese
CountryJapan

Javanese (Indonesia) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDjav_IDN_PHON
TypeText
Unit22,000 words
LanguageJavanese
CountryIndonesia

Kannada (India) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDkan_IND_PHON
TypeText
Unit49,000 words
LanguageKannada
CountryIndia

Kazakh (Kazakhstan) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDkaz_KAZ_PHON
TypeText
Unit31,000 words
LanguageKazakh
CountryKazakhstan

Korean (South Korea) Part of Speech Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDkor_KOR_POS
TypeText
Unit100,000 words
LanguageKorean
CountrySouth Korea

Korean (South Korea) Pronunciation Dictionary

More info
Common Use CasesASR, TTS, Language Modelling
Dataset IDkor_KOR_PHON
TypeText
Unit105,000 words
LanguageKorean
CountrySouth Korea

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert