Off-the-Shelf AI Training Datasets

Hindi Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: HIN_ITN001

Type: Text

Unit: 6,924 test cases

Language: Hindi

Country: N/A

Japanese Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: JPN_ITN001

Type: Text

Unit: 5,363 test cases

Language: Japanese

Country: N/A

Japanese NER news text

Common Use Cases: NER, Content Classification, Search Engines

Dataset ID: JPY_NER001

Type: Text

Unit: 20,629 sentences

Language: Japanese

Country: Japan

Korean NER news text

Common Use Cases: NER, Content Classification, Search Engines

Dataset ID: KOR_NER001

Type: Text

Unit: 25,830 sentences

Language: Korean

Country: South Korea

Latin American Spanish Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: SPA_ITN001

Type: Text

Unit: 3,795 test cases

Language: Spanish

Country: N/A

Mandarin Chinese Inverse text normalisation

Common Use Cases: ASR, Language Modelling, Closed Captioning

Dataset ID: CMN_ITN001

Type: Text

Unit: 4,230 test cases

Language: Mandarin Chinese

Country: N/A
