Dataset ID:
DMXWB_corpus_CN
Dataset Name:
Chinese news text summaries corpus
Common Use Cases:
LLM training
Language:
Chinese
Country:
China
Language Code:
chn
Country Code:
CHN
Product Type
Text
Detailed Product Type
LLM training
Unit
20000 summaries
Recording Device
N/A
Recording Condition
N/A
Contributors
N/A
Utterances
N/A
Unique Words
N/A
Sample Rate (kHz):
N/A
Channels
N/A
Data Format
xls
Source
Appen China
Additional Info:
- Summaries of main events and themes from news data in 15 domains (Finance and economics, Lottery ticket, House property, Share certificate, Home furnishings, Education, Science & Technology, Society & people's livelihood, Fashion, Politics, Sports activities, Constellation, Game, Entertainment)
Year of Collection
2023
Get Started with Off-the-Shelf AI Training Datasets
Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.
Talk to an expert