Dataset ID:
YYDH_ASR001_CN
Dataset Name:
Cantonese (China) business dialogues
Common Use Cases:
ASR, Conversational AI, Speech Analytics, Business Intelligence
Language:
Cantonese
Country:
China
Language Code:
yue
Country Code:
CHN
Product Type
Audio
Detailed Product Type
Conversational Speech
Unit
98.35 hours
Recording Device
Mobile phone
Recording Condition
Low background noise (home/office)
Contributors
241
Utterances
Available upon request
Unique Words
Available upon request
Sample Rate (kHz):
16
Channels
2
Data Format
wav
Source
Appen China
Additional Info:
- Business meetings and conversations audio with transcription and timestamping, from a variety of industries.
- 30% male participants, 70% female
Year of Collection
2024
Get Started with Off-the-Shelf AI Training Datasets
Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.
Talk to an expert