Dataset ID:
DEU_ASR001
Dataset Name:
German (Germany) scripted microphone
Common Use Cases:
ASR, Virtual Assistant, Chatbot
Language:
German
Country:
Germany
Language Code:
deu
Country Code:
DEU
Product Type
Audio
Detailed Product Type
Scripted Speech
Unit
16 hours
Recording Device
Microphone
Recording Condition
Low background noise (studio)
Contributors
127
Utterances
12,700
Unique Words
6,826
Sample Rate (kHz):
48
Channels
2
Data Format
raw PCM
Source
Appen Global
Additional Info:
- Dataset is fully transcribed and timestamped
- Dataset is accompanied by a pronunciation lexicon containing all transcribed words
- Each speaker read 100 prompts including digits, natural numbers, personal and city names, telephone numbers, generic command and control items, phonetically rich sentences and words
Year of Collection
2009
Get Started with Off-the-Shelf AI Training Datasets
Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.
Talk to an expert