Nexdata-AI/600-Hours-American-English-Full-Duplex-Multi-Channel-Speech-Dataset
600-Hours-American-English-Full-Duplex-Multi-Channel-Speech-Dataset
Description
An American English multi-stream spontaneous dialogue speech dataset recorded on smartphones, collected from dialogues based on given topics. Recordings are transcribed with text content and annotated with speaker ID, gender, age and other attributes. Our dataset was collected from a large, geographically diverse pool of speakers, which is intended to improve model performance on real-world, complex tasks.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1770?source=Github
Specifications
Format
16 kHz, 16 bit, uncompressed WAV, mono channel, speaker channel separation
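As a rough illustration of the format line above, the following sketch checks whether a WAV file's header matches 16 kHz, 16 bit, uncompressed, mono. It uses only Python's standard `wave` module. The function name `matches_spec` and the example file path are hypothetical and not part of the dataset; the actual directory layout and file naming are not described here.

```python
import wave


def matches_spec(path, rate=16000, sample_width_bytes=2, channels=1):
    """Return True if the WAV header matches the stated format.

    Defaults follow the format line: 16 kHz, 16 bit (2 bytes), mono.
    """
    with wave.open(path, "rb") as wf:
        return (
            wf.getframerate() == rate
            and wf.getsampwidth() == sample_width_bytes
            and wf.getnchannels() == channels
            # The wave module reports "NONE" for uncompressed PCM data.
            and wf.getcomptype() == "NONE"
        )


# Hypothetical usage; replace the path with a real file from the dataset:
# print(matches_spec("speaker_A.wav"))
```

Since each speaker is described as being on a separate mono channel, a check like this would presumably be run once per speaker file.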
Content category
Dialogue based on given topics
Recording condition
Low background noise (indoor)
Recording device
Android smartphone, iPhone
Country
United States (USA)
Language (Region) Code
en-US
Language
English
Features of annotation
Transcription text, timestamp, speaker ID, gender, noise
Accuracy rate
Word accuracy rate (WAR): 98%
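The page does not state how the word accuracy rate is computed. A common convention, assumed here, is WAR = 1 - WER, where WER is the word-level edit distance between a reference and a checked transcript divided by the number of reference words. The sketch below follows that assumption; the function name `word_accuracy_rate` is hypothetical, and the provider's own measurement may differ.

```python
def word_accuracy_rate(reference: str, hypothesis: str) -> float:
    """Return 1 - WER using word-level Levenshtein distance.

    Assumption: WAR is defined as 1 minus word error rate.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = cur[j - 1] + 1
            cur.append(min(substitution, deletion, insertion))
        prev = cur

    return 1.0 - prev[-1] / len(ref)


# One substituted word out of four reference words.
print(word_accuracy_rate("turn the lights off", "turn the light off"))
```

Under this definition, a 98% WAR would correspond to roughly two word-level errors per hundred reference words.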
Licensing Information
Commercial License
Created September 15, 2025
Updated September 28, 2025