The objective of the SDC Program is to develop speech corpora to support in-car automatic speech recognition technology research and development. The project consists of a series of in-car data collections in various languages, each conducted in the native country, and it will be extended on request. These corpora were specially designed for training and testing recent in-car ASR applications and will be licensed upon request.
| Serial Number | Data Name | Speakers | Sound Parameter | Utterances (prompts/spk) | Utterances (total) |
| *King-ASR-120 | Chinese in car Speech Corpus | 1200 people | 16 kHz, 16 bit, four channels | 480 | 576000 |
| *King-ASR-129 | Canadian French in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
| *King-ASR-121 | Korean in car Speech Corpus | 1000 people | 16 kHz, 16 bit, four channels | 360 | 360000 |
| *King-ASR-125 | Japanese in car Speech Corpus | 800 people | 16 kHz, 16 bit, four channels | 360 | 288000 |
| *King-ASR-132 | France French in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
| *King-ASR-135 | UK English in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
| *King-ASR-141 | Spain Spanish in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
| *King-ASR-144 | Mexico Spanish in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
| *King-ASR-145 | USA Spanish in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
| *King-ASR-147 | Italy Italian in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
| *King-ASR-150 | German in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
| *King-ASR-153 | Russian in car Speech Corpus | 300 people | 16 kHz, 16 bit, four channels | 330 | 99000 |
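The total listed for each corpus appears to equal the number of speakers multiplied by the prompts per speaker. A minimal Python sketch, with figures copied from the corpus list above, checks that relation. The dictionary layout and the function names are illustrative assumptions, not part of any SDC tooling.

```python
# Figures copied from the corpus list: (speakers, prompts per speaker, listed total).
CORPORA = {
    "King-ASR-120": (1200, 480, 576000),
    "King-ASR-129": (300, 330, 99000),
    "King-ASR-121": (1000, 360, 360000),
    "King-ASR-125": (800, 360, 288000),
    "King-ASR-132": (300, 330, 99000),
    "King-ASR-135": (300, 330, 99000),
    "King-ASR-141": (300, 330, 99000),
    "King-ASR-144": (300, 330, 99000),
    "King-ASR-145": (300, 330, 99000),
    "King-ASR-147": (300, 330, 99000),
    "King-ASR-150": (300, 330, 99000),
    "King-ASR-153": (300, 330, 99000),
}


def expected_total(speakers, prompts_per_speaker):
    """Assumed relation: every speaker reads the full prompt list once."""
    return speakers * prompts_per_speaker


def find_mismatches(corpora):
    """Return the serial numbers whose listed total differs from the product."""
    return [
        serial
        for serial, (speakers, prompts, listed) in corpora.items()
        if expected_total(speakers, prompts) != listed
    ]


def grand_total(corpora):
    """Sum the listed utterance totals over all corpora."""
    return sum(listed for _, _, listed in corpora.values())


if __name__ == "__main__":
    print("mismatches:", find_mismatches(CORPORA))
    print("utterances across all corpora:", grand_total(CORPORA))
```

If a new language is added on request, appending one entry to the dictionary is enough to re-run the same consistency check.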
SDC data overview
Languages: Chinese, Korean, Japanese, Canadian French, France French, Spain Spanish, Mexico Spanish, American Spanish, Italian, German, UK English and Russian.
Note: further languages can be added on request.
General information
Script Design
All scripts were specially designed for training and testing in-car speech recognition applications.
Recording platforms
The recordings were made under Windows XP SP2 with a multi-channel recording system, AUDIOREC. The recording equipment is listed in Table 2-1.
| Item | Description | Quantity |
| Vehicle | Two popular cars used in the country | 2 |
| Computer | IBM T43 | 1 |
| Microphone | Shure SM10A / Sennheiser ME104 / AKG Q400 | 1/1/2 |
| Audio Card | PocketPC 440 MX | 1 |
Table 2-1 Recording platform
Speaker's demographic information
To cover as many speaker-specific factors as possible in the database recordings, the project is carried out in native countries, and three broad categories are identified for coverage: gender distribution, age distribution and dialectal distribution.
Gender balance
The database will consist of 50% (± 5%) male and 50% (± 5%) female speakers.
Age distribution
For this project, speech data will be collected in the following age categories:
| Age group | Minimum # speakers (%) |
| 18 – 30 years | 30% |
| 31 – 45 years | 20% |
| 46 – 65 years | 15% |
Speakers older than 65 or younger than 18 are optional.
Dialectal regions
The dialectal regions of each language are carefully identified, and the number of speakers per dialectal region is balanced to reflect the number of inhabitants of that region.
Recording Conditions
All speakers were recorded in three real driving sessions, each with its own speed and noise condition:
| Sort | Speed | Recording Sessions | Noise Condition |
| a | 0 km/h | Stop, motor running | Climate control off, windows closed. |
| b | 40-80 km/h | City road | With noise of climate control, windows open or wiper on. |
| c | 80-120 km/h | Highway | With noise of radio on, air conditioning on, etc. |
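The gender-balance and age-group targets given under "Speaker's demographic information" can be checked mechanically against a speaker list. The sketch below is one possible way to do so in Python. The `(gender, age)` tuple format and the function name `check_speakers` are assumptions made for illustration; the actual speaker metadata format of the corpora is not described here.

```python
from collections import Counter

# Targets taken from the text: 50% (+/- 5%) per gender, and minimum
# shares of speakers per age group.
GENDER_RANGE = (0.45, 0.55)
AGE_MINIMUMS = {(18, 30): 0.30, (31, 45): 0.20, (46, 65): 0.15}


def check_speakers(speakers):
    """Check a list of (gender, age) tuples (hypothetical format).

    Returns a list of human-readable problems; an empty list means all
    targets are met.
    """
    total = len(speakers)
    if total == 0:
        return ["no speakers"]

    problems = []

    # Gender balance: each gender must fall within 45%..55% of all speakers.
    genders = Counter(gender for gender, _ in speakers)
    for gender in ("male", "female"):
        share = genders[gender] / total
        if not GENDER_RANGE[0] <= share <= GENDER_RANGE[1]:
            problems.append(f"{gender} share {share:.0%} outside 45-55%")

    # Age distribution: each listed group must reach its minimum share.
    for (low, high), minimum in AGE_MINIMUMS.items():
        share = sum(low <= age <= high for _, age in speakers) / total
        if share < minimum:
            problems.append(
                f"age {low}-{high} share {share:.0%} below {minimum:.0%}"
            )

    return problems
```

Speakers older than 65 or younger than 18 count toward the total but not toward any age group, which matches their optional status in the text.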
Data transcribing and annotation
All data were transcribed and annotated according to a dedicated set of rules. Each signal file is accompanied by its descriptive information.
For detailed information or samples, please contact us.
Copyright Speech Ocean. All rights reserved.
