english english chinese chinese
Bookmark and Share
Home > Data Service > Current Project > Speech Data---Car (SDC)

Speech Data---Car (SDC)

The objective of the SDC Program is to develop speech corpora to support in-car automatic speech recognition technology research and development. This project is formed by a series of in-car data collections in various languages conducted in native countries, and the project will increase by request. These corpora were specially designed for the purpose of training and the testing of recent ASR in car applications and will be licensed upon request.

SDC data overview
Languages:
Chinese, Korean, Japanese, Canadian French, France French, Spain Spanish, Mexico Spanish, American Spanish, Italian, German UK English and Russian.
Note: the language can be increased by request. 

General information

Serial Number

Data Names

Speakers

Sound Parameter

Utterances

Prompts/spk

Total

*King-ASR-120

Chinese in car Speech Corpus

1200 people

16 K,16 bit

480

576000

Four Channels

*King-ASR-129

Canadian French in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels

*King-ASR-121

Korean in car Speech Corpus

1000 people

16 K,16 bit

360

360000

Four Channels

*King-ASR-125

Japanese in car Speech Corpus

800 people

16 K,16 bit

360

288000

Four Channels

*King-ASR-132

France French in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels

*King-ASR-135

UK English in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels

*King-ASR-141

Spain Spanish in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels

*King-ASR-144

Mexico Spanish in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels

*King-ASR-145

USA Spanish in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels

*King-ASR-147

Italy Italian in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels

*King-ASR-150

German in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels

*King-ASR-153

Russian in car Speech Corpus

300 people

16 K,16 bit

330

99000

Four Channels


Script Design
All the scripts were specially designed for in-car speech recognition application training and testing purposes.

Recording platforms
The recording tracks are made in Windows XP SP2 by a multi-channel recording system, AUDIOREC. The recording equipment is listed below. (Table 2-1)

VEHICLE

TWO POPULAR CARS WILL USED IN THE COUNTRY

2

Computer

IBM T43

1

Microphone

Shure SM10A / Sennheiser ME104 /AKG Q400

1/1/2

Audio Card

PocketPC 440 MX

1

                                                           Table 2-1 Recording platform

Speaker's demographic information
In order to cover as many speaker specific factors as possible in the database recordings,the project will be performed in native countries and the following three broad categories are identified for coverage: gender distribution, age distribution and dialectal distribution.

   Gender balance
The database will consist of 50% (± 5%) male + 50% (± 5%) female speakers.

   Age distribution
For this project, speech data will be collected in the following age categories:


Age group

Minimum # speakers (%)

18 – 30 years

30 %

31 – 45 years

20%

46 – 65 years

15%


Speakers above 65 or lower than 18 are optional.

    Dialectal regions
The dialectal regions of particular language are carefully identified and a balance between the number of dialectical regions and the number of speakers are specially made to reflect the number of inhabitants of the region for a particular language.

    Recording Conditions
All the speakers were recorded in three real driving sessions with noise conditions.

Sort

Speed

Recording Sessions

Noise Condition

a

0km/h

Stop, Motor running

Climate control off, windows closed.

b

40-80km/h

City road

With noise of climate control, windows open or wiper on.

c

80-120km/h

High way

with noise of radio on, air conditioning on and etc.


    Data transcribing and annotation
All data was transcribed and annotated based on special rules. Each signal file is accompanied by the relevant descriptive information.
For detailed information, please contact us for samples.