| ID |
| King-ASR-137 |
|
| Name |
| Korean Speech Recognition Database—(Mobile)--1000 speakers |
|
| Author |
| Speechocean |
|
| Language |
| Korean |
|
| Type |
| ASR |
|
| Sub-type |
| Mobile |
|
| Environment |
| quiet(office), noisy(outside) |
|
| Parameters |
| Sampling rate:16K,16bit; Channels:One Channel; Pure Recording Hours:45; |
|
| labeling |
| All the data has been transcribed and annotated precisely. |
|
| Resources purpose |
| The database was made for the tuning and testing purpose of speech recognition system- for IVR / mobile. |
|
The Korean mobile speech Recognition database which was collected in Korea, contains the voices of 1000 different native speakers (500±5%males, 500±5% females) who were balanced according to age (mainly 16 – 30,31 – 45,46 – 60), Gender and regional accents (for the details, please see the technical document).
The script was specially designed to provide material for both training and testing of many classes of speech recognizers which contain 15 categories and 35 sub-categories for each speaker (for the detail script structure design, please see the technical document).
Each speaker has recorded 300 utterances under two environments, one in a quiet session (Office/Home) and one in a noisy session (Garden/roadside/restaurant/bus).
Each speaker has recorded 150 utterances and spontaneous sentences per session and totally 300 utterances were recorded by each speaker.
Popular mobiles in this country were used for collecting this data such as Samsung, Nokia, HTC, etc. The speech data is stored as sequences of 16 kHz, 16 bit and uncompressed.
Each utterance is stored in a separate file and each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information.
A pronunciation lexicon with a phonemic transcription is also included.
All the data was transcribed and labeled.