CallHome - Japanese Corpus


Participants: 120
Type of Study: phone call
Location: United States
Media type: audio
DOI: doi:10.21415/T5H59V

Browsable transcripts

Download transcripts

Media folder

Citation information

Some citation here.

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

Project Description

This is the Japanese portion of CallHome.

Speakers were solicited by the LDC to participate in this telephone speech collection effort via the internet, publications (advertisements), and personal contacts. A total of 200 call originators were found, each of whom placed a telephone call via a toll-free robot operator maintained by the LDC. Access to the robot operator was possible via a unique Personal Identification Number (PIN) issued by the recruiting staff at the LDC when the caller enrolled in the project. The participants were made aware that their telephone call would be recorded, as were the call recipients. The call was allowed only if both parties agreed to being recorded. Each caller was allowed to talk up to 30 minutes. Upon successful completion of the call, the caller was paid $20 (in addition to making a free long-distance telephone call). Each caller was allowed to place only one telephone call.

Although the goal of the call collection effort was to have unique speakers in all calls, a handful of repeat speakers are included in the corpus. In all, 200 calls were transcribed. Of these, 80 have been designated as training calls, 20 as development test calls, and 100 as evaluation test calls. For each of the training and development test calls, a contiguous 10-minute region was selected for transcription; for the evaluation test calls, a 5-minute region was transcribed. For the present publication, only 20 of the evaluation test calls are being released; the remaining 80 test calls are being held in reserve for future LVCSR benchmark tests.

After a successful call was completed, a human audit of each telephone call was conducted to verify that the proper language was spoken, to check the quality of the recording, and to select and describe the region to be transcribed. The description of the transcribed region provides information about channel quality, number of speakers, their gender, and other attributes.
FileSexAgeAgePlace
ja_0856
ja_09243816
ja_0930
ja_10123116
ja_1032
ja_1041
ja_104841
ja_10574118
ja_1099
ja_1109
ja_1123
ja_1201
ja_12373721
ja_12633721
ja_127733
ja_12884320
ja_129016
ja_13282914
ja_13692512
ja_13702612
ja_1418
ja_142516
ja_14283016
ja_14613414
ja_15093016
ja_15383314
ja_1541
ja_15423518
ja_15571610
ja_15932614
ja_16042819
ja_1607
ja_16084512
ja_161513
ja_16284014
ja_164212
ja_16674416
ja_17103116
ja_17132417
ja_172512
ja_17316312
ja_17383016
ja_17412715
ja_174918
ja_188916
ja_18992212
ja_19251914
ja_192813
ja_19993116
ja_20045812
ja_2041
ja_20854017
ja_20963617
ja_21111912
ja_21341813
ja_215713
ja_21803616
ja_218818
ja_2199
ja_22042512
ja_22062215
ja_22073112
ja_22083316
ja_2209828
ja_22102814
ja_22126110
ja_22154512
ja_22176515
ja_22184716
ja_22192818
ja_22204322
ja_2222
ja_22242920
ja_22255412
ja_22313120
ja_22342616
ja_22355012
ja_22372916
ja_22392914
ja_22431812
ja_0743
ja_0922
ja_0988
ja_1003
ja_1069
ja_1622
ja_16295414
ja_16702821
ja_16882112
ja_16901911
ja_196716
ja_20353014
ja_2214
ja_22382923
ja_3002
ja_3004
ja_3005
ja_3008
ja_4061
ja_4275
ja_0696
ja_0862
ja_09863216
ja_1005
ja_10723416
ja_15863518
ja_16745416
ja_18321913
ja_18673016
ja_19662716
ja_205314
ja_20744616
ja_21962819
ja_22162518
ja_22233616
ja_223613
ja_22424814
ja_3001
ja_3006
ja_3007