L3HK Corpus


Xing Kang
Chinese University of Hong Kong

Participants: 906
Type of Study: frog story
Location: Hong Kong
Media type: audio
DOI:

Browsable transcripts

Download transcripts

Media folder

Citation information

Kang, X., Yip, V., Matthews, S., & Wong, P. C. (2023). A large-scale repository of spoken narratives in French, German and Spanish from Cantonese-speaking learners. Scientific data, 10(1), 183.

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by the above reference.

Project Description

See this article for full information and this table for full demographics.

The original CHAT transcripts are in this .zip file. The German version passes CHECK but it would take several hours to get the French and Spanish through CHECK and Chatter. The transcripts on the web were created by ASR and have not be checked or cleaned up yet.