|
Mark Johnson Macquarie University |
Mark Johnson contributed this corpus of CDS (child-directed speech) from the CHILDES Sesotho corpus. The goal of the corpus was to train an automatic segmenter. The available materials in this .zip file include the Python script that can be run on the Sesotho corpus, along with the output in the form of sentences of child directed speech (CDS).