|
Imane Bou-Saboun Department of Linguistics Yale University imane.bou-saboun@yale.edu |
| Participants: | 2 |
| Type of Study: | longitudinal |
| Location: | Italy |
| Media type: | audio |
| DOI: | xxx |
Bou-Saboun, I. (2025). The Acquisition of Wh-Questions in Tashlhiyt Berber: Novel Behavioral and Corpus Studies (Doctoral dissertation, University of Maryland, College Park).
The transcription is made using IPA and it is largely a phonetic one, to avoid making claims about the underlying form.
The mid vowel [e] and the mid-back vowel [o] are not vowel phonemes of Tashlhiyt but they may occur as allophones in pharyngealization contexts. For instance, we opted for transcribing the TB word for ‘no’ as ‘oho’ instead of ‘uhu’. The former is a more accurate phonetic transcription, but ‘uhu’ would be the appropriate underlying form, that reflects TB’s proper set of phonemes.
In this transcription, I chose not to mark pharyngealization on the consonants, but on the vowel level, pharyngealized u and i are rendered o and e. Another note is that I transcribed the vowels as a broad transcription, so why, for instance, would be transcribed as [maʕʕ] instead of [mæʕʕ].
The account of wh-questions followed in the transcript is one where there is an inaudible complementizer [a] following the ma wh-word. This ma a transcription will be found in the %xcod: tier under some of the glossed and non-glossed wh-question instances. This specific convention is crucially not meant to signal what is heard, but rather, what rules underlie wh-question formation in Berber languages, since in all of these instances, the ‘ma-a’ string is pronounced [ma].
Similarly, the one context where the “underlying representation” overrode phonetic realization was that of certain assimilation processes. I attempted as much as possible not to transcribe assimilation as such and maintain word boundaries. For instance, let us consider the question “Whose is it?” that appears in the transcript for the final session: *GRAN: win mit aj? %gls: Of Who DEM? %eng: Whose is it?
It is actually pronounced as [wimmitaj], as can be ascertained from the linked audio file. An alternate possible transcription could be to just show both the assimilation process in the main utterance line and do the segmentation in an %xcod: line, but the workaround reported here was intended for time efficiency.
Improvements and corrections of the transcript on the part of the community are highly encouraged. The goal of making this transcription available in the current state is for improvements to be made by the community of speakers of Tashlhiyt Berbers since it was not possible to recruit people to do the double-checking during the corpus construction process. The transcription was made to the best of the transcriber’s knowledge, drawing from the Berber language literature as much as possible. Negation, for instance, is rendered as “ur”, following Mettouchi 2009, Bensoukas 2009 inter alia. However, the adverbial “sur” is rendered with a word-final [r], despite it being pronounced as [sul] and reported as [sul] in some documentation work from the region, because that is how it is pronounced by the speakers of the dialect here represented in the corpus. All inconsistencies are mine.
Grammar categories At the moment, no MOR grammar or UD have been created for Tashlhiyt. Here are some of the conventions used for morphological analysis throughout the corpus. %com: tier is used as a mode both to label the type of construction in %the independent tier and also as a comment on the context in which the %independent tier utterance was produced. This is also used to give %context that is not available in the video.
The acronyms in the glosses are generally adapted from the Leipzig glossing rules, plus Berber-specific additions, specified below.
Acronym in gloss or comment tier What it stands for Description AAE Anti-agreement effect A suffix that shows up on the verb when the subject is fronted in wh-questions, relative clauses and focus clefts CS Construct state Special marking on subjects in post-verbal position FS Free State ‘Elsewhere’ marking for NPs in TB
Tashlhiyt Child Forms: These are variants of certain strings that were used by the children first and then incorporated into the family’s language, sometimes used in adult-directed speech as well. Child form Standard TB Translation skukkur/skokkor smuqqul/smoqqol Look otto ʕmmtˤu/ʕmmto Aunt (on paternal side) namnam tirmt food ʕammʕamm tirmt food diddi ma-d-udnh That which hurts/booboo baħħa ma-d-udnh That which hurts/booboo baski@c - Bike Ninni gʷin Sleep Nuhnnu/nohnno gʷin Sleep tiwtiw Tiglaj/taglajt eggs Mummu/mommo rrosom (borrowing from Arabic word for drawings) cartoons
I want to thank my students who chose to sign up and attend my advanced course on the acquisition of understudied languages (LING419M) for all the discussions and for really sharing your minds and wonderfully daring ideas and opinions with me. In alphabetical order: Sofia Bendana, Adelaide Bouthet, Tzipporah Harker, Eli Herbst, Takuya Kameyama, Madeline Keen, Alexa Kolosey, Sara Riso.
I would like to thank the Contact and Documentation reading group at Yale University for welcoming me and for the great discussions about the data here presented during the first year of my postdoc.
Many thanks also go to researchers from other universities whose advice improved the execution of this project. Thank you to Athulya Aravind, Karim Bensoukas, Claire Bowern, Abdellah Elouatiq, Dalila Dehbia Gaoua, Jenia Gutova, Mohamed Lahrouchi, Brian MacWhinney and Sophie Pierson.
Although the original names of the two children are kept in the transcripts, please only refer to the names of the children through pseudonyms when reporting the results. The original names are kept in the transcript due to some morpho-phonological phenomena such as the construct state that surface in nouns that start in A-, such as the name of one of the girls.
Corpus-specific correspondence through: tashlhiytberbercorpus@gmail.com