We expect that researchers will contribute corpora constructed with TalkBank programs and tools. It is the obligation of TalkBank and TalkBank users to assure that these contributions are properly acknowledged and cited and that the data are correctly stored and distributed.
To contribute a new data set to the Discourse PsychosisBank:
For PsychosisBank contributions, please write an email message to
discourseinpsychosis@gmail.com, Brian MacWhinney (macw@cmu.edu) and
Lena Palaniyappan (lena.palaniyappan@mcgill.ca) describing your
contribution.
Please complete this contribution form , scan it, and
include the scan in the upload.
Both audio files and transcriptions are welcomed. If you have
transcripts, please be aware that TalkBank uses CHAT files, which must
pass CLAN's check program.
However, it is also possible to contribute transcripts with a different
format. Please specify the format of your transcriptions in your
contribution description and we will work to find the best solution to
upload your data.
If you happen to have transcripts in CHAT format, please note that
TalkBank uses a strict system for matching transcripts to media. This
requires that each transcript align with only one media file and that
the names of the transcript file and the media file be the same
(ignoring the extensions). For example, the file 020456.cha must have a
matching 020456.mp4 (or .wav or .mp3) media file. In addition, the
@Media line in the *.cha file should use the name of the media which
matches the name of the transcript. In general, please try to use short
file names to make processing easier. Information already provided in
folder names and the @ID lines does not need to be duplicated in file
names.
Please combine your audios, transcripts and documentation files
into a single .zip file and send that file as an encrypted email
attachment to Brian MacWhinney (macw@cmu.edu).
Documentation should include information for a web page, such as this one .
Our recommendations for media formats are given in section 5.1 of
the CLAN manual. Audio should be WAV and we can then create MP3 files
from the WAV.
Because audio files are usually too large to send through email, you will need to transfer them through WeTransfer, following these steps:
You will have to check a box saying that you agree to their terms and conditions. You will only have to do this once.
In the box on the left, enter macw@cmu.edu as "email to".
Enter your email as "your email" and add a message, if you wish.
Then click the "Plus" icon in the upper right.
Drag the files you wish to transfer into the "Add Your Files" window or click the plus to locate them.
Click the "Transfer" button and watch the transfer going through. You can do other work on your computer during this time.
The WeTransferPlus facility then sends us an email advising us when the transfer is complete. Please use only WeTransfer and not Dropbox, Box, or Google Drive for file transfer.
Once everything is in place, we will create a webpage for your corpus, like this one along with a DOI number, and we will announce the addition of the new corpus to the PsychosisBank list.
Please remember to cite any corpus using the APA format.
We are very thankful for the kindness and collegiality you are showing in contributing your hard-won data.
Guidelines for corpus documentation are given in section 4.5 of the CHAT manual .