Good practices in the compilation of FOLK, the Research and Teaching Corpus of Spoken German
- 19 September 2016
- journal article
- Published by John Benjamins Publishing Company in Corpus Studies of Language Through Time
- Vol. 21 (3), 396-418
- https://doi.org/10.1075/ijcl.21.3.05sch
Abstract
This paper presents practices in the compilation of FOLK, the Research and Teaching Corpus of Spoken German, a large collection of spontaneous verbal interaction from diverse discourse domains. After introducing the aims and organisational circumstances of the construction of FOLK, the general idea discussed is that good practices cannot be developed without considering methodological, technological and organisational aspects on equal footing. Starting from this idea, this paper inspects more closely some actual practices in FOLK, namely the handling of legal (especially privacy protection) issues, the decisions taken for the transcription and annotation workflow, and the question of how to best disseminate a corpus like FOLK. The final section sketches some possible future improvements for practices in FOLK.Keywords
This publication has 11 references indexed in Scilit:
- Some current transcription systems for spoken discourse: A critical analysisPragmatics, 2022
- Are transcripts reproducible?Pragmatics, 2022
- Schriftliche und mündliche Korpora am IDS als Grundlage für die empirische ForschungPublished by Walter de Gruyter GmbH ,2015
- 2. Grundeinheiten der Sprache und des SprechensPublished by Walter de Gruyter GmbH ,2015
- The GeWiss corpusPublished by John Benjamins Publishing Company ,2012
- Was gehört in ein nationales Gesprächskorpus? Kriterien, Probleme und Prioritäten der Stratifikation des „Forschungs- und Lehrkorpus Gesprochenes Deutsch“ (FOLK) am Institut für Deutsche Sprache (Mannheim)Published by Walter de Gruyter GmbH ,2011
- Data structures for the analysis of regional language variationPublished by Walter de Gruyter GmbH ,2008
- Accessing the spoken wordInternational Journal on Digital Libraries, 2005
- Seven Dimensions of Portability for Language Documentation and DescriptionLanguage, 2003
- A formal framework for linguistic annotationSpeech Communication, 2001