博雅讲坛393讲 《国际语料库语言学期刊》主编Michaela Mahlberg教授讲座
讲座题目：CLiC及语言与文学的界限（CLiC and the boundaries of language and literature）
Michaela Mahlberg is Professor of corpus linguistics at the University of Birmingham, UK, where she is also the Director of the Centre for Corpus Research and the Director of Research and Knowledge Transfer for the College of Arts and Law. Michaela is the editor of the International Journal of Corpus Linguistics (John Benjamins) and together with Wolfgang Teubert she edits the book series Corpus and Discourse (Bloomsbury). One of her main areas of research is Dickens’s fiction and the socio-cultural context of the 19th century. Her publications include Corpus Stylistics and Dickens’s Fiction (Routledge, 2013), English General Nouns: a Corpus Theoretical Approach (John Benjamins, 2005) and Text, Discourse and Corpora. Theory and Analysis (Continuum, 2007, co-authored with Michael Hoey, Michael Stubbs and Wolfgang Teubert). Michaela was the Principal Investigator on the AHRC-funded project CLiC Dickens: Characterisation in the representation of speech and body language from a corpus linguistic perspective which led to the development of the CLiC web app.
Corpus linguistic methods open up new avenues for our understanding of literary language. Carter (2004) has argued that the notion of a clear-cut distinction between literary and non-literary language is an unhelpful one. Instead, language use is best described in terms of a cline of literariness. In this paper, I will illustrate corpus linguistic methods that offer new ways of exploring the fuzzy boundaries between literary and non-literary patterns of language use. Focusing on the textual creation of fictional characters, I will look at patterns that account for the presentation of speech in fiction and the description of body language. The CLiC web application (http://clic.bham.ac.uk/) - that has been specifically designed for the study of literary texts - will be used to show how such patterns can be identified and analysed. The most recent version of the CLiC corpora now comprise more than 130 books across four subcorpora: the corpus of Dickens’s Novels, the 19th Century Reference Corpus (19C), the Corpus of 19th Century Children’s Literature (ChiLit) and the Corpus of Additional Requested Texts (ArTs). For all CLiC texts, direct speech and specific places around speech have been marked up (Mahlberg et al. 2016) so that CLiC can run searches across defined textual subsets. The ability to focus on subsets makes it possible to identify specific literary patterns that can then be usefully compared to non-literary language. To illustrate key functionalities of CLiC I will provide examples retrieved with the most recent version of the CLiC KWICgrouper as well as examples where manually added tags support the analysis of concordances lines.
Carter, R. (2004). Language and Creativity. The Art of Common Talk. London: Routledge.
Mahlberg, M., Stockwell, P., Joode, J. de, Smith, C., & O’Donnell, M. B. (2016). CLiC Dickens:
novel uses of concordances for the integration of corpus stylistics and cognitive poetics. Corpora, 11(3), 433–463.