David Oakey (University of Birmingham)

An isotextual approach to comparisons of lexical bundles across disciplines


In the decade since its initial description by Biber et al. (1999), the lexical bundle has been widely used as a phraseological unit for comparing corpora that represent different registers, such as conversation and academic prose (Biber et al. 2004). In addition to comparisons of the form and structure of lexical bundles, further work has focused on their different discourse functions (Cortes 2004; Biber 2006; Hyland 2008), particularly in written academic genres.

This paper attempts to show that the original definition of a lexical bundle (a fixed string of three or more words which occurs more than 10 times per million words) is problematic in the case of lexical bundles which have been assigned discourse functions. It first discusses methodological issues relating to the construction of comparative written academic corpora and proposes a distinction between two kinds of comparison: isolexical comparisons, in which the subcorpora being compared contain a similar number of tokens, and isotextual comparisons, in which the subcorpora being compared contain a similar number of texts. It then presents a comparison of lexical bundle frequencies between isolexical and isotextual subcorpora of research articles in different disciplines. The results of this study suggest that isotextual comparisons reveal more than isolexical comparisons about the discourse functions of lexical bundles in research articles.
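
The frequency threshold and the two sampling schemes described above can be illustrated with a minimal Python sketch. It is not the procedure used in the study: the function names (`bundle_rates`, `lexical_bundles`, `isolexical_sample`, `isotextual_sample`), the whitespace tokenisation, and the in-order selection of texts are all assumptions made for illustration. Only the threshold of more than 10 occurrences per million words and the minimum length of three words come from the definition quoted above.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return every contiguous n-word sequence in a list of tokens."""
    return list(zip(*(tokens[i:] for i in range(n))))


def bundle_rates(texts, n=3):
    """Count n-grams over a subcorpus.

    Returns (rates, text_range): rates maps each n-gram to its frequency
    per million words; text_range maps it to the number of texts it is in.
    Tokenisation by lowercasing and whitespace is a simplifying assumption.
    """
    counts, text_range, total = Counter(), Counter(), 0
    for text in texts:
        tokens = text.lower().split()
        total += len(tokens)
        grams = ngrams(tokens, n)
        counts.update(grams)
        text_range.update(set(grams))  # each text counts once per n-gram
    if total == 0:
        return {}, text_range
    rates = {g: c * 1_000_000 / total for g, c in counts.items()}
    return rates, text_range


def lexical_bundles(texts, n=3, threshold=10.0):
    """Keep n-grams occurring more than `threshold` times per million words."""
    rates, text_range = bundle_rates(texts, n)
    return {" ".join(g): (rate, text_range[g])
            for g, rate in rates.items() if rate > threshold}


def isolexical_sample(texts, token_budget):
    """Hypothetical isolexical subcorpus: add texts until the token budget is met."""
    sample, total = [], 0
    for text in texts:
        if total >= token_budget:
            break
        sample.append(text)
        total += len(text.split())
    return sample


def isotextual_sample(texts, n_texts):
    """Hypothetical isotextual subcorpus: take a fixed number of texts."""
    return texts[:n_texts]


if __name__ == "__main__":
    # Toy stand-ins for two disciplines whose articles differ in length.
    long_articles = ["on the other hand the results show that"] * 3
    short_articles = ["on the other hand"] * 6
    for name, corpus in (("long", long_articles), ("short", short_articles)):
        iso_lex = isolexical_sample(corpus, 12)
        iso_txt = isotextual_sample(corpus, 2)
        print(name, len(iso_lex), "texts (isolexical),",
              sum(len(t.split()) for t in iso_txt), "tokens (isotextual)")
```

The sketch makes the trade-off visible. Matching subcorpora on tokens gives them different numbers of texts when article length varies by discipline, and matching them on texts gives them different numbers of tokens. A per-million-word rate therefore rests on a different number of texts in each design.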


