Introducing and comparing two techniques for key lexical bundles analysis

Research output: Contribution to journal › Article › peer-review

1 Scopus citation

Abstract

Multiword units, specifically lexical bundles, have been found to be important building blocks in language production and processing. We also know that using the text rather than the full corpus as the unit of analysis increases the linguistic validity of the results, given that written language is produced through texts (e.g., Egbert & Biber, 2019). However, researchers wishing to look at which bundles are characteristic of, or key to, a population (e.g., students from a specific first-language background) are currently out of luck if they are interested in using the text as the unit of analysis. The present paper introduces two methods designed for looking at key lexical bundles using texts as the unit of analysis: text dispersion keyness and mean text frequency keyness. We subsequently compare the results from these methods to existing whole-corpus frequency keyness. The results show that the techniques produce similar lists, but that mean text frequency keyness produced the largest number of content generalizable bundles (i.e., bundles that can be generalized across texts in the corpus). By contrast, text dispersion keyness helped us obtain the largest number of content distinctive bundles (i.e., bundles that clearly distinguish the target corpus from the reference corpus). Text dispersion keyness also produced the highest number of bundles that were both content generalizable and distinctive. Researchers may therefore wish to make a choice among these methods based on the objectives of their analysis.
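
The abstract names the two text-based measures but does not give their formulas, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that text dispersion keyness compares the number of texts containing a bundle in the target and reference corpora with a two-cell log-likelihood statistic, and that mean text frequency keyness compares the mean per-text rate of a bundle (per 1,000 words) across the two corpora; here that comparison is a simple difference of means. The function names, the choice of statistic, and the normalization basis are all hypothetical.

```python
import math
from collections import Counter


def bundles(tokens, n=4):
    """Return every contiguous n-word sequence in one tokenized text."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def text_stats(texts, n=4):
    """Collect per-bundle statistics with the text as the unit of analysis.

    Returns two Counters keyed by bundle:
      text_range: number of texts that contain the bundle at least once
      rate_sum:   sum over texts of the bundle's rate per 1,000 words
    """
    text_range, rate_sum = Counter(), Counter()
    for tokens in texts:
        for bundle, count in Counter(bundles(tokens, n)).items():
            text_range[bundle] += 1
            rate_sum[bundle] += 1000 * count / len(tokens)
    return text_range, rate_sum


def log_likelihood(a, b, n1, n2):
    """Two-cell log-likelihood for a hits out of n1 versus b hits out of n2."""
    expected_1 = n1 * (a + b) / (n1 + n2)
    expected_2 = n2 * (a + b) / (n1 + n2)
    total = 0.0
    if a > 0:
        total += a * math.log(a / expected_1)
    if b > 0:
        total += b * math.log(b / expected_2)
    return 2 * total


def key_bundle_scores(target, reference, n=4):
    """Score each bundle found in the target corpus.

    Returns {bundle: (dispersion_ll, mean_rate_diff)} where
      dispersion_ll  is the log-likelihood on text counts (assumed statistic)
      mean_rate_diff is target mean rate minus reference mean rate
                     (assumed comparison; the paper may use another statistic)
    """
    range_t, rate_t = text_stats(target, n)
    range_r, rate_r = text_stats(reference, n)
    scores = {}
    for bundle in range_t:
        dispersion_ll = log_likelihood(
            range_t[bundle], range_r[bundle], len(target), len(reference)
        )
        mean_rate_diff = (
            rate_t[bundle] / len(target) - rate_r[bundle] / len(reference)
        )
        scores[bundle] = (dispersion_ll, mean_rate_diff)
    return scores
```

Each corpus is passed as a list of tokenized texts, for example `key_bundle_scores(target_texts, reference_texts, n=4)`. The log-likelihood value is unsigned, so in this sketch the direction of an effect has to be read from the text counts or from the sign of the mean rate difference.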

Original language: English (US)
Article number: 100245
Journal: Research Methods in Applied Linguistics
Volume: 4
Issue number: 3
State: Published - Dec 2025

Keywords

  • Keyness
  • Lexical bundles
  • Multiword units
  • Register variation
  • Text dispersion

ASJC Scopus subject areas

  • Social Sciences (miscellaneous)
  • Linguistics and Language
