TY - JOUR
T1 - Exploring potential unknown subgroups in your data
T2 - An introduction to finite mixture models for applied linguistics
AU - Larsson, Tove
AU - Hancock, Gregory R.
N1 - Publisher Copyright:
© 2024 Elsevier Ltd
PY - 2024/8
Y1 - 2024/8
N2 - This article provides an introduction to finite mixture models in an applied linguistics context. Mixture models can be used to address questions relating to whether there are unknown subgroups in one's data, and if so, which participants/texts are likely to belong to which subgroup. Put differently, the technique enables us to assess whether our data might come from a heterogeneous population that is made up of latent classes. As such, mixture models offer a model-based framework to answer research questions for which the field previously has either attempted to use nonparametric heuristic techniques (e.g., cluster analysis) or has left entirely unanswered. An example of such research questions would be, ‘Does the treatment work equally well for all the participants, or are there unknown subgroups in the data that respond differently to the treatment?’ The article starts by introducing univariate mixture models and then broadens the scope to cover bivariate and multivariate mixture models. It also discusses some known pitfalls of the technique and how one might ameliorate these in practice.
AB - This article provides an introduction to finite mixture models in an applied linguistics context. Mixture models can be used to address questions relating to whether there are unknown subgroups in one's data, and if so, which participants/texts are likely to belong to which subgroup. Put differently, the technique enables us to assess whether our data might come from a heterogeneous population that is made up of latent classes. As such, mixture models offer a model-based framework to answer research questions for which the field previously has either attempted to use nonparametric heuristic techniques (e.g., cluster analysis) or has left entirely unanswered. An example of such research questions would be, ‘Does the treatment work equally well for all the participants, or are there unknown subgroups in the data that respond differently to the treatment?’ The article starts by introducing univariate mixture models and then broadens the scope to cover bivariate and multivariate mixture models. It also discusses some known pitfalls of the technique and how one might ameliorate these in practice.
KW - Data heterogeneity
KW - Latent classes
KW - Mixture modeling
KW - Population subgroups
KW - Underlying groupings
UR - http://www.scopus.com/inward/record.url?scp=85193821655&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85193821655&partnerID=8YFLogxK
U2 - 10.1016/j.rmal.2024.100117
DO - 10.1016/j.rmal.2024.100117
M3 - Article
AN - SCOPUS:85193821655
SN - 2772-7661
VL - 3
JO - Research Methods in Applied Linguistics
JF - Research Methods in Applied Linguistics
IS - 2
M1 - 100117
ER -