TY - JOUR
T1 - Bootstrapping in Applied Linguistics
T2 - Assessing its Potential Using Shared Data
AU - Plonsky, Luke
AU - Egbert, Jesse
AU - Laflair, Geoffrey T.
N1 - Publisher Copyright:
© 2014 Oxford University Press 2014.
PY - 2015/12/1
Y1 - 2015/12/1
N2 - Parametric analyses such as t tests and ANOVAs are the norm - if not the default - statistical tests found in quantitative applied linguistics research (Gass 2009). Applied statisticians and one applied linguist (Larson-Hall 2010, 2012; Larson-Hall and Herrington 2010), however, have argued that this approach may not be appropriate for small samples and/or nonnormally distributed data (e.g. Wilcox 2003), both common in second language (L2) research. They recommend instead 'robust statistics' such as bootstrapping, a nonparametric procedure that randomly resamples from an observed data set to produce a simulated but more stable and statistically accurate outcome. The present study tests the usefulness of bootstrapping by reanalyzing raw data from 26 studies of applied linguistics research. Our results found no evidence of Type II error (false negative). However, 4 out of 16 statistically significant results were not replicated (i.e. a Type I error 'misfit' five times higher than an alpha of. 05). We discuss empirically justified suggestions for the use of bootstrapping in the context of broader methodological issues and reforms in applied linguistics (see Plonsky 2013, 2014).
AB - Parametric analyses such as t tests and ANOVAs are the norm - if not the default - statistical tests found in quantitative applied linguistics research (Gass 2009). Applied statisticians and one applied linguist (Larson-Hall 2010, 2012; Larson-Hall and Herrington 2010), however, have argued that this approach may not be appropriate for small samples and/or nonnormally distributed data (e.g. Wilcox 2003), both common in second language (L2) research. They recommend instead 'robust statistics' such as bootstrapping, a nonparametric procedure that randomly resamples from an observed data set to produce a simulated but more stable and statistically accurate outcome. The present study tests the usefulness of bootstrapping by reanalyzing raw data from 26 studies of applied linguistics research. Our results found no evidence of Type II error (false negative). However, 4 out of 16 statistically significant results were not replicated (i.e. a Type I error 'misfit' five times higher than an alpha of. 05). We discuss empirically justified suggestions for the use of bootstrapping in the context of broader methodological issues and reforms in applied linguistics (see Plonsky 2013, 2014).
UR - http://www.scopus.com/inward/record.url?scp=84894096224&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84894096224&partnerID=8YFLogxK
U2 - 10.1093/applin/amu001
DO - 10.1093/applin/amu001
M3 - Article
AN - SCOPUS:84894096224
SN - 0142-6001
VL - 36
SP - 591
EP - 610
JO - Applied Linguistics
JF - Applied Linguistics
IS - 5
ER -