TY - JOUR
T1 - Gap-filling missing data in eddy covariance measurements using multiple imputation (MI) for annual estimations
AU - Hui, Dafeng
AU - Wan, Shiqiang
AU - Su, Bo
AU - Katul, Gabriel
AU - Monson, Russell
AU - Luo, Yiqi
N1 - Funding Information:
This study was supported by the NSF/DOE/NASA/USDA/EPA/NOAA Interagency Program on Terrestrial Ecology and Global Change (TECO) by DOE under DE-FG03-99ER62800 to YL at the University of Oklahoma and the NIGEC SouthCentral Regional Center at Tulane University. Eddy-covariance flux measurements were supported by the NIGEC Southeastern Regional Center at the University of Alabama, Tascaloosa (DOE Cooperative Agreement DE-FC03-90ER61010), the NIGEC SouthCentral Regional Center at Tulane University, and the Terrestrial Carbon Processes (TCP) program at DOE and the US Department of Energy (Terrestrial Carbon Program) and NASA/GEWEX. We thank Dr. Dennis Baldocchi and Dr. Kell Wilson for providing eddy-covariance flux measurements at WBW site.
PY - 2004/1/20
Y1 - 2004/1/20
N2 - Missing data is a ubiquitous problem in evaluating long-term experimental measurements, such as those associated with the FluxNet project, due to the equipment failures, system maintenance, power-failure, and lightning strikes among other things. To estimate annual values of net ecosystem carbon exchange (NEE), latent heat flux (LE) and sensible heat flux (H), such gaps in the measured data must be filled or imputed. So far, no standardized method has been accepted and the imputation methods used are largely dependent on the researchers' choice. Here, we used multiple imputation (MI) to gap-fill the missing data for annual estimations of NEE, LE and H at three flux sites associated with the FluxNet effort. MI is a Monte Carlo technique in which the missing values are replaced by several simulated values. Each data set imputed is a complete one where the observed values are the same as those in the original data set; only the missing values are different. Thus, the normal statistical analysis (e.g. annual total calculation) can be applied to each data set separately. The results of each analysis can be recombined into one summary. We applied the MI method to eddy covariance measurements collected from Walker Branch Watershed (WBW) site (a deciduous forest), Duke site (a coniferous forest) and Niwot site (a subalpine forest). Results showed that annual estimations of NEE, LE and H by MI were comparable to other imputation methods but MI was much easier to apply because of readily available software and standard algorithms. Besides the normal statistical analyses, MI also provided confidence intervals for each estimated parameter. This confidence interval is most useful when assessing energy, water, and carbon balance closures at a given tower site. Significant differences in annual NEE, LE and H were found among years at the three AmeriFlux sites. NEE at the Niwot Ridge site was lower and LE and H were higher than at the other two sites. With the available software and realistic gap-filling capability, MI has the potential to become a standardized method to gap-fill eddy covariance flux data for annual estimations and to improve the analysis of uncertainties associated with annual estimations of NEE, LE and H from regional and global flux networks.
AB - Missing data is a ubiquitous problem in evaluating long-term experimental measurements, such as those associated with the FluxNet project, due to the equipment failures, system maintenance, power-failure, and lightning strikes among other things. To estimate annual values of net ecosystem carbon exchange (NEE), latent heat flux (LE) and sensible heat flux (H), such gaps in the measured data must be filled or imputed. So far, no standardized method has been accepted and the imputation methods used are largely dependent on the researchers' choice. Here, we used multiple imputation (MI) to gap-fill the missing data for annual estimations of NEE, LE and H at three flux sites associated with the FluxNet effort. MI is a Monte Carlo technique in which the missing values are replaced by several simulated values. Each data set imputed is a complete one where the observed values are the same as those in the original data set; only the missing values are different. Thus, the normal statistical analysis (e.g. annual total calculation) can be applied to each data set separately. The results of each analysis can be recombined into one summary. We applied the MI method to eddy covariance measurements collected from Walker Branch Watershed (WBW) site (a deciduous forest), Duke site (a coniferous forest) and Niwot site (a subalpine forest). Results showed that annual estimations of NEE, LE and H by MI were comparable to other imputation methods but MI was much easier to apply because of readily available software and standard algorithms. Besides the normal statistical analyses, MI also provided confidence intervals for each estimated parameter. This confidence interval is most useful when assessing energy, water, and carbon balance closures at a given tower site. Significant differences in annual NEE, LE and H were found among years at the three AmeriFlux sites. NEE at the Niwot Ridge site was lower and LE and H were higher than at the other two sites. With the available software and realistic gap-filling capability, MI has the potential to become a standardized method to gap-fill eddy covariance flux data for annual estimations and to improve the analysis of uncertainties associated with annual estimations of NEE, LE and H from regional and global flux networks.
KW - Eddy covariance
KW - Latent heat
KW - Missing data
KW - Multiple imputation
KW - Net ecosystem carbon exchange
KW - Regression analysis
KW - Sensible heat
UR - http://www.scopus.com/inward/record.url?scp=0346040111&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0346040111&partnerID=8YFLogxK
U2 - 10.1016/S0168-1923(03)00158-8
DO - 10.1016/S0168-1923(03)00158-8
M3 - Article
AN - SCOPUS:0346040111
SN - 0168-1923
VL - 121
SP - 93
EP - 111
JO - Agricultural and Forest Meteorology
JF - Agricultural and Forest Meteorology
IS - 1-2
ER -