International Journal of Statistics in Medical Research

Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study
Pages 287-295
Yang Liu and Anindya De
DOI:
http://dx.doi.org/10.6000/1929-6029.2015.04.03.7
Published: 19 August 2015


Abstract: Missing data commonly occur in large epidemiologic studies. Ignoring incompleteness or handling the data inappropriately may bias study results, reduce power and efficiency, and alter important risk/benefit relationships. Standard ways of dealing with missing values, such as complete case analysis (CCA), are generally inappropriate due to the loss of precision and risk of bias. Multiple imputation by fully conditional specification (FCS MI) is a powerful and statistically valid method for creating imputations in large data sets which include both categorical and continuous variables. It specifies the multivariate imputation model on a variable-by-variable basis and offers a principled yet flexible method of addressing missing data, which is particularly useful for large data sets with complex data structures. However, FCS MI is still rarely used in epidemiology, and few practical resources exist to guide researchers in the implementation of this technique. We demonstrate the application of FCS MI in support of a large epidemiologic study evaluating national blood utilization patterns in a sub-Saharan African country. A number of practical tips and guidelines for implementing FCS MI based on this experience are described.

Keywords: Missing data, multiple imputation, fully conditional specification, complete case analysis, blood utilization.
Download Full Article
Submit to FacebookSubmit to TwitterSubmit to LinkedIn