Missing data in longitudinal studies: Comparison of multiple imputation methods in a real clinical setting

IRIS

Rationale, aims, and objectives Missing data represent a challenge in longitudinal studies. The aim of the study is to compare the performance of the multivariate normal imputation and the fully conditional specification methods, using real data set with missing data partially completed 2 years later. Method The data used came from an ongoing randomized controlled trial with 5-year follow-up. At a certain time, we observed a number of patients with missing data and a number of patients whose data were unobserved because they were not yet eligible for a given follow-up. Both unobserved and missing data were imputed. The imputed unobserved data were compared with the corresponding real information obtained 2 years later. Results Both imputation methods showed similar performance on the accuracy measures and produced minimally biased estimates. Conclusion Despite the large number of repeated measures with intermittent missing data and the non-normal multivariate distribution of data, both methods performed well and was not possible to determine which was better.

Missing data in longitudinal studies: Comparison of multiple imputation methods in a real clinical setting

Rosato R;Pagano E;Testa S;Zola P;di Cuonzo D

2021-01-01

Abstract

Rationale, aims, and objectives Missing data represent a challenge in longitudinal studies. The aim of the study is to compare the performance of the multivariate normal imputation and the fully conditional specification methods, using real data set with missing data partially completed 2 years later. Method The data used came from an ongoing randomized controlled trial with 5-year follow-up. At a certain time, we observed a number of patients with missing data and a number of patients whose data were unobserved because they were not yet eligible for a given follow-up. Both unobserved and missing data were imputed. The imputed unobserved data were compared with the corresponding real information obtained 2 years later. Results Both imputation methods showed similar performance on the accuracy measures and produced minimally biased estimates. Conclusion Despite the large number of repeated measures with intermittent missing data and the non-normal multivariate distribution of data, both methods performed well and was not possible to determine which was better.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2021
			
	Parole chiave
	
				Fully conditional specification
Missing data
Multivariate normal imputation
Quality of life
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14087/7295

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

social impact