Events Conference on Foundations and Advances of Machine Learning in Official Statistics, 3rd to 5th April, 2024

Session 1.2 Data Validation and Imputation

ML-Based Imputation Methods in R Package VIM: Performance and Considerations

Alexander Kowarik* 1, Johannes Gussenbauer1, Nina Niederhametner1

Abstract

The R package VIM (Visualization and Imputation of Missing Values) has incorporated machine learning (ML)-based imputation methods, including xgboost and GPT models. This presentation will elucidate the recent advancements in VIM, with a special emphasis on the performance of these ML models in handling missing data.

*: Speaker

1: Statistics Austria